Deep ML

A repository for my solutions to problems on Deep-ML, a site for LeetCode-style questions for machine learning and data science. For each problem, I decided to use either numpy or pure Python, depending on the type signature of the method, i.e. if the method takes in 2 np.arrays, then I use numpy, else Python.

Collections

Note

Collections have duplicate questions.

AlexNet
- Implement ReLU Activation Function
- Simple Convolutional 2D Layer
- Overlapping Max Pooling
- PCA Colour Augmentation
- Dropout Layer
Attention is All You Need
- Implement Self-Attention Mechanism
- Implement Multi-Head Attention
- Implement Masked Self-Attention
- Implement Layer Normalization for Sequence Data
- Positional Encoding Calculator
Deep Learning
1. Linear Algebra
2. Probability and Statistics
3. Optimization Techniques
4. Fundamentals of Neural Networks
  - Softmax Activation Function Implementation
  - Implementation of Log Softmax Function
  - Sigmoid Activation Function
  - Implement ReLU Activation Function
  - Leaky ReLU Activation Function
  - Implement the PReLU Activation Function
  - Single Neuron
  - Implementing a Simple RNN
  - Implement a Long Short-Term Memory (LSTM) Network
  - Simple Convolutional 2D Layer
  - GPT-2 Text Generation
5. Backpropagation
  - Single Neuron with Backpropagation
  - Implementing Basic Autograd Operations
  - Implement a Simple RNN with Backpropagation Through Time (BPTT)
6. LLM
  - Implement Self-Attention Mechanism
  - The Pattern Weaver's Code
  - Positional Encoding Calculator
  - Implement Multi-Head Attention
  - GPT-2 Text Generation
DeepSeek R1
- Implement the GRPO Objective Function
- Group Relative Advantage for GRPO
- KL Divergence Estimator for GRPO
- Pass@k and Majority Voting Evaluation Metrics
- Knowledge Distillation Loss
DenseNet
- Single Neuron with Backpropagation
- Simple Convolutional 2D Layer
- Implement ReLU Activation Function
- Implement a Simple Residual Block with Shortcut Connection
- Implement Global Average Pooling
- Implement Batch Normalization for BCHW Input
- Implement a Batch Dense Block with 2D Convolutions
GPT 243
1. Autograd Engine (Value Class & Backpropagation)
  - Implementing Basic Autograd Operations
  - Single Neuron with Backpropagation
2. Lab: Autograd
  - MNIST: Classification Loss (with Gradient)
3. Tokenization & Embeddings
  - Character-Level Tokenizer (stoi/itos/BOS)
  - Learned Positional Embeddings
4. Lab: Tokenization
  - Build a Tokenizer for Language Modelling
5. Core Building Blocks (Linear, Softmax, RMSNorm)
6. Lab: Build a Neural Network from Scratch
  - MNIST: Build Neural NEtwork from Scratch (numpy Only)
7. Multi-Head Attention & KV Cache
  - Implement Self-Attention Mechanism
  - Implement Masked Self-Attention
  - Implement Multi-Head Attention
  - KV Cache for Efficient Autogregressive Attention
8. Lab: Attention
  - Design Your Own Attention Mechanism
9. Transformer Block (Residuals, MLP, Activations)
  - Implement a Simple Residual Block with Shortcut Connection
  - Implement Position-wise Feed-Forwards Block with Residual and Dropout
  - Implement the Square ReLU Activation Function
10. Lab: Activation Function
  - Design Your Own Activation Function
11. Loss Functions & Cross-Entropy
  - Compute Multi-class Cross-Entropy Loss
  - Implementation of Log Softmax Function
12. Adam Optimizer & Learning Rate Schedule
  - Implement Adam Optimization Algorithm
  - Linear Learning Rate Decay
13. Lab: Optimizer
  - Design Your Own Optimizer (numpy)
14. Training Loop (Putting It All Together)
  - Calculate Number of Parameters in Neural Network
15. Lab: Full Training Loop
  - Build a Digit Classifier from Scratch
16. Inference & Text Generation
  - Temperature Sampling
LLM Evaluation Methods
1. Multiple Choice Benchmarks
  - MMLU Letter-Matching Evaluation
  - MMLU Log-Probability Scoring
2. Verifier-Based Evaluation
  - Boxed Answer Extraction for Math Benchmarks
  - Math Answer Verification with Equivalence Checking
  - Code Execution Verifier for Programming Benchmarks
3. Preference Leaderboards
  - Elo Rating System for Model Comparison
  - Bradley-Terry Model for Pairwise Rankings
4. LLM-as-a-Judge
  - Rubric-Based LLM Judge Evaluation
  - Pairwise Preference Judge for LLM Comparison
5. Other Measures Mentioned in the Post
  - BLEU Score for Text Generation
  - Calculate PReplexity for Language Models
  - Compute Multi-class Cross-Entropy Loss
Linear Algebra
Machine Learning
1. Linear Algebra
2. Probability and Statistics
3. Optimization
4. Model Evaluation
5. Classification & Regression Techniques
  - Linear Regression Using Normal Equation
  - Linear Regression Using Gradient Descent
  - Binary Classification with Logistic Regression
  - Calculate Jaccard Index for Binary Classification
  - Pegasos Kernel SVM Implementation
  - Implement AdaBoost Fit Method
  - Softmax Activation Function Implementation
6. Unsupervised Learning
7. Deep Learning
Metadata Normalization (MDN)
1. Mathematical Prerequisites
  - Linear Regression Using Normal Equation
  - Implement Orthogonal Projection of a Vector onto a Line
2. Normalization Baselines (What MDN Improves Upon)
  - Implement Batch Normalization for BCHW Input
  - Implement Group Normalization
3. Core MDN Concepts
  - Implement Code MDN Residualization
  - Distance Correlation for Measuring Metadata Dependence
4. Advanced MDN (Handling Confounding)
  - MDN with Label Collinearity Control
5. Lab
  - Feature Deconfounder for Biased Image Data
ResNet
- Single Neuron with Backpropagation
- Simple Convolutional 2D Layer
- Implement ReLU Activation Function
- Implement a Simple Residual Block with Shortcut Connection
- Implement Global Average Pooling
- Implement Batch Normalization for BCHW Input
Sparsely Gated MoE
- Softmax Activation Function Implementation
- Single Neuron
- Calculate Computational Efficiency of MoE
- Implement Noisy Top-K Gating Function
- Implement a Sparse Mixture of Experts Layer
Data Science I Interview Prep
1. Core Machine Learning Concepts
2. Data Processing
3. Deep Learning
4. Model Evaluation & Metrics
Essense of Linear Algebra
1. Vectors
  - Scalar Multiplication of a Matrix
2. Linear Combinations
  - Compute Orthonormal Basis for 2D Vectors
3. Linear Transformations
4. Matrix Multiplication
  - Matrix times Matrix
5. Determinant
  - Determinant of a 4x4 Matrix using Laplace's Expansion
6. Inverse Matrices
7. Cross Product
  - Compute the Cross Product of Two 3D Vectors
8. Cramer's Rule
  - Solve System of Linear Equations Using Cramer's Rule
9. Change of Basis
  - Transformation Matrix from Basis B to C
10. Eigenvector and Eigenvalues
  - Calculate Eigenvalues of a Matrix
Micrograd Builder
Optimizers

Labs

Learning Paths

Calculus for Machine Learning
1. Derivatives and Gradients
2. Multivariate Calculus
3. Neural Network Derivatives
4. Backpropagation
5. Gradient Descent
6. Optimization
7. Calculus Lab
  - MNIST: Classification Loss (with Gradient)
8. Pytorch: Calculus Lab 1
  - PyTorch: Implement Your Own Gradient Descent Training Step
9. Pytorch: Calculus Lab 2
  - PyTorch: Build a Complete Training Loop
Linear Algebra for Machine Learning
1. Vector Operations
2. Vector Norms and Independence
  - Engram Context-Aware
  - Compute the Null Space of a Matrix
3. Matrix Basics
4. Matrix Multiplication
5. Matrix Properties I
6. Matrix Properties II
7. Solving Linear Systems
8. Orthogonality and Projections
  - Implement Orthogonal Projection of a Vector Onto a Line
  - Compute Orthonormal Basis for 2D Vectors
9. Matrix Decompositions I
  - Check if Matrix is Positive Definite
  - LU Decomposition of a Square Matrix
10. Matrix Decompositions II
  - Singular Value Decomposition of a 2x2 Matrix
  - QR Decomposition
11. Covariance and Correlation
  - Calculate Covariance Matrix
  - Calculate Correlation Matrix
12. Linear Algebra Lab I
  - Numpy: Design Your Own Dimensionality Reduction
13. Linear Algebra Lab II
  - Dimensionality Reduction with Sklearn
Probability and Statistics for Machine Learning
1. Descriptive Statistics
2. Probability Fundamentals
3. Bayes' Theorem
4. Common Distributions I
5. Common Distributions II
6. Law of Large Numbers and CLT
7. Information Theory
8. KL Divergence
9. Maximum Likelihood and MAP
10. Statistical Inference
11. Bayesian Methods
12. Probabalistic Models

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
src		src
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep ML

Collections

Labs

Learning Paths

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Deep ML

Collections

Labs

Learning Paths

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages