🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python · 65,723 stars · 6,611 forks · Updated Jan 22, 2026

microsoft / generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook · 106,344 stars · 56,987 forks · Updated Feb 11, 2026

mlfoundations / dclm

DataComp for Language Models

HTML · 1,415 stars · 129 forks · Updated Sep 9, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python · 8,888 stars · 602 forks · Updated May 3, 2024

OpenCoder-llm / OpenCoder-llm

The Open Cookbook for Top-Tier Code Large Language Model

Python · 2,039 stars · 116 forks · Updated Dec 8, 2024

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python · 5,891 stars · 325 forks · Updated Feb 13, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook · 85,212 stars · 12,899 forks · Updated Feb 9, 2026

Leolty / repobench

✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024

Python · 187 stars · 12 forks · Updated Aug 16, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python · 10,314 stars · 1,006 forks · Updated Jul 1, 2024

openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python · 24,608 stars · 5,863 forks · Updated Aug 14, 2024

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook · 15,235 stars · 1,288 forks · Updated May 23, 2024

ThreeAu Simba2017

Stars

jingyaogong / minimind

qibin0506 / Cortex

vllm-project / vllm-ascend

huggingface / open-r1

hiyouga / EasyR1

Simba2017 / EasyR1

deepseek-ai / DualPipe

deepseek-ai / open-infra-index

xlite-dev / LeetCUDA

se2p / pynguin

openai / simple-evals

peilongchencc / My-LLaMA-Factory

huggingface / Math-Verify

OpenHands / OpenHands

verl-project / verl

Jiayi-Pan / TinyZero

huggingface / smollm

deepseek-ai / DeepSeek-R1

hkust-nlp / simpleRL-reason

labmlai / annotated_deep_learning_paper_implementations