Skip to content
#

nano-gpt

Here are 11 public repositories matching this topic...

Provide specialized AI agents that develop, review, debug, and deploy production-ready code efficiently across various programming tasks.

  • Updated Jun 6, 2026
  • JavaScript

From-scratch LLM inference engine in PyTorch. MHA/GQA/MQA in one module, RoPE (NeoX/Llama half-rotation), append-only KV cache with numerical parity vs F.scaled_dot_product_attention, RMSNorm + SwiGLU decoder blocks, KV-cached greedy/top-k sampling. Trains a 1M-param GQA transformer on Tiny Shakespeare end-to-end.

  • Updated May 15, 2026
  • Python

Improve this page

Add a description, image, and links to the nano-gpt topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nano-gpt topic, visit your repo's landing page and select "manage topics."

Learn more