
transformer-from-scratch

Here are 17 public repositories matching this topic...

A complete implementation of the "Attention Is All You Need" Transformer model from scratch using PyTorch. This project focuses on building and training a Transformer for neural machine translation (English-to-Italian) on the OpusBooks dataset.

  • Updated Nov 8, 2025
  • Python

PyTorch Transformer for neural machine translation (NMT), inspired by Attention Is All You Need. German→English on OPUS Books: training, inference, and attention visualization.

  • Updated May 14, 2026
  • Jupyter Notebook

An educational implementation of core Transformer architecture concepts, built from scratch in Python. This project explores how modern NLP transformer models work internally by implementing attention mechanisms, embeddings, positional encoding, and next-word prediction logic step by step.

  • Updated May 21, 2026
  • Jupyter Notebook
