Build a Large Language Model from Scratch

This repository contains my own implementation of the code snippets from Sebastian Raschka's book Build a Large Language Model (from Scratch).

I wrote it while following the sessions of the study group that Santi Viquez organized around the book. More information is available in the group's Discord channel: AI from scratch.

Note

Although this type of book is traditionally followed with Jupyter notebooks, I opted for an object-oriented version in which classes live in their own files (see the scratch/ folder), as I found that approach easier to test.
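As a rough illustration of that layout, the sketch below shows a small class of the kind that could live in its own file, followed by a test that exercises it directly. The class name `WhitespaceTokenizer`, its methods, and the sample sentence are hypothetical; they are not taken from the scratch/ folder or from the book.

```python
from __future__ import annotations


class WhitespaceTokenizer:
    """Hypothetical example class; not part of the scratch/ folder."""

    def __init__(self, corpus: str):
        # Build the vocabulary from the sorted set of unique words.
        words = sorted(set(corpus.split()))
        self.token_to_id = {word: index for index, word in enumerate(words)}
        self.id_to_token = {index: word for word, index in self.token_to_id.items()}

    def encode(self, text: str) -> list[int]:
        # Map each whitespace-separated word to its vocabulary index.
        return [self.token_to_id[word] for word in text.split()]

    def decode(self, ids: list[int]) -> str:
        # Map the indices back to words and join them with spaces.
        return " ".join(self.id_to_token[i] for i in ids)


def test_round_trip():
    # Encoding and then decoding should return the original text.
    tokenizer = WhitespaceTokenizer("the quick brown fox")
    ids = tokenizer.encode("quick fox")
    assert tokenizer.decode(ids) == "quick fox"
```

Because the class is a plain importable object, a test can build it and check its output without running any notebook cells first.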

Tip

If you are interested in a Jupyter notebook version of the code from the book, the author himself has created a wonderful LLMs-from-scratch repository.

Completion

Chapter 2: Working with text data
Chapter 3: Coding attention mechanisms
Chapter 4: Implementing a GPT model from scratch
Chapter 5: Pretraining on unlabeled data
Chapter 6: Fine-tuning for classification
Chapter 7: Fine-tuning to follow instructions

Installation

This repository can be installed as a regular Python project. I don't plan to upload it to the Python Package Index, as it is meant for pedagogical purposes rather than production use.

git clone https://github.com/elcapo/llm-from-scratch
cd llm-from-scratch

Virtual Environment

Before installing the dependencies, it is recommended to create and activate a virtual environment.

# Create a virtual environment in the `.venv` folder
python -m venv .venv

# Activate the new virtual environment
source .venv/bin/activate

# Install the dependencies
pip install -r requirements.txt
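To confirm that the interpreter in use is the one from the virtual environment, a short check along the following lines can help. It is a minimal sketch that assumes the environment was created with the standard `venv` module; the function name `in_virtual_environment` is made up for this example.

```python
import sys


def in_virtual_environment() -> bool:
    """Return True when the interpreter runs inside a venv."""
    # Inside a venv, sys.prefix points at the environment, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtual_environment())
```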

Documentation

Chapter 2: Working with text data
Chapter 3: Coding attention mechanisms
Chapter 4: Implementing a GPT model from scratch
Chapter 5: Pretraining on unlabeled data
- Jupyter Notebook: Pretraining
- Jupyter Notebook: 12 Word World

Tests

To keep the tests readable, most of the examples they use are literal copies of the values (strings and vectors) that appear in the book.

pytest
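A test written in that style might look like the sketch below: the inputs and the expected values are spelled out as literals, so a reader can compare them side by side with a printed listing. The function `attention_scores`, the test name, and the numbers are assumptions made for illustration; they are not copied from the tests/ folder.

```python
import math

# Assumed sample embeddings: six tokens, three dimensions each.
INPUTS = [
    [0.43, 0.15, 0.89],
    [0.55, 0.87, 0.66],
    [0.57, 0.85, 0.64],
    [0.22, 0.58, 0.33],
    [0.77, 0.25, 0.10],
    [0.05, 0.80, 0.55],
]


def attention_scores(query, inputs):
    """Unnormalized scores: dot product of the query with every input vector."""
    return [sum(q * x for q, x in zip(query, row)) for row in inputs]


def test_attention_scores_for_second_token():
    # Expected values are written out literally instead of being recomputed.
    expected = [0.9544, 1.4950, 1.4754, 0.8434, 0.7070, 1.0865]
    scores = attention_scores(INPUTS[1], INPUTS)
    assert len(scores) == len(expected)
    assert all(math.isclose(s, e, abs_tol=1e-4) for s, e in zip(scores, expected))
```

Spelling out the expected values makes a failing test easy to diagnose, since the mismatch can be checked by hand against the reference numbers.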
