See the book live at https://llms.danwil.com.
by Dan Wilhelm
In this book, we develop a low-level understanding of Large Language Models (LLMs). To understand large circuits, we apply learnings from mathematics and the analysis of small circuits.
This book has an accompanying YouTube channel and project GitHub repo (coming soon).
To get the most out of this book, readers should be fluent in Python and know the basics of NumPy, linear algebra, and machine learning. Our target audience therefore includes researchers, CS students, and software engineers with an interest in LLMs.
Currently, our focus is on analysis rather than training.
This site is built using the Rust static-site generator mdBook.
To build the site:
- Install mdBook.
- Install the katex preprocessor: `cargo install mdbook-katex`.
- Install the mermaid preprocessor: `cargo install mdbook-mermaid`.
- From the project directory, run `mdbook build`. The output will be in the `book` directory.
- Alternatively, to view the site immediately in watch mode, run `mdbook serve --open`.
Note: The site also uses a table-of-contents modification that does not require installation.
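For convenience, the build steps listed above could be collected into a small shell script. This is only a sketch under a few assumptions: that `cargo` is on your `PATH`, that mdBook itself is installed through `cargo install mdbook` (the steps above do not say how to install it), and that the script is run from the project directory. The `run` and `build_site` helpers and the `DRY_RUN` variable are hypothetical names introduced here for illustration; they are not part of mdBook or of this project.

```shell
#!/bin/sh
# Sketch: the build steps above, in order. Not part of the project itself.
set -eu

# Hypothetical helper: print the command when DRY_RUN=1, otherwise run it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical helper: install the tools, then build the site.
build_site() {
    run cargo install mdbook          # assumption: mdBook installed via cargo
    run cargo install mdbook-katex    # katex preprocessor
    run cargo install mdbook-mermaid  # mermaid preprocessor
    run mdbook build                  # output goes to the book directory
}
```

After loading these definitions into a shell, `DRY_RUN=1 build_site` prints the four commands without running them, and a plain `build_site` executes them. To preview the site instead of only building it, `mdbook serve --open` can be run afterwards.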