Skip to content

benjaminstern/nanoGPT

 
 

Repository files navigation

A simplified adaption of the https://github.com/karpathy/nanoGPT repository, for a small model trained on the german translation of shakespeares works.

Showcase

If you only wan't to see the result open the jupyter notebook LLM-showcase.ipynb.

How to train the model

  • create a venv from the requirements.txt
    • if you get dependency problems, try to resolve them with the information from the requirements-pip-freeze.txt
  • python data/shakespeare_deutsch/prepare.py
  • on my hardware setup i ran into a memory shortage, setting this env variable helped (export 'PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:512')
  • python train.py config/train_shakespeare.py

About

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 86.2%
  • Jupyter Notebook 13.8%