A simplified adaption of the https://github.com/karpathy/nanoGPT repository, for a small model trained on the german translation of shakespeares works.
If you only wan't to see the result open the jupyter notebook LLM-showcase.ipynb.
- create a venv from the requirements.txt
- if you get dependency problems, try to resolve them with the information from the requirements-pip-freeze.txt
- python data/shakespeare_deutsch/prepare.py
- on my hardware setup i ran into a memory shortage, setting this env variable helped (export 'PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:512')
- python train.py config/train_shakespeare.py