⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)


[Installation Guide] | [MoEfication Docs] | [Continual Pre-training Docs]

🌴 Dependencies

  • Python==3.11.4
    • Packages: please check requirements.txt (NOTE: flash-attn must be installed by following its official instructions)
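
For reference, a minimal environment setup could look like the sketch below. The environment name llama-moe is an assumption, and flash-attn's prerequisites (CUDA toolkit, matching PyTorch build) depend on your machine, so consult its docs first:

$ conda create -n llama-moe python=3.11.4      # environment name is an assumption
$ conda activate llama-moe
$ pip install -r requirements.txt
$ pip install flash-attn --no-build-isolation  # per the flash-attn installation instructions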

🚀 QuickStart

Tokenization

  • RedPajama: bash scripts/tokenize/redpajama.sh (Remember to update the folder paths in the script first; see the sketch below.)
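
The path edits usually amount to pointing the script at your raw data and an output directory. A hypothetical sketch follows; the variable names are assumptions and may not match the actual script:

# hypothetical variables inside scripts/tokenize/redpajama.sh
DATA_DIR=/path/to/redpajama_raw        # raw RedPajama shards (name is an assumption)
OUT_DIR=/path/to/redpajama_tokenized   # tokenized output (name is an assumption)

$ bash scripts/tokenize/redpajama.sh   # run once the paths are set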

Continual Pre-training (CPT)

NOTICE: Please create the logs/ folder manually before submitting jobs: mkdir -p logs

  • LLaMA MoEfication LoRA: sbatch scripts/cpt/lora.sh
  • LLaMA MoEfication Full-Parameter: sbatch scripts/cpt/fpt.sh
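
Both scripts are Slurm batch jobs. A typical submission flow, assuming a Slurm cluster and that job output is written under logs/ (the exact log filename is an assumption), might look like:

$ mkdir -p logs                  # required before submission
$ sbatch scripts/cpt/lora.sh     # or scripts/cpt/fpt.sh for full-parameter CPT
$ squeue -u $USER                # confirm the job is queued or running
$ tail -f logs/<your_job_log>    # follow training output; filename is an assumption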

🤝 Contribution

  • Make sure the Python version is >=3.10 (a strict version constraint for better type hinting).
$ conda install git                                     # upgrade git
$ git clone git@github.com:pjlab-sys4nlp/train-moe.git
$ cd train-moe
$ pip install -e .[dev]                                 # editable install with dev dependencies
$ pre-commit install                                    # set up git hooks for linting and formatting
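
After pre-commit install, the hooks run automatically on every commit. To check the entire tree in one pass, you can run:

$ pre-commit run --all-files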