⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)


train-moe

[MoEfication Docs]

🌴 Dependencies

  • Python >= 3.10
    • scikit-learn>=1.3.0
    • omegaconf>=2.0.6
    • tqdm>=4.65.0
    • datasets>=2.13.1
    • transformers>=4.30.2
    • peft>=0.4.0
    • xformers>=0.0.20
    • k_means_constrained==0.7.3
    • flash-attention: install it by following the instructions at https://github.com/Dao-AILab/flash-attention
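
For reference, installing these by hand with pip would look roughly like the sketch below (version pins copied from the list above; the flash-attention line follows the command documented in its repository, so check there for prerequisites such as CUDA and ninja):

$ pip install "scikit-learn>=1.3.0" "omegaconf>=2.0.6" "tqdm>=4.65.0" \
      "datasets>=2.13.1" "transformers>=4.30.2" "peft>=0.4.0" \
      "xformers>=0.0.20" "k_means_constrained==0.7.3"
$ # flash-attention builds a CUDA extension; see its README if this step fails
$ pip install flash-attn --no-build-isolation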

🚀 QuickStart

Tokenization

  • RedPajama: bash scripts/tokenize/redpajama.sh (Don't forget to change the folder paths.)
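
A minimal sketch of that step (the input and output locations live inside the script itself, so edit them there before running):

$ # point the raw-data and output folder paths inside scripts/tokenize/redpajama.sh to your own directories
$ bash scripts/tokenize/redpajama.sh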

Continual Pre-training (CPT)

NOTICE: Please create the logs/ folder manually: mkdir -p logs

  • Dense LLaMA-v1 LoRA: sbatch scripts/cpt/lora.sh
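
Putting the notice and the script together, a full submission on a Slurm cluster looks like this (sbatch assumes Slurm is available; the partition and GPU settings inside the script still need to match your environment):

$ mkdir -p logs               # the CPT scripts expect this log folder to exist
$ sbatch scripts/cpt/lora.sh  # submit the dense LLaMA-v1 LoRA continual pre-training job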

🤝 Contribution

  • Make sure the Python version is >= 3.10 (a strict version constraint for better type hinting)
$ git clone git@github.com:pjlab-sys4nlp/train-moe.git
$ pip install -e .[dev]
$ pre-commit install
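
After installing the hooks, you can optionally run them once over the whole repository to verify your setup (standard pre-commit usage, not specific to this project):

$ pre-commit run --all-files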