Principal-Spectral-Regularization-For-LLM-Training

Setup

To set up the environment for this project, follow these steps:

Clone the repository:

git clone https://github.com/xie-lab-ml/Principal-Spectral-Regularization-For-LLM-Training.git
cd Principal-Spectral-Regularization-For-LLM-Training

Create a virtual environment with Conda:

conda create -n llm-psr python=3.10
conda activate llm-psr

Install dependencies:
```
pip install -r requirements.txt
```

Download Dataset

To download the processed 100k-doc training sample dataset, run the following command:

# Download llama model data
wget https://bj.bcebos.com/paddlenlp/models/transformers/llama/data/llama_openwebtext_100k.bin
wget https://bj.bcebos.com/paddlenlp/models/transformers/llama/data/llama_openwebtext_100k.idx

LLaMA Pretraining

To run the default pretraining experiment on LLaMA models, run the following command to invoke the script:

python -u -m paddle.distributed.launch --gpus 0,1,2,3,4,5,6,7 run_pretrain.py config/llama/pretrain_argument.json

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
configs		configs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run_pretrain.py		run_pretrain.py
sgdm_psr_paddle.py		sgdm_psr_paddle.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Principal-Spectral-Regularization-For-LLM-Training

Table of Contents

Setup

Download Dataset

LLaMA Pretraining

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Principal-Spectral-Regularization-For-LLM-Training

Table of Contents

Setup

Download Dataset

LLaMA Pretraining

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages