m2kulkarni / lolcats Public

forked from HazyResearch/lolcats

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
configs		configs
lm_eval_harness		lm_eval_harness
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
distill_llama.py		distill_llama.py
environment.yaml		environment.yaml

Repository files navigation

LoLCATs

[WIP]

Example Commands

python distill_llama.py \
--model_config distill_long_llama3_8b_lk_smd_wtk64_fd64_w01 \
--distill_config distill_long_alpaca_8k_xent0_mse1000_lr1e-2_bs1 \
--finetune_config finetune_long_lora_qkvo_alpaca_clean_8192 \
--eval_config eval_alpaca_clean  \
--lk_zero_init --verbose --seed 0 --replicate 614 --state_chunk_len 1024 \
--num_train_epochs 2

cmd 3

python distill_llama.py \
--model_config distill_llama3_8b_lk_smd_wtk64_fd64_w01 \
--distill_config distill_alpaca_clean_xent0_mse1000_lr1e-2 \
--finetune_config finetune_lora_qkvo_alpaca_clean \
--eval_config eval_alpaca_clean  \
--lk_zero_init --verbose --seed 0 --replicate 614 \
--num_train_epochs 2