Skip to content

joelburget/moe-sae

Repository files navigation

I'm now focused on Olmoe, though train_mixtral.py still exists.

Sweep:

wandb sweep --project moe-sae olmoe-config.yaml
wandb agent <sweep id printed by previous command>

Train:

python3 train_olmoe.py

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages