Name	Name	Last commit message	Last commit date
parent directory ..
accelerate_configs	accelerate_configs
.DS_Store	.DS_Store
README.md	README.md
config_demo_code_largebatch_codeformat_largerollout.yaml	config_demo_code_largebatch_codeformat_largerollout.yaml
config_demo_code_sft.yaml	config_demo_code_sft.yaml

Name

Last commit message

Last commit date

accelerate_configs

.DS_Store

README.md

config_demo_code_largebatch_codeformat_largerollout.yaml

config_demo_code_sft.yaml

Post-training recipes

OpenR1 Distill 7B

To train the OpenR1 Distill 7B model, run:

sbatch --nodes=1 slurm/train.slurm --model OpenR1-Distill-7B --task sft --config distill --accelerator zero3

OlympicCoder

To train the OlympicCoder models, run:

# 7B
sbatch --nodes=1 slurm/train.slurm --model OlympicCoder-7B --task sft --config v00.00 --accelerator zero3

# 32B
sbatch --nodes=16 slurm/train.slurm --model OlympicCoder-32B --task sft --config v00.00 --accelerator fsdp

Note that we found it necessary to switch to FSDP1 and paged AdamW 8-bit for the 32B model in order to fit the largest possible context size.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Post-training recipes

OpenR1 Distill 7B

OlympicCoder

FilesExpand file tree

configs

Directory actions

More options

Directory actions

More options

Latest commit

History

configs

Folders and files

parent directory

README.md

Post-training recipes

OpenR1 Distill 7B

OlympicCoder