A modular, scalable, high-performance training framework for LLMs, VLMs, diffusion, and embodied models.
-
Updated
Jun 5, 2026 - Python
A modular, scalable, high-performance training framework for LLMs, VLMs, diffusion, and embodied models.
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
[๐๐๐ ๐ ๐ฎ๐ฌ๐ฎ๐ฒ] Dispersion loss counteracts embedding condensation and improves generalization in small language models
Open catalog of datasets used to train and align LLMs across pretraining, mid-training, and post-training.
Add a description, image, and links to the mid-training topic page so that developers can more easily learn about it.
To associate your repository with the mid-training topic, visit your repo's landing page and select "manage topics."