
Starred repositories
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Multilingual Document Layout Parsing in a Single Vision-Language Model
Get your documents ready for gen AI
Reproducible and flexible LLM evaluations for scientific reasoning.
Qwen Code is a coding agent that lives in the digital world.
Kimi K2 is the large language model series developed by Moonshot AI team
ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry
AlphaGo Moment for Model Architecture Discovery.
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…
Train your Agent model via our easy and efficient framework
The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning
Scalable toolkit for efficient model reinforcement
The simplest, fastest repository for training/finetuning small-sized VLMs.
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Solve Visual Understanding with Reinforced VLMs
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
[COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.