A unified inference and post-training framework for accelerated video generation.
Updated Dec 23, 2025 (Python)
Awesome Reasoning LLM Tutorial/Survey/Guide
Mental health LLM (LLM x Mental Health): pre- and post-training, datasets, evaluation, deployment, and RAG, with InternLM / Qwen / Baichuan / DeepSeek / Mixtral / Llama / GLM series models
Explore the Multimodal “Aha Moment” on 2B Model
"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
A unified model for 4D human-scene reconstruction
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Supports VLAs, e.g., pi0 and pi0.5. Fully open-sourced.
Post-training scripts and samples for NVIDIA Cosmos ecosystem
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Official repo of the paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.
A High-Efficiency System of Large Language Model Based Search Agents
A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
Salesforce AI Research's open diffusion language model
[EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
Code repository dedicated to experimentation and research on tiny reasoning language models
[AAAI 2026] D²PPO: Diffusion Policy Policy Optimization with Dispersive Loss.