⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL / Swift / Ultra…

Python 2,590 143 Updated Sep 12, 2025

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,471 137 Updated Sep 11, 2025

MiniMax-AI / One-RL-to-See-Them-All

The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning

Python 312 16 Updated May 31, 2025

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 856 130 Updated Sep 12, 2025

GAIR-NLP / PC-Agent-E

Efficient Agent Training for Computer Use

Python 130 4 Updated Sep 5, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,023 380 Updated Sep 10, 2025

GAIR-NLP / OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

Jupyter Notebook 172 12 Updated Jul 23, 2025

OpenNLPLab / lightning-attention

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 325 26 Updated Feb 23, 2025

GAIR-NLP / cognition-engineering

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Python 205 9 Updated Apr 22, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,532 350 Updated Aug 29, 2025

MoonshotAI / Kimi-VL

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,058 50 Updated Jul 15, 2025

ByteDance-Seed / Seed-Thinking-v1.5

814 18 Updated Jun 9, 2025

Qihoo360 / Light-R1

Python 741 50 Updated Sep 3, 2025

LLM360 / MegaMath

[COLM 2025] An Open Math Pre-training Dataset with 370B Tokens.

XSLT 100 7 Updated Apr 4, 2025

multimodal-art-projection / COIG-P

Python 39 2 Updated Jul 15, 2025

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 865 51 Updated Sep 12, 2025

Zengzhi Wang SinclairCoder

Starred repositories

scikit-learn

Natural language processing

pytorch-absa

absa

opinion-mining

opinion-target-extraction