Organizations

@thunlp @ROCm @OpenBMB @RLsys-Foundation

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,765 358 Updated Sep 8, 2025

I recently interviewed with some AI labs and these are the notes I took during my study for ML fundamentals and Design. This was in Mar 2025 and given how fast the field of AI moves, some of it may…

17 3 Updated Aug 21, 2025

Research prototype of PRISM — a cost-efficient multi-LLM serving system with flexible time- and space-based GPU sharing.

Python 24 Updated Aug 15, 2025

Materials for learning SGLang

566 47 Updated Aug 31, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 3,758 193 Updated Aug 15, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 501 30 Updated Sep 8, 2025

Allow torch tensor memory to be released and resumed later

Python 2 Updated Aug 12, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,890 367 Updated Sep 8, 2025

Long-RL: Scaling RL to Long Sequences

Python 604 21 Updated Sep 8, 2025

slime is an LLM post-training framework aimed at scaling RL.

Python 1 Updated Sep 8, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1 Updated Jun 30, 2025

Ongoing research training transformer models at scale

Python 1 Updated Jun 29, 2025

slime is an LLM post-training framework for RL scaling.

Python 1,675 146 Updated Sep 8, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 116 16 Updated Sep 5, 2025
Python 1 Updated Sep 8, 2025
C++ 150 79 Updated Sep 8, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 823 93 Updated Sep 8, 2025

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 1 1 Updated Aug 24, 2025

Code to automatically prove or verify estimates in analysis

JavaScript 303 24 Updated Jul 1, 2025

Allow torch tensor memory to be released and resumed later

Python 124 20 Updated Aug 29, 2025

My learning notes/codes for ML SYS.

Python 1 Updated Jun 2, 2025

Example environments for LLM agents, designed to integrate with VeRL

Python 3 1 Updated Apr 24, 2025

Distributed RL System for LLM Reasoning

Python 2,565 168 Updated Sep 8, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,895 715 Updated Aug 20, 2025

My learning notes/codes for ML SYS.

Python 3,544 217 Updated Sep 3, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,908 281 Updated May 15, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 2 2 Updated Jul 8, 2025

Fully open reproduction of DeepSeek-R1

Python 25,394 2,368 Updated Sep 8, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,327 338 Updated Jul 12, 2025