Skip to content
View YaooXu's full-sized avatar
  • Beijing

Block or report YaooXu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

532 27 Updated Sep 16, 2025

[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"

Python 93 11 Updated Jun 20, 2025

My learning notes/codes for ML SYS.

Python 3,601 221 Updated Sep 12, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 13,403 2,365 Updated Sep 16, 2025

A Recipe for Building LLM Reasoners to Solve Complex Instructions

Python 22 Updated Aug 1, 2025

A framework for few-shot evaluation of language models.

Python 10,117 2,732 Updated Sep 12, 2025

Simple RL training for reasoning

Python 3,742 281 Updated Aug 3, 2025

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,828 72 Updated Jul 22, 2025

Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs

Python 63 3 Updated Jul 25, 2025

FlagScale is a large model toolkit based on open-sourced projects.

Python 353 107 Updated Sep 12, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,713 70 Updated Sep 16, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,155 385 Updated Sep 11, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,329 339 Updated Jul 12, 2025

Fully open data curation for reasoning models

Python 2,084 173 Updated Sep 3, 2025

LLaSA: Large Language and Structured Data Assistant. NAACL 2025 Main.

Python 4 Updated Feb 17, 2025

Fully open reproduction of DeepSeek-R1

Python 25,429 2,373 Updated Sep 8, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 58,334 7,170 Updated Sep 15, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,728 99 Updated Mar 18, 2025

Train transformer language models with reinforcement learning.

Python 15,525 2,194 Updated Sep 16, 2025

Compatibility fix for Shadowsocks [Python 3.10+]

Python 2 2 Updated Apr 1, 2024

A reading list on LLM based Synthetic Data Generation 🔥

1,409 85 Updated Jun 5, 2025

Fast and memory-efficient exact attention

Python 19,488 1,998 Updated Sep 13, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,831 376 Updated Sep 6, 2025

This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic p…

Python 599 67 Updated Aug 26, 2021

A generative one-for-all model for joint graph language modeling

Python 42 7 Updated Jun 23, 2025

Awesome papers about unifying LLMs and KGs

2,461 170 Updated May 2, 2025

一个用于在 macOS 上平滑你的鼠标滚动效果或单独设置滚动方向的小工具, 让你的滚轮爽如触控板 | A lightweight tool used to smooth scrolling and set scroll direction independently for your mouse on macOS

Swift 17,452 562 Updated Sep 15, 2025
Next