- Beijing
Stars
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
[ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"
My learning notes/codes for ML SYS.
verl: Volcano Engine Reinforcement Learning for LLMs
A Recipe for Building LLM Reasoners to Solve Complex Instructions
A framework for few-shot evaluation of language models.
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs
FlagScale is a large model toolkit based on open-sourced projects.
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
Democratizing Reinforcement Learning for LLMs
Sky-T1: Train your own O1 preview model within $450
Fully open data curation for reasoning models
LLaSA: Large Language and Structured Data Assistant. NAACL 2025 Main.
Fully open reproduction of DeepSeek-R1
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Scalable RL solution for advanced reasoning of language models
Train transformer language models with reinforcement learning.
MausKaffee34767 / shadowsocks
Forked from shadowsocks/shadowsocksCompatibility fix for Shadowsocks [Python 3.10+]
A reading list on LLM based Synthetic Data Generation 🔥
Fast and memory-efficient exact attention
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic p…
A generative one-for-all model for joint graph language modeling
一个用于在 macOS 上平滑你的鼠标滚动效果或单独设置滚动方向的小工具, 让你的滚轮爽如触控板 | A lightweight tool used to smooth scrolling and set scroll direction independently for your mouse on macOS