Skip to content
View manyizhang's full-sized avatar

Block or report manyizhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

s1: Simple test-time scaling

Python 5,938 684 Updated Mar 6, 2025

Utilities intended for use with Llama models.

Python 5,905 1,005 Updated Mar 1, 2025

CLiB中文大模型能力评测榜单(持续更新):目前已囊括195个大模型,覆盖chatgpt、gpt-4o、o3-mini、谷歌gemini、Claude3.5、智谱GLM-Zero、文心一言、qwen-max、百川、讯飞星火、商汤senseChat、minimax等商用模型, 以及DeepSeek-R1、deepseek-v3、qwen2.5、llama3.3、phi-4、glm4、书生int…

3,726 165 Updated Mar 11, 2025

Tools for merging pretrained large language models.

Python 5,403 510 Updated Mar 12, 2025

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,682 259 Updated Dec 27, 2024

Awesome LLM compression research papers and tools.

1,410 90 Updated Mar 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,877 5,365 Updated Mar 11, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,440 202 Updated Aug 11, 2024
Python 18 Updated Oct 17, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,127 1,045 Updated Mar 12, 2025

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 493 145 Updated Mar 6, 2025

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 843 57 Updated Feb 16, 2025

Awesome list for LLM quantization

Python 182 11 Updated Dec 24, 2024

Benchmarking LLMs with Challenging Tasks from Real Users

Python 218 41 Updated Nov 3, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 103,049 16,689 Updated Mar 12, 2025

AllenAI's post-training codebase

Python 2,789 358 Updated Mar 12, 2025

Open Source WizardCoder Dataset

Python 155 12 Updated Jul 12, 2023
Jupyter Notebook 80 2 Updated Dec 29, 2023
Python 1,506 160 Updated Mar 11, 2025

🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org

Python 9,762 999 Updated Mar 12, 2025

Instruction Tuning with GPT-4

HTML 4,279 303 Updated Jun 11, 2023
Python 1,459 110 Updated May 12, 2023

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,352 731 Updated Aug 5, 2024

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

622 33 Updated Apr 7, 2024

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

1,115 58 Updated Jan 4, 2024

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Python 486 24 Updated Apr 4, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,087 4,654 Updated Mar 1, 2025

[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Python 214 18 Updated Dec 24, 2023

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,019 218 Updated Mar 4, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,426 2,368 Updated Mar 11, 2025
Next