Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"

6 Updated Feb 20, 2025

A library that scrapes LinkedIn for user data

Python 2,296 615 Updated Dec 13, 2024

🐳 Docker introductory study notes

1,553 278 Updated Feb 25, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,975 4,636 Updated Feb 28, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,152 512 Updated Feb 28, 2025

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,794 103 Updated Jan 21, 2024

Fast and memory-efficient exact attention

Python 15,987 1,504 Updated Feb 28, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 673 26 Updated Feb 25, 2025

🔥 A minimal training framework for scaling FLA models

Python 70 13 Updated Feb 26, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,020 123 Updated Feb 28, 2025

The official implementation of Self-Play Preference Optimization (SPPO)

Python 494 47 Updated Jan 23, 2025

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Python 515 90 Updated Feb 18, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,384 713 Updated Dec 17, 2024

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Python 702 70 Updated Feb 28, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,453 2,742 Updated Feb 28, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,587 70 Updated Aug 15, 2024

Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]

Python 50 1 Updated Feb 18, 2025

Google Research

Jupyter Notebook 35,000 8,017 Updated Feb 27, 2025

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Python 1,545 156 Updated Mar 16, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 11,925 774 Updated Feb 28, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 13,232 893 Updated Feb 27, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,742 441 Updated Jan 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,784 5,962 Updated Feb 28, 2025

A brief and partial summary of RLHF algorithms.

95 2 Updated Nov 24, 2024

[ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need

Python 188 6 Updated Dec 11, 2024

Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

Python 1,217 183 Updated Feb 26, 2025

[ICML 2024] CLLMs: Consistency Large Language Models

Python 377 17 Updated Nov 16, 2024

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1,386 123 Updated Jan 16, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,953 362 Updated Feb 28, 2025

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 493 52 Updated Feb 29, 2024