Stars
Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"
A library that scrapes LinkedIn for user data
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Fast and memory-efficient exact attention
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
🔥 A minimal training framework for scaling FLA models
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
The official implementation of Self-Play Preference Optimization (SPPO)
A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]
Google Research
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it combines the best of RNN and transformer.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction".
A high-throughput and memory-efficient inference and serving engine for LLMs
A brief and partial summary of RLHF algorithms.
[ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need
Official PyTorch repository for "Extreme Compression of Large Language Models via Additive Quantization" (https://arxiv.org/pdf/2401.06118.pdf) and "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression"
[ICML 2024] CLLMs: Consistency Large Language Models
WikiChat is an improved RAG pipeline that stops large language models from hallucinating by grounding their answers in data retrieved from a corpus.
verl: Volcano Engine Reinforcement Learning for LLMs
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
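Several entries above (loralib, S-LoRA, OpenRLHF's LoRA support) revolve around LoRA. As a reminder of the core idea, here is a minimal NumPy sketch of the low-rank update y = x(W + (α/r)·BA)ᵀ. This is an illustration only, not code from any of the listed repos, and the shapes and scaling convention follow the original LoRA paper:

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16):
    """Linear layer with a LoRA update, computed in factored form.

    Equivalent to x @ (W + (alpha/r) * B @ A).T, but without ever
    materializing the merged (d_out, d_in) weight matrix.
    """
    r = A.shape[0]
    scale = alpha / r
    return x @ W.T + scale * (x @ A.T) @ B.T

rng = np.random.default_rng(0)
d_in, d_out, r = 8, 4, 2
W = rng.normal(size=(d_out, d_in))      # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01   # trainable, small random init
B = np.zeros((d_out, r))                # trainable, zero init

# With B = 0 the adapter is a no-op: output equals the frozen base layer.
x = rng.normal(size=(3, d_in))
assert np.allclose(lora_forward(x, W, A, B), x @ W.T)
```

Because B starts at zero, fine-tuning begins exactly at the pretrained model; only the r·(d_in + d_out) adapter parameters are trained, and the factored form is what lets systems like S-LoRA serve many adapters over one shared base weight.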
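Policy-gradient RL is another recurring theme (the discrete-diffusion fine-tuning paper, OpenRLHF, verl, the RLHF summary). As a minimal sketch of the underlying update, here is vanilla REINFORCE on a toy two-armed bandit in plain NumPy; it is not taken from any of the listed repos:

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

# Two-armed bandit: arm 0 always pays reward 1, arm 1 pays 0.
theta = np.zeros(2)   # policy logits
lr = 0.5
for _ in range(200):
    p = softmax(theta)
    a = rng.choice(2, p=p)        # sample an action from the policy
    r = 1.0 if a == 0 else 0.0    # observe reward
    # For a softmax policy, grad log pi(a) = one_hot(a) - p.
    grad_logp = -p
    grad_logp[a] += 1.0
    theta += lr * r * grad_logp   # REINFORCE: ascend r * grad log pi(a)

assert softmax(theta)[0] > 0.9    # policy learned to prefer the rewarded arm
```

PPO-style methods (as in OpenRLHF and verl) build on this same r·∇log π(a) signal, adding a learned baseline and a clipped importance-sampling ratio to keep updates stable.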