Skip to content
View dongzhuoyao's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Organizations

@CompVis

Block or report dongzhuoyao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,347 225 Updated Dec 12, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,192 816 Updated Nov 27, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,406 1,181 Updated Dec 1, 2024

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Python 400 15 Updated Dec 11, 2024

Does VLM Classification Benefit from LLM Description Semantics? (AAAI 2025)

Python 8 Updated Dec 17, 2024

The official Pytorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation”

Python 35 3 Updated Oct 22, 2024

A framework for few-shot evaluation of language models.

Python 7,319 1,975 Updated Dec 25, 2024

Code for BLT research paper

Python 1,112 70 Updated Dec 12, 2024

[arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 105 Updated Jun 12, 2024

Code for https://arxiv.org/abs/2406.04329

Python 42 1 Updated Dec 11, 2024

official code for Diff-Instruct algorithm for one-step diffusion distillation

Python 62 2 Updated Apr 6, 2024

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 424 14 Updated May 24, 2024
1 Updated Oct 31, 2024

[NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"

Python 8 1 Updated Dec 4, 2024

Autonomous agents for everyone

TypeScript 6,225 1,869 Updated Dec 28, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,592 62 Updated Dec 23, 2024

The official implementation of "[MASK] is All You Need"

104 3 Updated Dec 10, 2024

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 590 31 Updated Sep 27, 2024
Jupyter Notebook 4 Updated Dec 18, 2024

Bare-bones diffusion model code

Jupyter Notebook 144 11 Updated Jul 17, 2024

Tiny AutoEncoder for Stable Diffusion

Python 611 28 Updated Nov 7, 2024

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 221 13 Updated May 5, 2024

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jupyter Notebook 141 9 Updated Oct 28, 2024

DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence (WACV 2025)

Python 12 Updated Dec 6, 2024
Python 46 4 Updated Jul 30, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 6,760 510 Updated Dec 25, 2024

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)

Python 410 28 Updated Oct 18, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,000 1,095 Updated Dec 26, 2024

An open source implementation of the gameNgen paper

Python 10 6 Updated Dec 26, 2024

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,626 260 Updated Dec 21, 2024
Next