

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 896 42 Updated Aug 7, 2025
Python 179 4 Updated Dec 17, 2024

A suite of image and video neural tokenizers

Jupyter Notebook 1,670 80 Updated Feb 11, 2025

HLS-based framework to accelerate the implementation of 2-D DP kernels on FPGA

C++ 11 2 Updated Jun 20, 2025

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 383 14 Updated Apr 25, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,472 294 Updated Sep 5, 2025

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 630 41 Updated Oct 16, 2024

A sparse attention kernel supporting mixed sparse patterns

C++ 293 15 Updated Feb 13, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,724 104 Updated Sep 27, 2024
Python 186 9 Updated Jul 12, 2024

[ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Python 79 8 Updated May 31, 2024

Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024

Python 87 4 Updated Jun 12, 2024

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 330 37 Updated Jul 10, 2025

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 934 36 Updated Jun 27, 2025

[NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Python 70 4 Updated Feb 11, 2025

[ICLR 2025 Spotlight] OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 391 6 Updated May 5, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,857 84 Updated Aug 15, 2024

Tile primitives for speedy kernels

Cuda 2,672 171 Updated Sep 4, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 921 49 Updated Mar 5, 2025

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 590 32 Updated Oct 6, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,596 160 Updated Oct 28, 2024

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 616 49 Updated Aug 14, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 747 51 Updated Mar 6, 2025

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 290 39 Updated Jun 18, 2025
Jupyter Notebook 935 106 Updated Apr 29, 2024

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,382 533 Updated May 18, 2025

Code for the NeurIPS 2024 paper QuaRot: end-to-end 4-bit inference of large language models.

Python 420 49 Updated Nov 26, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,313 281 Updated May 4, 2024

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,522 293 Updated Aug 6, 2025

Microsoft Collective Communication Library

C++ 360 31 Updated Sep 20, 2023