Skip to content
View Hhhhhhao's full-sized avatar
🥝
🥝
  • Pittsburgh

Organizations

@cmu-mlsp

Block or report Hhhhhhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EDM2 and Autoguidance -- Official PyTorch implementation

Python 621 28 Updated Dec 9, 2024

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Python 314 16 Updated Oct 7, 2024
Python 16 1 Updated Dec 8, 2024
Python 9 Updated Jan 23, 2025

Collection of common code that's shared among different research projects in FAIR computer vision team.

Python 2,071 227 Updated Nov 26, 2024

Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."

Jupyter Notebook 43 5 Updated Nov 11, 2024

A paper list of some recent works about Token Compress for Vit and VLM

297 15 Updated Jan 27, 2025

A suite of image and video neural tokenizers

Jupyter Notebook 1,534 65 Updated Jan 19, 2025

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 819 40 Updated Jan 28, 2025

XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation

Python 182 Updated Feb 1, 2025

O1 Replication Journey

1,910 59 Updated Jan 14, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,170 164 Updated Jan 30, 2025

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,487 154 Updated Oct 28, 2024

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,149 49 Updated Jan 23, 2025

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 61 5 Updated Aug 24, 2024
Python 489 45 Updated Nov 20, 2024
Python 307 22 Updated Nov 21, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,478 983 Updated Jan 22, 2025

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 540 22 Updated Aug 16, 2024

A framework for few-shot evaluation of language models.

Python 7,618 2,047 Updated Jan 31, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,256 68 Updated Sep 27, 2024

This is the official implementation for ControlVAR.

Python 91 3 Updated Dec 10, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,531 68 Updated Aug 15, 2024

An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch

Python 295 35 Updated May 23, 2023

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 716 38 Updated Aug 5, 2024

This is a repo to track the latest autoregressive visual generation papers.

121 Updated Jan 24, 2025
Python 414 44 Updated Jul 19, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,914 113 Updated Jul 29, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 996 65 Updated Sep 25, 2024
Python 13 Updated Jul 30, 2024
Next