Skip to content
View JacobKong's full-sized avatar

Block or report JacobKong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,482 74 Updated Oct 9, 2024

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 828 60 Updated Jul 6, 2024

The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Python 156 8 Updated Jun 28, 2024

Mamba SSM architecture

Python 13,061 1,109 Updated Oct 28, 2024

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Python 5,203 320 Updated Nov 1, 2024

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 250 7 Updated Jul 9, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,388 290 Updated Oct 11, 2024

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

851 34 Updated Apr 28, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,109 2,159 Updated Aug 9, 2024

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

Python 376 9 Updated Oct 31, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,770 177 Updated Oct 31, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,688 176 Updated Sep 28, 2024

Fast Diffusion Models with Transformers

Python 719 94 Updated Oct 25, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,247 557 Updated May 31, 2024

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 866 59 Updated Aug 20, 2024

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,152 246 Updated May 31, 2024

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,060 805 Updated Jul 18, 2024

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

Python 904 73 Updated Aug 5, 2024

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 947 42 Updated Jan 17, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 560 34 Updated Oct 14, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 2,950 262 Updated Oct 22, 2024

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 488 23 Updated Jan 18, 2024

KandinskyVideo — multilingual end-to-end text2video latent diffusion model

Python 174 20 Updated May 28, 2024

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Python 1,102 62 Updated Oct 30, 2023

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,053 83 Updated Jan 23, 2024
Python 7,674 498 Updated Apr 14, 2024

AnimateDiff I2V version.

Python 180 4 Updated Mar 1, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,211 337 Updated Jun 28, 2024

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,388 87 Updated Sep 7, 2023
Next