Weijie Kong JacobKong

25 followers · 3 following

Stars

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 Latest Papers, Codes and Datasets on Vid-LLMs.

1,482 74 Updated Oct 9, 2024

OpenGVLab / VideoMamba

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 828 60 Updated Jul 6, 2024

tyshiwo1 / DiM-DiffusionMamba

The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

Python 156 8 Updated Jun 28, 2024

state-spaces / mamba

Mamba SSM architecture

Python 13,061 1,109 Updated Oct 28, 2024

aimhubio / aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Python 5,203 320 Updated Nov 1, 2024

FoundationVision / OmniTokenizer

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 250 7 Updated Jul 9, 2024

Tencent / HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,388 290 Updated Oct 11, 2024

mayuelala / FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

851 34 Updated Apr 28, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 22,109 2,159 Updated Aug 9, 2024

whlzy / FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

Python 376 9 Updated Oct 31, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,770 177 Updated Oct 31, 2024

Vchitect / Latte

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,688 176 Updated Sep 28, 2024

chuanyangjin / fast-DiT

Fast Diffusion Models with Transformers

Python 719 94 Updated Oct 25, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,247 557 Updated May 31, 2024

Vchitect / LaVie

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 866 59 Updated Aug 20, 2024

chaojie / ComfyUI-Moore-AnimateAnyone

Python 208 20 Updated Jun 10, 2024

MooreThreads / Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,152 246 Updated May 31, 2024

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,060 805 Updated Jul 18, 2024

open-mmlab / PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images with text prompts, combining with DreamBooth, to achieve stunning videos.

Python 904 73 Updated Aug 5, 2024

google-research / magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 947 42 Updated Jan 17, 2024

lucidrains / magvit2-pytorch

Implementation of MagViT2 Tokenizer in PyTorch

Python 560 34 Updated Oct 14, 2024

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

Python 2,950 262 Updated Oct 22, 2024

TianxingWu / FreeInit

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 488 23 Updated Jan 18, 2024

ai-forever / KandinskyVideo

KandinskyVideo — multilingual end-to-end text2video latent diffusion model

Python 174 20 Updated May 28, 2024

showlab / Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Python 1,102 62 Updated Oct 30, 2023

hotshotco / Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,053 83 Updated Jan 23, 2024

deep-floyd / IF

Python 7,674 498 Updated Apr 14, 2024

ykk648 / AnimateDiff-I2V

Forked from guoyww/AnimateDiff

AnimateDiff I2V version.

Python 180 4 Updated Mar 1, 2024

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.

Jupyter Notebook 5,211 337 Updated Jun 28, 2024

rese1f / StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,388 87 Updated Sep 7, 2023