T2V-Turbo

This repository provides the official implementation of T2V-Turbo and T2V-Turbo-v2 from the following papers.

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li, Weixi Feng, Tsu-Jui Fu, Xinyi Wang, Sugato Basu, Wenhu Chen, William Yang Wang

Paper: https://arxiv.org/abs/2405.18750

Project Page: https://t2v-turbo.github.io/

T2V-Turbo-v2: Enhancing Video Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li, Qian Long, Jian Zheng, Xiaofeng Gao, Robinson Piramuthu, Wenhu Chen, William Yang Wang

Paper: https://arxiv.org/abs/2410.05677

Project Page: https://t2v-turbo-v2.github.io/

🔔 News

[10.14.2024] Added Replicate Demo and API for T2V-Turbo-v2 .

[10.09.2024] Release the training and inference codes for T2V-Turbo-v2.

[06.24.2024] Release the training codes for T2V-Turbo (VC2).

Fast and High-Quality Text-to-video Generation 🚀

16-Step Results of T2V-Turbo-v2


light wind, feathers moving, she moves her gaze	Pikachu snowboarding	A musician strums his guitar, serenading the moonlit night


camera pan from left to right, a man wearing sunglasses and business suit	A cat wearing sunglasses at a pool	A raccoon is playing the electronic guitar

4-Step Results of T2V-Turbo


With the style of low-poly game art, A majestic, white horse gallops gracefully across a moonlit beach.	medium shot of Christine, a beautiful 25-year-old brunette resembling Selena Gomez, anxiously looking up as she walks down a New York street, cinematic style	a cartoon pig playing his guitar, Andrew Warhol style


a dog wearing vr goggles on a boat	Pikachu snowboarding	a girl floating underwater

8-Step Results of T2V-Turbo


Mickey Mouse is dancing on white background	light wind, feathers moving, she moves her gaze, 4k	fashion portrait shoot of a girl in colorful glasses, a breeze moves her hair


With the style of abstract cubism, The flowers swayed in the gentle breeze, releasing their sweet fragrance.	impressionist style, a yellow rubber duck floating on the wave on the sunset	A Egyptian tomp hieroglyphics painting ofA regal lion, decked out in a jeweled crown, surveys his kingdom.

🏭 Installation

pip install accelerate transformers diffusers webdataset loralib peft pytorch_lightning open_clip_torch==2.24.0 hpsv2 image-reward peft wandb av einops packaging omegaconf opencv-python kornia moviepy imageio torchdata==0.8.0 decord torchaudio bitsandbytes langdetect scipy

pip install git+https://github.com/openai/CLIP.git
pip install flash-attn --no-build-isolation
git clone https://github.com/Dao-AILab/flash-attention.git
cd flash-attention
pip install csrc/fused_dense_lib csrc/layer_norm

conda install xformers -c xformers

🛞 Model Checkpoints

Model	Resolution	Checkpoints
T2V-Turbo-v2 w/ MG	320x512
T2V-Turbo-v2 w/o MG	320x512
T2V-Turbo (VC2)	320x512
T2V-Turbo (MS)	256x256

🚀 Inference

We provide local demo codes supported with gradio (For MacOS users, need to set the device="mps" in app.py; For Intel GPU users, set device="xpu" in app.py). Please install gradio

pip install gradio==3.48.0

And Download the model checkpoint of VideoCrafter2.

T2V-Turbo-v2

To play with our T2V-Turbo-v2:

Download the unet_mg.pt of our T2V-Turbo-v2.
Launch the gradio demo with the following command:

python app.py \
  --unet_dir unet_mg.pt PATH_TO_VideoCrafter2_MODEL_CKPT \
  --base_model_dir PATH_TO_VideoCrafter2_MODEL_CKPT \
  --version v2 \
  --motion_gs 0.0

We also provide the unet trained without augmenting teacher ODE solver with guidance. To play with it, please follow the steps below:

Download the unet_no_mg.pt of our T2V-Turbo-v2.
Launch the gradio demo with the following command:

python app.py \
  --unet_dir unet_mg.pt PATH_TO_VideoCrafter2_MODEL_CKPT \
  --base_model_dir PATH_TO_VideoCrafter2_MODEL_CKPT \
  --version v2 \
  --motion_gs 0.0

T2V-Turbo

To play with our T2V-Turbo (VC2), please follow the steps below:

Download the unet_lora.pt of our T2V-Turbo (VC2) here.
Launch the gradio demo with the following command:

python app.py \
  --unet_dir PATH_TO_UNET_LORA.pt \
  --base_model_dir PATH_TO_VideoCrafter2_MODEL_CKPT \
  --version v1

To play with our T2V-Turbo (MS), please follow the steps below:

Download the unet_lora.pt of our T2V-Turbo (MS) here.
Launch the gradio demo with the following command:

python app_ms.py --unet_dir PATH_TO_UNET_LORA.pt

🏋️ Training

T2V-Turbo-v2

Run the following command:

bash train_t2v_turbo_v2.sh

T2V-Turbo

To train T2V-Turbo (VC2), first prepare the data and model as below

Download the model checkpoint of VideoCrafter2 here.
Prepare the WebVid-10M data. Save in the webdataset format.
Download the InternVid2 S2 Model
Set --pretrained_model_path, --train_shards_path_or_url and video_rm_ckpt_dir accordingly in train_t2v_turbo_vc2.sh.

Then run the following command:

bash train_t2v_turbo_v1.sh

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
assets		assets
configs		configs
data		data
intern_vid2		intern_vid2
lvdm		lvdm
model_scope		model_scope
ode_solver		ode_solver
pipeline		pipeline
preprocess_scripts		preprocess_scripts
reward_fn		reward_fn
scheduler		scheduler
utils		utils
viclip		viclip
.gitignore		.gitignore
README.md		README.md
app.py		app.py
app_ms.py		app_ms.py
cog.yaml		cog.yaml
inverse_ddim.py		inverse_ddim.py
motion_prior_sample.py		motion_prior_sample.py
predict.py		predict.py
style.css		style.css
train_latent_t2v_turbo_v2.py		train_latent_t2v_turbo_v2.py
train_t2v_turbo_v1.sh		train_t2v_turbo_v1.sh
train_t2v_turbo_v1_lora.py		train_t2v_turbo_v1_lora.py
train_t2v_turbo_v2.sh		train_t2v_turbo_v2.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

T2V-Turbo

🔔 News

Fast and High-Quality Text-to-video Generation 🚀

16-Step Results of T2V-Turbo-v2

4-Step Results of T2V-Turbo

8-Step Results of T2V-Turbo

🏭 Installation

🛞 Model Checkpoints

🚀 Inference

T2V-Turbo-v2

T2V-Turbo

🏋️ Training

T2V-Turbo-v2

T2V-Turbo

About

Releases

Packages

Contributors 3

Languages

Ji4chenLi/t2v-turbo

Folders and files

Latest commit

History

Repository files navigation

T2V-Turbo

🔔 News

Fast and High-Quality Text-to-video Generation 🚀

16-Step Results of T2V-Turbo-v2

4-Step Results of T2V-Turbo

8-Step Results of T2V-Turbo

🏭 Installation

🛞 Model Checkpoints

🚀 Inference

T2V-Turbo-v2

T2V-Turbo

🏋️ Training

T2V-Turbo-v2

T2V-Turbo

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages