FreeVS: Generative View Synthesis on Free Driving Trajectory

Official implementation of [ICLR2025] FreeVS: Generative View Synthesis on Free Driving Trajectory.

Qitai Wang, Lue Fan, Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang

[arXiv] [Project page]

Recent updates

[2025/02/11] Implementation of FreeVS on Waymo Open Dataset is released.
[2025/01/23] 🎉 FreeVS was accepted to ICLR 2025!

To Do

Implementation on nuScenes
Provide a 3D prior based on estimated depth where LiDAR observations are missing, to keep distant background regions consistent.

Prerequisite

conda create -n freevs python=3.8
conda activate freevs

cd diffusers
pip install .
pip install -r requirements.txt
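
After installation, a quick sanity check can confirm that the interpreter is at least the version used above and that the locally installed `diffusers` package is importable. This is a minimal sketch; `check_environment` is a hypothetical helper, not part of the repository.

```python
import importlib.util
import sys


def check_environment(required=(3, 8), packages=("diffusers",)):
    """Return a dict reporting whether the Python version and packages look usable."""
    report = {"python_ok": sys.version_info[:2] >= tuple(required)}
    for name in packages:
        # find_spec returns None when a top-level package is not installed.
        report[name] = importlib.util.find_spec(name) is not None
    return report


for key, ok in check_environment().items():
    print(f"{key}: {'ok' if ok else 'MISSING'}")
```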

Waymo Open Dataset

Quick start with examples

Download a trained model checkpoint and several processed example scenes. Please check the License Agreement of the Waymo Open Dataset (WOD) before downloading this checkpoint.

cd diffusers
pip install huggingface_hub

huggingface-cli download Esdolo/FreeVS_WOD --local-dir ./pretrained/FreeVS_WOD/

huggingface-cli download Esdolo/FreeVS_Examples --local-dir ./waymo_process/FreeVS_Examples/

cd waymo_process/FreeVS_Examples
tar -xzf FreeVS_Examples.tar.gz
cd ../..
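
On systems without a `tar` binary, the extraction step above can be done with Python's standard library instead. The sketch below assumes the archive sits at `waymo_process/FreeVS_Examples/FreeVS_Examples.tar.gz`, as in the commands above; `extract_examples` is a hypothetical helper name.

```python
import tarfile
from pathlib import Path


def extract_examples(archive, dest=None):
    """Extract a .tar.gz archive next to itself (or into dest); return the entries in dest."""
    archive = Path(archive)
    dest = Path(dest) if dest is not None else archive.parent
    dest.mkdir(parents=True, exist_ok=True)
    # Use the safer "data" extraction filter when this Python version supports it.
    kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
    with tarfile.open(archive, "r:gz") as tar:
        tar.extractall(dest, **kwargs)
    return sorted(p.name for p in dest.iterdir())
```

For example, `extract_examples("waymo_process/FreeVS_Examples/FreeVS_Examples.tar.gz")` run from the `diffusers` directory mirrors the `tar -xzf` call.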

Run inference with example scenes:

python examples/freevs/inference_svd.py --front_only --model_path pretrained/FreeVS_WOD/ --img_pickle waymo_process/FreeVS_Examples/waymo_example_newtraj.pkl  --output_dir rendered_waymo_example_newtraj

python examples/freevs/inference_svd.py --front_only --model_path pretrained/FreeVS_WOD/ --img_pickle waymo_process/FreeVS_Examples/waymo_example_origintraj.pkl  --output_dir rendered_waymo_example_origintraj

Results synthesized on the original and new trajectories are written to rendered_waymo_example_origintraj and rendered_waymo_example_newtraj, respectively.
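
The two example commands differ only in the trajectory name, so they can be generated from one template. The sketch below only builds and prints the commands; `build_inference_cmd` is a hypothetical helper, and the paths are the ones used in the commands above.

```python
import shlex

EXAMPLE_DIR = "waymo_process/FreeVS_Examples"


def build_inference_cmd(traj, model_path="pretrained/FreeVS_WOD/"):
    """Build the argv list for one example trajectory ('newtraj' or 'origintraj')."""
    if traj not in ("newtraj", "origintraj"):
        raise ValueError(f"unknown trajectory: {traj}")
    return [
        "python", "examples/freevs/inference_svd.py",
        "--front_only",
        "--model_path", model_path,
        "--img_pickle", f"{EXAMPLE_DIR}/waymo_example_{traj}.pkl",
        "--output_dir", f"rendered_waymo_example_{traj}",
    ]


for traj in ("newtraj", "origintraj"):
    # Only prints the command; pass the list to subprocess.run(cmd, check=True) to execute it.
    print(shlex.join(build_inference_cmd(traj)))
```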

Prepare Waymo GT images / pseudo-images

cd waymo_process

#|-- <path to WOD>
#     |--*.tfrecord
#     |...

# Extract images from .tfrecord files
python extract_gt_images.py --waymo_raw_dir <path to WOD> --output_dir waymo_gtimg_5hz_allseg --interval 2

# Generate pseudo-images
python lidarproj_halfreso_multiframe.py --waymo_raw_dir <path to WOD> --output_dir waymo_pseudoimg_multiframe --interval 2

# Generating pseudo-images is time-consuming. You can also use the multiprocess script:
bash gen_pseudo_img.bash

cd ..

# Generate pickle info file
python data_process/waymo_data_generation_subsegbycampos_multiframe.py --data_root waymo_process/waymo_gtimg_5hz_allseg/ --pseudoimg_root waymo_process/waymo_pseudoimg_multiframe/ --output_pickle waymo_process/waymo_multiframe_subsegbycampos.pkl
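
The `5hz` in the output directory name follows from `--interval 2`: WOD frames are recorded at 10 Hz, so keeping every second frame yields 5 Hz. Below is a minimal sketch of that arithmetic, assuming `--interval` acts as a simple stride starting at the first frame (the scripts' exact indexing is not documented here); `subsample` is a hypothetical helper.

```python
def subsample(num_frames, interval=2, source_hz=10.0):
    """Return the kept frame indices and the effective frame rate for a given stride."""
    if interval < 1:
        raise ValueError("interval must be a positive integer")
    kept = list(range(0, num_frames, interval))  # assumption: stride starts at frame 0
    return kept, source_hz / interval


kept, hz = subsample(200)  # a 20-second segment holds roughly 200 frames
print(len(kept), hz)       # 100 frames at 5.0 Hz
```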

(Recommended) Additional pseudo-images for camera transformation simulation

cd waymo_process

python lidarproj_halfreso_multiframe_mismatchframeaug.py --waymo_raw_dir <path to WOD> --output_dir waymo_pseudoimg_multiframe_+4frame --interval 2 --mismatchnframe 4

python lidarproj_halfreso_multiframe_mismatchframeaug.py --waymo_raw_dir <path to WOD> --output_dir waymo_pseudoimg_multiframe_-4frame --interval 2 --mismatchnframe -4

cd ..

python data_process/waymo_data_generation_subsegbycampos_multiframe.py --data_root waymo_process/waymo_gtimg_5hz_allseg/ --pseudoimg_root waymo_process/waymo_pseudoimg_multiframe/ --transformation_simulation --pseudoimg_root_2 waymo_process/waymo_pseudoimg_multiframe_+4frame/ --pseudoimg_root_3 waymo_process/waymo_pseudoimg_multiframe_-4frame/ --output_pickle waymo_process/waymo_multiframe_subsegbycampos_transform_simulation.pkl
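
The two augmentation runs above differ only in the sign of `--mismatchnframe`, and the output directory name encodes the same signed offset. The sketch below builds both commands from one template; `mismatch_cmd` is a hypothetical helper and `/path/to/WOD` is a placeholder for your dataset root.

```python
def mismatch_cmd(offset, waymo_raw_dir, interval=2):
    """Build the argv list for one frame-mismatch augmentation run."""
    # The '+d' format keeps the explicit sign, giving '+4' and '-4' in the directory name.
    out_dir = f"waymo_pseudoimg_multiframe_{offset:+d}frame"
    return [
        "python", "lidarproj_halfreso_multiframe_mismatchframeaug.py",
        "--waymo_raw_dir", waymo_raw_dir,
        "--output_dir", out_dir,
        "--interval", str(interval),
        "--mismatchnframe", str(offset),
    ]


for offset in (4, -4):
    # Run from waymo_process/, e.g. with subprocess.run(cmd, check=True).
    print(" ".join(mismatch_cmd(offset, "/path/to/WOD")))
```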

Train SVD

We initialize the SVD model from https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt. Download it to pretrained/stable-video-diffusion-img2vid-xt.
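
One way to fetch the base model is `snapshot_download` from `huggingface_hub`. This is a sketch rather than the authors' procedure: it assumes the target directory is relative to the `diffusers` folder (as with the checkpoint download earlier), and access to the model may require logging in to Hugging Face first. The helper names are hypothetical.

```python
SVD_URL = "https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt"


def repo_id_from_url(url):
    """Turn a Hugging Face model URL into the 'owner/name' repo id."""
    prefix = "https://huggingface.co/"
    if not url.startswith(prefix):
        raise ValueError(f"not a Hugging Face URL: {url}")
    return url[len(prefix):].strip("/")


def download_base_model(url=SVD_URL,
                        local_dir="pretrained/stable-video-diffusion-img2vid-xt"):
    """Download the full model repository into local_dir and return its path."""
    # Imported lazily so the URL helper above works without huggingface_hub installed.
    from huggingface_hub import snapshot_download
    return snapshot_download(repo_id=repo_id_from_url(url), local_dir=local_dir)
```

Calling `download_base_model()` from the `diffusers` directory places the weights where the training scripts are assumed to look for them.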

# On WOD, we recommend training the diffuser model with a frozen pseudo-image encoder, which can significantly accelerate model convergence.
# We provide a pseudo-image encoder checkpoint in diffusers/pretrained/.
# Turn on --gradient_checkpointing to reduce memory usage if needed.
bash examples/freevs/scripts/run_train_onlyunet.sh

# Script for jointly training the pseudo-image encoder and the diffuser
bash examples/freevs/scripts/run_train.sh

Run inference

python examples/freevs/inference_svd.py --model_path work_dirs/freevs_waymo_halfreso_multiframe_transformation_simulate_trainunet --img_pickle waymo_process/waymo_multiframe_subsegbycampos_transform_simulation.pkl --output_dir rendered_waymo_origin

To control the camera pose for novel trajectory simulation, modify the camera extrinsics in waymo_process/lidarproj_halfreso_multiframe.py. We provide an example of camera pose editing in waymo_process/scene_modify_example/lidarproj_halfreso_multiframe_democases_1250_camposedit.py.
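
As an illustration of what a camera pose edit can look like, the sketch below offsets the translation of a 4x4 extrinsic matrix. It assumes the extrinsic is a camera-to-vehicle transform in the WOD vehicle frame (x forward, y left, z up); how the repository's scripts actually store and edit the pose is not shown here, and `shift_camera` is a hypothetical helper.

```python
import copy


def shift_camera(extrinsic, dx=0.0, dy=0.0, dz=0.0):
    """Return a copy of a 4x4 camera-to-vehicle matrix with its translation offset."""
    if len(extrinsic) != 4 or any(len(row) != 4 for row in extrinsic):
        raise ValueError("expected a 4x4 matrix")
    shifted = copy.deepcopy(extrinsic)  # leave the caller's matrix untouched
    for row, delta in zip(shifted[:3], (dx, dy, dz)):
        row[3] += delta  # the last column holds the camera position
    return shifted


# Illustrative pose: identity rotation, camera 1.5 m forward and 2.1 m up.
front = [
    [1.0, 0.0, 0.0, 1.5],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 2.1],
    [0.0, 0.0, 0.0, 1.0],
]
moved = shift_camera(front, dy=1.0)  # move the camera 1 m to the left
print(moved[1][3])
```

Only the translation changes here; rotating the view would additionally require editing the upper-left 3x3 block.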

Citation

@article{wang2024freevs,
  title={Freevs: Generative view synthesis on free driving trajectory},
  author={Wang, Qitai and Fan, Lue and Wang, Yuqi and Chen, Yuntao and Zhang, Zhaoxiang},
  journal={arXiv preprint arXiv:2410.18079},
  year={2024}
}

Acknowledgement

Many thanks to the following open-source projects:
