Sketch-guided Image Inpainting with Partial Discrete Diffusion Process

Official implementation of the paper "Sketch-guided Image Inpainting with Partial Discrete Diffusion Process" (CVPR-W 2024).

Setting up the Environment

We provide an env.yml file to set up the conda environment. Please install it as follows:

conda env create -f env.yml
conda activate sketch

Downloading the Data

Please download the sketches and accompanying data from here and extract the folder into the repo's root directory. The tokenized images for the train and validation splits can be downloaded from here; these should also be placed in the root directory (if you download them to a different location, make sure to update the paths in configs/sketch_coco_inpainting.yaml).

After this, please download the train and val splits of the MS-COCO images, extract them, and put them under data/train/images and data/validation/images, respectively.

The data directory should now look like:

data
|----train
|    |----contours_seg_resized
|    |----images
|    |----bbox_annotations.json
|    |----bbox.json
|    |----labels.json
|
|----validation
|    |----contours_seg_resized
|    |----images
|    |----bbox_annotations.json
|    |----bbox.json
|    |----labels.json
|
|----info.json
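
Before training, it can help to confirm that the layout matches the tree above. The following is a minimal sketch and not part of the repository: the helper name missing_entries is hypothetical, and it assumes the data folder sits in the current working directory. It only checks that each expected path exists, not that the contents are valid.

```python
from pathlib import Path

# Entries expected inside each split, copied from the directory tree above.
SPLIT_ENTRIES = [
    "contours_seg_resized",
    "images",
    "bbox_annotations.json",
    "bbox.json",
    "labels.json",
]
SPLITS = ["train", "validation"]


def missing_entries(data_root="data"):
    """Return the expected paths that do not exist under data_root (hypothetical helper)."""
    root = Path(data_root)
    expected = [root / split / name for split in SPLITS for name in SPLIT_ENTRIES]
    expected.append(root / "info.json")
    return [str(path) for path in expected if not path.exists()]


if __name__ == "__main__":
    missing = missing_entries()
    if missing:
        print("Missing entries:")
        for path in missing:
            print("  " + path)
    else:
        print("Data directory layout looks complete.")
```

Run it from the repo's root directory. If you placed the data elsewhere, pass that location as data_root.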

Training the Model

Begin by downloading the VQ-VAE openimages-f8-8192 checkpoint from here and save it as checkpoints/taming_f8_8192_openimages_last.pth.

To start the training, run:

CUDA_VISIBLE_DEVICES=0,1 python train.py --config_file configs/sketch_coco_inpainting.yaml \
    --name sketch_inp_diff \
    --num_node 1
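
If you prefer to launch training from Python, the command above can be wrapped as shown below. This is a rough sketch under stated assumptions, not something shipped with the repository: build_train_command and launch are hypothetical helpers, the flags are exactly those of the shell command, and the pre-flight check assumes the config and VQ-VAE checkpoint live at the paths given in this README.

```python
import os
import subprocess
import sys
from pathlib import Path

# Paths taken from this README; adjust them if your files live elsewhere.
CONFIG = "configs/sketch_coco_inpainting.yaml"
VQVAE_CKPT = "checkpoints/taming_f8_8192_openimages_last.pth"


def build_train_command(name="sketch_inp_diff", num_node=1):
    """Assemble the same train.py invocation as the shell command (hypothetical helper)."""
    return [
        sys.executable, "train.py",
        "--config_file", CONFIG,
        "--name", name,
        "--num_node", str(num_node),
    ]


def launch(gpu_ids="0,1"):
    """Check that the required files exist, then start training on the given GPUs."""
    for path in ("train.py", CONFIG, VQVAE_CKPT):
        if not Path(path).exists():
            raise FileNotFoundError(path)
    # Equivalent to prefixing the shell command with CUDA_VISIBLE_DEVICES=0,1.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=gpu_ids)
    return subprocess.run(build_train_command(), env=env, check=True)
```

Call launch() from the repo's root directory. Failing early on a missing checkpoint or config avoids finding out about it only after the training script has started.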

The pre-trained model can be downloaded from here.

Launching the Gradio Demo

After downloading the VQ-VAE as instructed in the previous section, run the following command to launch a Gradio demo:

python gradio_demo.py --config_path configs/sketch_coco_inpainting.yaml \
    --ckpt_path <PATH_TO_CHECKPOINT>

Cite Us

If you find our work helpful, please consider citing us:

@InProceedings{Sharma_2024_CVPR,
    author    = {Sharma, Nakul and Tripathi, Aditay and Chakraborty, Anirban and Mishra, Anand},
    title     = {Sketch-guided Image Inpainting with Partial Discrete Diffusion Process},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2024},
    pages     = {6024-6034}
}

Acknowledgements

We would like to thank the authors of VQ-Diffusion and Taming Transformers for open-sourcing their code and checkpoints!
