Added scene image generation for Open Images #127

manolo-lolo · 2021-12-08T16:52:09Z

Adds the code for scene image generation as done in High-Resolution Complex Scene Synthesis with Transformers. Adds the Open Images dataset (COCO had already been added in this PR). Also adds pre-trained models and provides the first 100 layouts so that people can try it out easily.

I did not update the project main page to announce this new piece of code; this should probably be done before merging.

Announcing scene image generation properly @rromb and @manolo-lolo

Proposed announcement for the project page:

Scene Image Synthesis

Scene image generation based on bounding box conditionals as done in High-Resolution Complex Scene Synthesis with Transformers (see talk on workshop page). Supports the COCO and Open Images datasets.

Training

Download first-stage models COCO-8k-VQGAN for COCO or COCO/Open-Images-8k-VQGAN for Open Images.
Change ckpt_path in configs/coco_scene_images_transformer.yaml and configs/open_images_scene_images_transformer.yaml to point to the downloaded first-stage models.
Download the full COCO/OI datasets and adapt data_path in the same files, unless working with the 100 files provided for training and validation suits your needs already.
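
The config edits above can also be scripted. The following is a minimal sketch, assuming the YAML file has already been loaded into nested dicts and lists (for example with PyYAML or OmegaConf, neither of which is used here) and that ckpt_path and data_path may sit at any nesting depth. The config layout and all paths in the example are hypothetical and not taken from the repository.

```python
from typing import Any


def set_key_everywhere(node: Any, key: str, value: Any) -> int:
    """Set every occurrence of `key` in a nested dict/list; return how many were set."""
    changed = 0
    if isinstance(node, dict):
        for k in node:
            if k == key:
                node[k] = value  # overwrite in place; dict size does not change
                changed += 1
            else:
                changed += set_key_everywhere(node[k], key, value)
    elif isinstance(node, list):
        for item in node:
            changed += set_key_everywhere(item, key, value)
    return changed


# Hypothetical config layout, for illustration only.
config = {
    "model": {"params": {"first_stage_config": {"params": {"ckpt_path": "old.ckpt"}}}},
    "data": {
        "params": {
            "train": {"params": {"data_path": "old_data"}},
            "validation": {"params": {"data_path": "old_data"}},
        }
    },
}

n_ckpt = set_key_everywhere(config, "ckpt_path", "/path/to/first_stage.ckpt")
n_data = set_key_everywhere(config, "data_path", "/path/to/dataset")
print(n_ckpt, n_data)
```

If a config contains several ckpt_path entries that should point to different checkpoints, a blanket replacement like this is too coarse and the entries should be edited individually.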

Code can be run with
python main.py --base configs/coco_scene_images_transformer.yaml -t True --gpus 0,
or
python main.py --base configs/open_images_scene_images_transformer.yaml -t True --gpus 0,

Sampling

Train a model as described above or download a pre-trained model:

Open Images 1 billion parameter model, trained for 100 epochs. On 256x256 pixels: FID 41.48±0.21, SceneFID 14.60±0.15, Inception Score 18.47±0.27. The model was trained with 2d crops of images and is thus well-prepared for the task of generating high-resolution images, e.g. 512x512.
Open Images distilled version of the above model with 125 million parameters, which allows for sampling on smaller GPUs (4 GB is enough for sampling 256x256 px images). The model was trained for 60 epochs with 10% soft loss and 90% hard loss. On 256x256 pixels: FID 43.07±0.40, SceneFID 15.93±0.19, Inception Score 17.23±0.11.
COCO 30 epochs
COCO 60 epochs (find model statistics for both COCO versions in assets/coco_scene_images_training.svg)
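
To illustrate the "10% soft loss, 90% hard loss" weighting mentioned for the distilled model, here is a rough sketch of one common way to combine the two terms for a single token prediction. Only the 10/90 weighting comes from the description above. The concrete form of the soft loss (cross-entropy against the teacher's softmax output, no temperature) and the function names are assumptions and may differ from what was actually used in training.

```python
import math
from typing import List


def log_softmax(logits: List[float]) -> List[float]:
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def distillation_loss(
    student_logits: List[float],
    teacher_logits: List[float],
    target: int,
    soft_weight: float = 0.1,
) -> float:
    """Weighted sum of a soft (teacher) and a hard (ground-truth) cross-entropy."""
    log_p_student = log_softmax(student_logits)
    p_teacher = [math.exp(x) for x in log_softmax(teacher_logits)]
    # Soft loss: cross-entropy between teacher distribution and student prediction.
    soft = -sum(pt * lps for pt, lps in zip(p_teacher, log_p_student))
    # Hard loss: cross-entropy against the ground-truth token index.
    hard = -log_p_student[target]
    return soft_weight * soft + (1.0 - soft_weight) * hard


# Toy example with a 3-entry vocabulary; the numbers are made up.
loss = distillation_loss([1.0, 0.5, -0.2], [2.0, 0.1, -1.0], target=0)
print(f"combined loss: {loss:.4f}")
```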

When using a downloaded pre-trained model, remember to change ckpt_path in configs/*project.yaml to point to your downloaded first-stage model (see Training above).
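
As a small convenience, the config files matching configs/*project.yaml can be located with a glob. This is only a sketch: it assumes the configs directory sits directly inside the downloaded model directory, and the helper name and example path are hypothetical.

```python
import glob
import os
from typing import List


def find_project_configs(model_dir: str) -> List[str]:
    """Return all files matching <model_dir>/configs/*project.yaml, sorted by name."""
    pattern = os.path.join(model_dir, "configs", "*project.yaml")
    return sorted(glob.glob(pattern))


# Example call; prints an empty list if the directory does not exist.
print(find_project_configs("/path/to/pretrained/model"))
```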

Scene image generation can be run with
python scripts/make_scene_samples.py --outdir=/some/outdir -r /path/to/pretrained/model --resolution=512,512
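
When sampling at several resolutions, the command line can be assembled programmatically. The sketch below only uses the flags shown in the command above (--outdir, -r, --resolution); the helper name is hypothetical, and whether the two resolution values are ordered as width,height or height,width is not stated here, so they are simply passed through in the given order.

```python
from typing import List, Tuple


def build_sampling_command(
    outdir: str, model_path: str, resolution: Tuple[int, int]
) -> List[str]:
    """Assemble the argv list for scripts/make_scene_samples.py without running it."""
    first, second = resolution
    return [
        "python",
        "scripts/make_scene_samples.py",
        f"--outdir={outdir}",
        "-r",
        model_path,
        f"--resolution={first},{second}",
    ]


cmd = build_sampling_command("/some/outdir", "/path/to/pretrained/model", (512, 512))
print(" ".join(cmd))
# To actually run it from the repository root, one could use:
#   import subprocess; subprocess.run(cmd, check=True)
```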

rromb · 2022-01-13T14:31:30Z

Great, thanks! I just added the announcement you suggested to the README (see 29b803f).

manolo-lolo added 3 commits December 8, 2021 15:48

Added scene images for Open Images dataset 🏞️

70a17a5

Small fixes for Open Images dataset 🔧

6e6303a

Added Open Images helper 🧰

249e3d9

manolo-lolo requested a review from rromb December 8, 2021 16:52

manolo-lolo added 6 commits December 8, 2021 17:55

Added 100 samples from Open Images for instant sampling 🌇

0d85af6

Fixed naming 🔤

ab8e47d

Fixed naming 🔤

6876a0f

Allow to externally set class number for compatibility

0821d00

Added scene images samples ☕

d91170e

Add parameter default value

2201a7d

rromb merged commit 09298dc into master Jan 13, 2022

rromb added a commit that referenced this pull request Jan 13, 2022

Update README.md

29b803f

Adding the description of scene-synthesis models as proposed in #127.
