PyTorch implementation of VQVAE-2

PyTorch Lightning wrapper for PyTorch used.

VQVAE-2 https://arxiv.org/pdf/1906.00446.pdf

Autoregressive PixelSNAIL https://arxiv.org/pdf/1712.09763.pdf

Implementations based on https://github.com/rosinality/vq-vae-2-pytorch and https://github.com/EugenHotaj/pytorch-generative

Step 1: Learning discrete latent space with VQVAE-2

To train on CIFAR10 with default parameters:

python train_vqvae.py

The VQVAE encoder compresses 3x32x32 images to 8x8 + 4x4 latent codes, which is a reduction of ~97%, using 8-bit color representations and codebook size of 512. The decoder tries to reconstruct the original image from latent codes.

Example reconstructions using default parameters from CIFAR10 test set. Original on top, reconstructions on bottom.

Compared to the originals, the reconstruciton is more blurry.

The top code is lower dimensioned and takes care of more global features, like the general color theme, while the bottom code takes care of details. Below you can see in following order: reconstruction from top code, reconstruction from bottom code, reconstruction and original.

Step 2: Train autoregressive model on latent codes

First make dataset with latent codes encoded by the trained VQVAE-2.

python make_latent_dataset.py

Now, train the autoregressive model

python train_pixelsnail.py

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
LICENSE		LICENSE
README.md		README.md
bottom_top_example.png		bottom_top_example.png
make_latent_dataset.py		make_latent_dataset.py
pixelsnail.py		pixelsnail.py
reconstruction_examples.png		reconstruction_examples.png
train_pixelsnail.py		train_pixelsnail.py
train_vqvae.py		train_vqvae.py
vqvae.py		vqvae.py
vqvae_components.py		vqvae_components.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyTorch implementation of VQVAE-2

Step 1: Learning discrete latent space with VQVAE-2

Step 2: Train autoregressive model on latent codes

About

Releases

Packages

Languages

License

mattiasxu/VQVAE-2

Folders and files

Latest commit

History

Repository files navigation

PyTorch implementation of VQVAE-2

Step 1: Learning discrete latent space with VQVAE-2

Step 2: Train autoregressive model on latent codes

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages