The objective is to explore the paper called Neural Discrete Representation Learning by Aaron van den Oord, Oriol Vinyals, and Koray Kavukcuoglu published in 30 May 2018 arxiv link.
More preciselly, the objective is to understand in depht the proposed method, to re-implement using newer version of python and more reable code to be able to reproduce results and confirm (or not) the results shown in the paper.
The structure of this project is the following one:
- Folder
Modelscontains the VQ-VAEs model for MNIST and CIFAR10 dataset and the PixelCNN models for generating new samples. VQ_VAE.pyis the file containing the main training algorithm and testing for VQ-VAE models.generation.pyis the dile containing the training and testing for generating new samples.utils.pyis a python file full of usefull general funtctions.- It is assumed to have a folder
Datasetscontaining MNIST and CIFA10 dataset download viatorchvision.datasets. - It is assumed to have a folder
savescontaining saves of the different models.