This repo contains a PyTorch implementation of the generative model proposed in Towards a Neural Statistician (Edwards and Storkey, ICLR 2017). The implementation contains our replication of all experiments in the paper, together with an extension that allows generating datasets conditioned on labels.
Train a model for the synthetic experiment:
python train.py --experiment synthetic --lr 1e-3 --num_epochs 50 --context_dim 3
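In the paper, each synthetic dataset consists of i.i.d. samples from a one-dimensional family (exponential, Gaussian, uniform, or Laplace) with a randomly drawn mean and variance. A minimal NumPy sketch of generating such data (the function and the sampling ranges below are our own illustration, not this repo's data loader):

import numpy as np

FAMILIES = ["exponential", "gaussian", "uniform", "laplace"]

def sample_dataset(num_samples=200, rng=np.random):
    # Pick a family and random moments for this dataset.
    family = rng.choice(FAMILIES)
    mean = rng.uniform(-1.0, 1.0)
    var = rng.uniform(0.5, 2.0)
    if family == "gaussian":
        x = rng.normal(mean, np.sqrt(var), num_samples)
    elif family == "exponential":
        # scale = sqrt(var) gives the target variance; shift to the target mean.
        scale = np.sqrt(var)
        x = rng.exponential(scale, num_samples) - scale + mean
    elif family == "uniform":
        # Uniform on [mean - w, mean + w] has variance w^2 / 3.
        w = np.sqrt(3.0 * var)
        x = rng.uniform(mean - w, mean + w, num_samples)
    else:
        # Laplace with scale b has variance 2 b^2.
        x = rng.laplace(mean, np.sqrt(var / 2.0), num_samples)
    return family, mean, var, x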
Test a model:
python test_synthetic.py --model_name path_to_your_model
The test script saves the mean contexts, distribution families, means, and variances for each sampled dataset as NumPy arrays.
Our visualisation of these NumPy arrays:
The following image shows 3-D scatter plots of the learned summary statistics. Each point is the mean of the approximate posterior over the context for one sampled dataset. The left plot shows points colored by distribution family, the center plot by mean, and the right plot by variance. The plots have been rotated to illustrative angles.
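A minimal matplotlib sketch of producing such plots from the saved arrays (the .npy file names below are placeholders for whatever test_synthetic.py wrote out):

import numpy as np
import matplotlib.pyplot as plt
from mpl_toolkits.mplot3d import Axes3D  # registers the 3-D projection on older matplotlib

# Placeholder file names; substitute the arrays saved by test_synthetic.py.
contexts = np.load("contexts.npy")      # (num_datasets, 3) posterior means over context
families = np.load("families.npy")      # distribution family id per dataset
means = np.load("means.npy")            # mean per dataset
variances = np.load("variances.npy")    # variance per dataset

fig = plt.figure(figsize=(15, 5))
for i, (colors, title) in enumerate(zip([families, means, variances],
                                        ["family", "mean", "variance"])):
    ax = fig.add_subplot(1, 3, i + 1, projection="3d")
    ax.scatter(contexts[:, 0], contexts[:, 1], contexts[:, 2], c=colors, s=5)
    ax.set_title(f"colored by {title}")
plt.show()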
Train a model for the spatial MNIST experiment:
python train.py --experiment mnist --num_epochs 100 --context_dim 64 --num_stochastic_layers 3 --z_dim 2 --x_dim 2 --h_dim 2
Test a model:
Since we only sample digits conditioned on the inputs, no separate test command is required.
The spatial MNIST digits sampled conditioned on the inputs represent sensible subsets of a dataset:

Here the blue and red dots are the input digits (the red dots marking the 6-sample summaries), and the orange digits are samples from spatial MNIST conditioned on the inputs.
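Spatial MNIST, as constructed in the paper, turns each 28x28 digit into a set of 50 two-dimensional points sampled in proportion to pixel intensity, which is why --x_dim 2 is used above. A minimal NumPy sketch of the conversion (our own illustration, not this repo's loader):

import numpy as np

def to_spatial(image, num_points=50, rng=np.random):
    # image: (28, 28) array of pixel intensities in [0, 1].
    probs = image.flatten() / image.sum()
    # Sample pixel locations with replacement, weighted by intensity.
    idx = rng.choice(image.size, size=num_points, p=probs)
    ys, xs = np.unravel_index(idx, image.shape)
    # Uniform noise spreads the points within each pixel.
    return np.stack([xs, ys], axis=1) + rng.uniform(0.0, 1.0, (num_points, 2))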
Train a model for the OMNIGLOT experiment:
python train.py --experiment omniglot --nll bernoulli --num_data_per_dataset 5 --num_epochs 400 \
--context_dim 512 --masked --z_dim 16 --h_dim 4096 --batch_size 32 --lr 0.0001 --tensorboard
To sample from a trained model, conditioned on unseen OMNIGLOT classes:
python test_omniglot.py --experiment omniglot --num_data_per_dataset 5 --num_samples_per_dataset 5 \
--context_dim 512 --masked --z_dim 16 --h_dim 4096 --batch_size 16 --model_name your_model_name
Our samples from the trained model for unseen OMNIGLOT classes:
Few-shot learning on unseen OMNIGLOT classes. Left rows are input sets; right rows are samples given the inputs. The results were not cherry-picked.
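Conceptually, conditional sampling encodes the input set into the approximate posterior over the context and decodes fresh samples from it. A heavily simplified PyTorch sketch of the idea (the sub-module names statistic_network, latent_prior, and decoder are placeholders, not this repo's exact API):

import torch

@torch.no_grad()
def sample_conditioned(model, inputs, num_samples=5):
    # inputs: (set_size, C, H, W) images from a single unseen class.
    c_mean, c_logvar = model.statistic_network(inputs.unsqueeze(0))  # q(c | D)
    c = c_mean + torch.randn_like(c_mean) * (0.5 * c_logvar).exp()   # reparameterised draw
    c = c.expand(num_samples, -1)
    z_mean, z_logvar = model.latent_prior(c)                         # p(z | c)
    z = z_mean + torch.randn_like(z_mean) * (0.5 * z_logvar).exp()
    return model.decoder(z, c)                                       # means of p(x | z, c)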
To sample from a trained model, conditioned on MNIST:
python test_omniglot.py --experiment omniglot --num_data_per_dataset 5 --num_samples_per_dataset 5 \
--context_dim 512 --masked --z_dim 16 --h_dim 4096 --batch_size 16 --model_name your_model_name --test_mnist
Our samples from the trained model for MNIST:
Few-shot learning from OMNIGLOT to MNIST. Left rows are input sets; right rows are samples given the inputs. The results were not cherry-picked.
To get classification accuracy for a few-shot learning task with M classes using K samples per class:
python test_omniglot_accuracy.py --experiment omniglot --num_data_per_dataset K --num_classes M \
--context_dim 512 --masked --z_dim 16 --h_dim 4096 --batch_size 20 \
--model_name your_model_name
To get results on the MNIST dataset, include the --test_mnist option.
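The scheme follows the paper's idea of classifying in context space: summarise each class's K support images into a context and assign a query to the nearest class context. A simplified sketch under that assumption (the statistic_network helper is hypothetical, and the actual script may score classes differently):

import torch

@torch.no_grad()
def few_shot_accuracy(model, support, queries, labels):
    # support: list of M tensors, each (K, C, H, W) holding one class's images.
    # queries: (N, C, H, W) test images; labels: (N,) ground-truth class indices.
    class_c = torch.stack([model.statistic_network(s.unsqueeze(0))[0].squeeze(0)
                           for s in support])                  # (M, context_dim)
    query_c = torch.stack([model.statistic_network(q.unsqueeze(0).unsqueeze(0))[0].squeeze(0)
                           for q in queries])                  # each query as a singleton set
    dists = torch.cdist(query_c, class_c)                      # (N, M) Euclidean distances
    preds = dists.argmin(dim=1)
    return (preds == labels).float().mean().item()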
The table below compares the results obtained with our implementation against those reported in the paper.
Train a model for the YouTube Faces experiment:
python3 train.py --data_dir $YOUTUBE_DATABASE_DIRECTORY$ --z_dim 16 --context_dim 500 --h_dim 4096 --num_epochs 10 --num_data_per_dataset 5 --experiment youtube --x_dim 12288
To sample new frames conditioned on input frames:
python3 test_youtube.py --data_dir $YOUTUBE_DATABASE_DIRECTORY$ --z_dim 16 --context_dim 500 --num_data_per_dataset 5 --num_samples_per_dataset 5 --experiment youtube --h_dim 4096 --x_dim 12288 --test_conditioned --model_dir model_params --model_name youtube_20:02:2020_15:17:49/last
Change the --model_name option to point to your latest trained model.
Frames (five rightmost images) sampled conditioned on input frames (five leftmost images).
To sample new frames from a context sampled from the prior (i.e. generate new faces), remove the --test_conditioned option.
Frames obtained from a context sampled from a prior with zero mean and unit variance.
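A simplified PyTorch sketch of this unconditional sampling (module names are placeholders): draw the context from a standard normal prior, then decode through the latent layers.

import torch

@torch.no_grad()
def sample_from_prior(model, num_samples=5, context_dim=500):
    c = torch.randn(num_samples, context_dim)            # c ~ N(0, I)
    z_mean, z_logvar = model.latent_prior(c)             # p(z | c)
    z = z_mean + torch.randn_like(z_mean) * (0.5 * z_logvar).exp()
    frames = model.decoder(z, c)                         # means of p(x | z, c)
    # x_dim 12288 corresponds to 3 x 64 x 64 RGB frames (an assumption about the layout).
    return frames.view(num_samples, 3, 64, 64)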
To condition on a label, we slightly changed the proposed graphical model by introducing an observed variable y:
The code can be found in the movements branch of our repo.
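One simple way to realise the observed variable y is to embed the label and concatenate it to the pooled set statistic before producing the context. A minimal PyTorch sketch of that idea (our own illustration, not the exact movements-branch code):

import torch
import torch.nn as nn

class LabelConditionedContext(nn.Module):
    # Produces q(c | D, y) by concatenating a learned label embedding
    # to the pooled statistic of the input set.
    def __init__(self, stat_dim=1000, context_dim=500, num_labels=8, label_dim=32):
        super().__init__()
        self.embed = nn.Embedding(num_labels, label_dim)
        self.mean = nn.Linear(stat_dim + label_dim, context_dim)
        self.logvar = nn.Linear(stat_dim + label_dim, context_dim)

    def forward(self, stat, y):
        # stat: (batch, stat_dim) pooled set statistic; y: (batch,) integer labels.
        h = torch.cat([stat, self.embed(y)], dim=1)
        return self.mean(h), self.logvar(h)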
Train a label-conditioned model:
python train.py --data_dir=emotions_resized/ --z_dim=16 --context_dim=500 \
--h_dim=4096 --num_epochs=5000 --num_data_per_dataset=6 --experiment=youtube --x_dim=4096 \
--n_channels=1 --total_num_persons=123 --train_num_persons=113 --test_num_persons=5 \
--use_labels --num_labels 8 --tensorboard
Note that we kept the --experiment=youtube option because the task closely resembles the YouTube Faces experiment.
Sample from a trained model:
python test_youtube.py --data_dir=emotions_resized/ --z_dim=16 --context_dim=500 \
--h_dim=4096 --num_data_per_dataset=6 --experiment=youtube --x_dim=4096 \
--n_channels=1 --total_num_persons=123 --train_num_persons=113 --test_num_persons=5 \
--use_labels --num_labels 8 --model_name your_model_name
To train our model we used the CK+ emotions database.
Sample faces conditioned on emotion label: