Mutual Information based method for Unsupervised Disentanglement of Video Representations

This is the official implementation of "Mutual Information based method for Unsupervised Disentanglement of Video Representations", accepted for publication at ICPR 2020. The paper will be uploaded to arXiv soon. The code was developed with PyTorch 1.4.0; use the same version to avoid compatibility problems.
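Because the code targets one specific PyTorch release, a quick check before training can save debugging time. The following is a minimal sketch that compares the installed version against 1.4.0. The helper name `matches_expected_version` is hypothetical and not part of this repository.

```python
def matches_expected_version(installed: str, expected: str = "1.4.0") -> bool:
    """Return True if `installed` equals `expected`.

    A local build suffix such as '+cpu' or '+cu100' is ignored.
    """
    return installed.split("+", 1)[0] == expected


if __name__ == "__main__":
    try:
        import torch  # only needed for the actual check
    except ImportError:
        print("PyTorch is not installed.")
    else:
        if matches_expected_version(torch.__version__):
            print("PyTorch version OK:", torch.__version__)
        else:
            print("Warning: expected PyTorch 1.4.0, found", torch.__version__)
```

The comparison is deliberately strict, since this README only states that 1.4.0 was used and makes no claim about other releases.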

To train or test on the Moving dSprites or MPI3D-Real datasets, you need to download the datasets first. To download dSprites, run the following command:

bash download_dsprites.sh

Similarly, for the MPI3D-Real dataset:

bash download_mpi3d_real.sh

Training

Two training scripts are used: one to train the auto-encoder and another to train the LSTM.

To train the auto-encoder on Moving MNIST, run the following command:

python3 train_autoencoder.py --no_color --num_channels 1 --dataset mnist --niters 400

To train the LSTM on Moving MNIST, run the following command, where <checkpoint> is the latest auto-encoder checkpoint:

python3 train_lstm.py --encoder_checkpoint <checkpoint> --dataset mnist --no_color --num_channels 1 --niters 200
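The command above needs the path of the latest auto-encoder checkpoint. This README does not say where checkpoints are written or how they are named, so the sketch below takes both as parameters. The helper `latest_checkpoint`, the default pattern `*.pth`, and the directory name in the usage note are assumptions for illustration only.

```python
from pathlib import Path
from typing import Optional


def latest_checkpoint(directory: str, pattern: str = "*.pth") -> Optional[Path]:
    """Return the most recently modified file matching `pattern`, or None.

    Both the directory and the '*.pth' pattern are assumptions; adjust them
    to wherever the training script actually saves its checkpoints.
    """
    candidates = [p for p in Path(directory).glob(pattern) if p.is_file()]
    if not candidates:
        return None
    # "Latest" here means latest modification time, not highest iteration number.
    return max(candidates, key=lambda p: p.stat().st_mtime)
```

For example, `latest_checkpoint("checkpoints")` (a hypothetical directory name) would return the newest matching file, whose path can then be passed as `--encoder_checkpoint`.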

Similarly, to train on the Moving dSprites dataset:

python3 train_autoencoder.py --dataset dsprites --niters 400

python3 train_lstm.py --encoder_checkpoint <checkpoint> --dataset dsprites --niters 200

Similarly, to train on the Moving MPI3D-Real dataset:

python3 train_autoencoder.py --dataset mpi3d_real --niters 200 --z_dims 10

python3 train_lstm.py --encoder_checkpoint <checkpoint> --dataset mpi3d_real --niters 200 --z_dims 10
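Training is a two-stage process for every dataset: the auto-encoder first, then the LSTM using the resulting checkpoint. The sketch below collects the arguments from the commands in this section into one table and builds the two commands per dataset. The function names and the `dry_run` switch are hypothetical conveniences, not part of the repository, and the checkpoint path still has to be supplied by hand between the two stages.

```python
import subprocess
from typing import Dict, List

# Per-dataset arguments copied from the training commands in this README.
AUTOENCODER_ARGS: Dict[str, List[str]] = {
    "mnist": ["--no_color", "--num_channels", "1", "--dataset", "mnist", "--niters", "400"],
    "dsprites": ["--dataset", "dsprites", "--niters", "400"],
    "mpi3d_real": ["--dataset", "mpi3d_real", "--niters", "200", "--z_dims", "10"],
}
LSTM_ARGS: Dict[str, List[str]] = {
    "mnist": ["--dataset", "mnist", "--no_color", "--num_channels", "1", "--niters", "200"],
    "dsprites": ["--dataset", "dsprites", "--niters", "200"],
    "mpi3d_real": ["--dataset", "mpi3d_real", "--niters", "200", "--z_dims", "10"],
}


def autoencoder_command(dataset: str) -> List[str]:
    """Stage 1: build the auto-encoder training command."""
    return ["python3", "train_autoencoder.py", *AUTOENCODER_ARGS[dataset]]


def lstm_command(dataset: str, encoder_checkpoint: str) -> List[str]:
    """Stage 2: build the LSTM training command for a trained auto-encoder."""
    return ["python3", "train_lstm.py",
            "--encoder_checkpoint", encoder_checkpoint, *LSTM_ARGS[dataset]]


def run(command: List[str], dry_run: bool = True) -> None:
    """Print the command; execute it only when dry_run is False."""
    print(" ".join(command))
    if not dry_run:
        subprocess.run(command, check=True)


if __name__ == "__main__":
    run(autoencoder_command("dsprites"))
    # "<checkpoint>" is a placeholder, exactly as in the commands above.
    run(lstm_command("dsprites", "<checkpoint>"))
```

With the default `dry_run=True` the script only prints the commands, which makes it easy to compare them against the ones listed above before launching a long training run.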

Evaluation

To evaluate the auto-encoder, run the following command:

python3 test_autoencoder.py --checkpoint <checkpoint> --dataset <dataset>

Here <checkpoint> is the latest auto-encoder checkpoint and <dataset> is the dataset to use. If the dataset is mnist, append the --no_color and --num_channels arguments; if it is mpi3d_real, append --z_dims. Use the same values as in the training commands.

To evaluate the LSTM, run the following command:

python3 test_lstm.py --ae_checkpoint <ae_checkpoint> --lstm_checkpoint <lstm_checkpoint> --dataset <dataset>

Here <ae_checkpoint> is the latest auto-encoder checkpoint, <lstm_checkpoint> is the latest LSTM checkpoint, and <dataset> is the dataset to use. If the dataset is mnist, append the --no_color and --num_channels arguments; if it is mpi3d_real, append --z_dims. Use the same values as in the training commands.

To compute the proposed disentanglement metric:

python3 compute_disentanglement_metric.py --checkpoint <checkpoint> --dataset <dataset>

Here <checkpoint> is the latest auto-encoder checkpoint and <dataset> is the dataset to use. If the dataset is mnist, append the --no_color and --num_channels arguments; if it is mpi3d_real, append --z_dims. Use the same values as in the training commands.
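The dataset-specific flag rule used by all three evaluation commands can be written down once. The sketch below is an illustration rather than repository code: `extra_flags` and `metric_command` are hypothetical names, and the values `1` and `10` are taken from the training commands on the assumption that evaluation must use the same settings as training.

```python
from typing import List


def extra_flags(dataset: str) -> List[str]:
    """Return the extra arguments this README asks for, per dataset."""
    if dataset == "mnist":
        # Grayscale input; value 1 assumed from the training command.
        return ["--no_color", "--num_channels", "1"]
    if dataset == "mpi3d_real":
        # Value 10 assumed from the training command.
        return ["--z_dims", "10"]
    if dataset == "dsprites":
        return []
    raise ValueError(f"unknown dataset: {dataset!r}")


def metric_command(checkpoint: str, dataset: str) -> List[str]:
    """Build the disentanglement-metric command with the right extra flags."""
    return ["python3", "compute_disentanglement_metric.py",
            "--checkpoint", checkpoint, "--dataset", dataset,
            *extra_flags(dataset)]


if __name__ == "__main__":
    # "<checkpoint>" is a placeholder, exactly as in the commands above.
    for name in ("mnist", "dsprites", "mpi3d_real"):
        print(" ".join(metric_command("<checkpoint>", name)))
```

The same `extra_flags` result can be appended to the auto-encoder and LSTM evaluation commands, since the rule stated for them is identical.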
