Comparison experiments between different loss functions and weight regularization techniques. For these experiments I chose the DCGAN architecture.
Comparisons (sketched in code below):
- BCE loss vs. W-loss
- Gradient penalty (GP) vs. Spectral norm (SN)
- Transposed Conv (Deconv) vs. Upsampling + Conv (for intermediate layers)
- BatchNorm (BN) vs. No BatchNorm
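For reference, here is a minimal PyTorch sketch of the compared pieces, not the repository's actual training code: the BCE and Wasserstein loss formulations, a WGAN-GP gradient penalty term, and wrapping a critic layer with spectral norm. The layer shapes and the critic argument are illustrative assumptions.

```python
import torch
import torch.nn as nn
from torch.nn.utils import spectral_norm

# --- BCE (standard GAN) vs. Wasserstein losses ---
bce = nn.BCEWithLogitsLoss()

def d_loss_bce(real_logits, fake_logits):
    # Discriminator: push real logits towards 1 and fake logits towards 0
    return bce(real_logits, torch.ones_like(real_logits)) + \
           bce(fake_logits, torch.zeros_like(fake_logits))

def g_loss_bce(fake_logits):
    # Generator: make the discriminator output 1 for fakes
    return bce(fake_logits, torch.ones_like(fake_logits))

def d_loss_w(real_scores, fake_scores):
    # Critic: maximise E[D(real)] - E[D(fake)] (minimise its negative)
    return fake_scores.mean() - real_scores.mean()

def g_loss_w(fake_scores):
    return -fake_scores.mean()

# --- Gradient penalty (WGAN-GP) ---
def gradient_penalty(critic, real, fake, device="cpu"):
    # Penalise the critic's gradient norm on points interpolated between
    # real and fake images, pushing the critic towards being 1-Lipschitz
    alpha = torch.rand(real.size(0), 1, 1, 1, device=device)
    mixed = alpha * real.detach() + (1 - alpha) * fake.detach()
    mixed.requires_grad_(True)
    scores = critic(mixed)
    grads = torch.autograd.grad(
        outputs=scores, inputs=mixed,
        grad_outputs=torch.ones_like(scores),
        create_graph=True, retain_graph=True,
    )[0]
    grad_norm = grads.view(grads.size(0), -1).norm(2, dim=1)
    return ((grad_norm - 1) ** 2).mean()

# --- Spectral norm: wrap critic layers instead of adding a penalty term ---
sn_conv = spectral_norm(nn.Conv2d(64, 128, kernel_size=4, stride=2, padding=1))
```

With W-loss, the gradient penalty is added to the critic loss as an extra term (typically scaled by a coefficient such as 10), while spectral norm replaces it by constraining the critic's weights directly.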
Every model was trained for 50 epochs on the Simpson Faces dataset from Kaggle; more precisely, on a slightly cleaned version of its cropped data.
❕ The experiments were not meant to produce the best-looking generated images, but rather to show the effects of different training stabilization techniques. Feel free to play around with model architectures, hyperparameters and the number of epochs to achieve better results.
❕❕ Nor were the results meant to estimate the general "goodness" of any tested combination of techniques. One should make decisions according to their specific case.
- Clone the repository
git clone https://github.com/ivankunyankin/gan.git
cd gan
- Create an environment and install the dependencies
python3 -m venv env
source env/bin/activate
pip3 install -r requirements.txt
- cd into the directory of the model you want to train
In order to start training you need to run:
python3 train.py
Add the --upsample flag if you want to train an upsampling + conv generator.
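As a rough sketch of what the flag toggles (hypothetical code, not the repository's actual generator), a single DCGAN-style upsampling block in PyTorch could look like this:

```python
import torch.nn as nn

def up_block(in_ch, out_ch, upsample=False):
    # One generator block that doubles the spatial resolution.
    # upsample=False: transposed convolution (deconv), as in the original DCGAN
    # upsample=True:  nearest-neighbour upsampling followed by a regular conv,
    #                 which tends to avoid checkerboard artifacts
    if upsample:
        resize = [
            nn.Upsample(scale_factor=2, mode="nearest"),
            nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=1, padding=1),
        ]
    else:
        resize = [nn.ConvTranspose2d(in_ch, out_ch, kernel_size=4, stride=2, padding=1)]
    return nn.Sequential(*resize, nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True))
```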
You can play around with the hyperparameter values. You will find them in the corresponding config.yml
You can watch the training process and see the intermediate results in tensorboard. Run the following:
tensorboard --logdir logs/
In the images below you can see that, when using BCE as the loss function, removing BatchNorm led to mode collapse. This was not the case when using Upsampling + Conv instead of Deconv layers. Changing the way we increase the intermediate image size can indeed help with the low-level artifacts inherent to transposed convolutions. But most importantly, take a look at the colors: they are much less saturated with BCE than with W-loss.
With W-loss the colors are much better than with BCE loss. Interestingly, BatchNorm didn't improve image quality. As for the techniques for increasing the image size, they showed comparable results. Deconv produced slightly more head-like shapes, but the difference is not significant, nor are the results trustworthy enough to say that one is better than the other.
Overall, the experiments with spectral norm were not successful. That may be because this approach is more sensitive to the model architecture and hyperparameters during training, given that it is a stricter weight normalization technique than gradient penalty.