Generative models are difficult to train and tune, and there are no exact criteria for success. This project therefore explores a straightforward pipeline for generative modeling, tuning, and benchmarking. Within the purview of this (personal) exploration:
- Build models with different structures for MNIST-style digit image generation.
- Build models of different sizes: a small GAN and a large GAN.
- Build a custom FID benchmark for MNIST-style image generation (see the sketch after this list).
- Do all of this in JAX.
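A custom FID for MNIST works like the standard one, except the Inception network is swapped for features from a small MNIST classifier. Below is a minimal sketch of the distance itself, assuming feature matrices for real and generated images have already been extracted by some classifier (not shown here); the matrix square root runs on the host via SciPy, since it is not a JAX op:

```python
import jax.numpy as jnp
import numpy as np
from scipy import linalg


def activation_stats(feats):
    """Mean and covariance of a (num_samples, feat_dim) feature matrix."""
    mu = jnp.mean(feats, axis=0)
    cov = jnp.cov(feats, rowvar=False)
    return mu, cov


def fid(real_feats, fake_feats):
    """Frechet distance between Gaussians fit to real and generated features."""
    mu1, cov1 = activation_stats(real_feats)
    mu2, cov2 = activation_stats(fake_feats)
    # scipy's sqrtm is not jittable, so finish the computation in NumPy.
    cov1, cov2 = np.asarray(cov1), np.asarray(cov2)
    covmean = linalg.sqrtm(cov1 @ cov2)
    if np.iscomplexobj(covmean):
        covmean = covmean.real  # discard tiny imaginary parts from sqrtm
    diff = np.asarray(mu1 - mu2)
    return float(diff @ diff + np.trace(cov1 + cov2 - 2.0 * covmean))
```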
Generated samples:
Training losses:
Influenced by this implementation of score-based diffusion, with modifications for personal hardware, the score function, hyperparameters, and some minor structural changes.
Quoting:
- Uses the variance-preserving SDE to corrupt the data: $y(0) \sim \mathrm{data}, \quad dy(t) = -\tfrac{1}{2}\beta(t)\,y(t)\,dt + \sqrt{\beta(t)}\,dw(t)$
- Trains a score model $s_\theta$ according to the denoising objective:
$\arg\min_\theta \mathbb{E}_{t \sim \mathrm{Uniform}[0, T]}\,\mathbb{E}_{y(0) \sim \mathrm{data}}\,\mathbb{E}_{(y(t)|y(0)) \sim \mathrm{SDE}}\; \lambda(t) \left\| s_\theta(t, y(t)) - \nabla_y \log p(y(t)|y(0)) \right\|_2^2$
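Under the VP SDE the conditional $p(y(t)|y(0))$ is Gaussian, so the objective above amounts to predicting a rescaling of the noise added to each sample. A minimal sketch of that loss in JAX, assuming a linear $\beta(t)$ schedule and a generic `score_model(params, t, y)` call signature (both illustrative, not necessarily what this implementation uses):

```python
import jax
import jax.numpy as jnp


def int_beta(t, beta_min=0.1, beta_max=20.0):
    """Integral of an assumed linear beta(t) schedule from 0 to t."""
    return beta_min * t + 0.5 * (beta_max - beta_min) * t**2


def dsm_loss(params, score_model, y0, key, t1=1.0):
    """Denoising score-matching loss for the VP SDE, for a single sample y0."""
    t_key, noise_key = jax.random.split(key)
    t = jax.random.uniform(t_key, (), minval=1e-5, maxval=t1)
    # p(y(t) | y(0)) is Gaussian under the VP SDE.
    mean = y0 * jnp.exp(-0.5 * int_beta(t))
    var = jnp.maximum(1.0 - jnp.exp(-int_beta(t)), 1e-5)
    std = jnp.sqrt(var)
    noise = jax.random.normal(noise_key, y0.shape)
    yt = mean + std * noise
    # grad_y log p(y(t) | y(0)) = -(y(t) - mean) / var = -noise / std
    target = -noise / std
    pred = score_model(params, t, yt)
    # lambda(t) = var is a common weighting choice.
    return var * jnp.mean((pred - target) ** 2)
```

`jax.vmap`-ing this over a batch of images and keys, then averaging, gives the per-batch training loss.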
Generated samples:
Training losses:
Inspired by this implementation, but with the structure modified for 28 x 28 MNIST instead of 64 x 64, along with changes to the training process and model structure (e.g. activations).
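Going from 64 x 64 to 28 x 28 mostly changes the shape arithmetic of the generator: start from a 7 x 7 feature map and upsample twice (7 → 14 → 28). A sketch of that layout, assuming a flax.linen-style module; the widths, kernel sizes, and activations here are illustrative rather than the exact ones used:

```python
import jax
import jax.numpy as jnp
import flax.linen as nn


class Generator28(nn.Module):
    """DCGAN-style generator resized for 28 x 28 MNIST (illustrative)."""

    @nn.compact
    def __call__(self, z):
        x = nn.Dense(7 * 7 * 128)(z)
        x = nn.relu(x)
        x = x.reshape((z.shape[0], 7, 7, 128))  # NHWC, start at 7 x 7
        x = nn.ConvTranspose(64, (4, 4), strides=(2, 2), padding="SAME")(x)
        x = nn.relu(x)  # 14 x 14
        x = nn.ConvTranspose(1, (4, 4), strides=(2, 2), padding="SAME")(x)
        return jnp.tanh(x)  # 28 x 28 x 1, scaled to [-1, 1]


# Quick shape check with a batch of 8 latent vectors of size 64.
params = Generator28().init(jax.random.PRNGKey(0), jnp.zeros((8, 64)))
imgs = Generator28().apply(params, jnp.zeros((8, 64)))  # (8, 28, 28, 1)
```

With `padding="SAME"` and stride 2, each `ConvTranspose` doubles the spatial size, which is what turns the original 4 → 8 → 16 → 32 → 64 progression into 7 → 14 → 28.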
Generated samples:
Training losses:
A small MLP-GAN used as a basic control (and to analyze the training pathologies of GANs in a smaller setting). Based on my own PyTorch implementation.
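A minimal sketch of what the alternating update for such an MLP-GAN can look like in pure JAX, using the non-saturating generator loss; the network sizes, plain-SGD update, and parameter layout are illustrative rather than the exact ones used:

```python
import jax
import jax.numpy as jnp


def mlp(params, x):
    """Small MLP; `params` is a list of (weight, bias) pairs."""
    for w, b in params[:-1]:
        x = jax.nn.leaky_relu(x @ w + b, 0.2)
    w, b = params[-1]
    return x @ w + b


def d_loss(d_params, g_params, real, z):
    """Discriminator loss: push logits up on real, down on generated samples."""
    fake = mlp(g_params, z)
    real_logit = mlp(d_params, real)
    fake_logit = mlp(d_params, fake)
    return jnp.mean(jax.nn.softplus(-real_logit) + jax.nn.softplus(fake_logit))


def g_loss(g_params, d_params, z):
    """Non-saturating generator loss: raise the discriminator's logit on fakes."""
    fake = mlp(g_params, z)
    return jnp.mean(jax.nn.softplus(-mlp(d_params, fake)))


@jax.jit
def train_step(d_params, g_params, real, z, lr=2e-4):
    """One simultaneous discriminator/generator update with plain SGD."""
    d_grads = jax.grad(d_loss)(d_params, g_params, real, z)
    g_grads = jax.grad(g_loss)(g_params, d_params, z)
    sgd = lambda p, g: jax.tree_util.tree_map(lambda a, b: a - lr * b, p, g)
    return sgd(d_params, d_grads), sgd(g_params, g_grads)
```

In a small setting like this, pathologies such as mode collapse or a dominating discriminator are easier to read off the per-step `d_loss` and `g_loss` curves.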