This first comprehensive book on models behind Generative AI has been thoroughly revised to cover all major classes of deep generative models: mixture models, Probabilistic Circuits, Autoregressive Models, Flow-based Models, Latent Variable Models, GANs, Hybrid Models, Score-based Generative Models, Energy-based Models, and Large Language Models. In addition, Generative AI Systems are discussed, demonstrating how deep generative models can be used for neural compression, among others.
Deep Generative Modeling is designed to appeal to curious students, engineers, and researchers with a modest mathematical background in undergraduate calculus, linear algebra, probability theory, and the basics of machine learning, deep learning, and programming in Python and PyTorch (or other deep learning libraries). It should find interest among students and researchers from a variety of backgrounds, including computer science, engineering, data science, physics, and bioinformatics who wish to get familiar with deep generative modeling. In order to engage with a reader, the book introduces fundamental concepts with specific examples and code snippets.
The aim of the book is to outline the most important techniques in deep generative modeling and, eventually, enable readers to formulate new models and implement them.
Front matter
Foreword by Prof. Max Welling
Preface
- Mixture of Gaussians (MoGs): a mixture of Gaussians
- Autoregressive Models (ARMs): ARMs parameterized with Causal Convolutionas and Transformers
- Flow-based models (flows): RealNVP and IDFs (Integer Discrete Flows)
- Variational Auto-Encoders (VAEs): a plain VAE and various priors, a hierarchical VAE
- Diffusion-based Deep Generative Models (DDGMs): a Gaussian forward diffusion
- Score-based Generative Models (SBGMs): score matching, score-based generative models, (conditional) flow matching
- Hybrid modeling
- Energy-based Models
- Generative Adversarial Networks (GANs)
- Neural Compression with Deep Generative Modeling
- Large Language Models (LLMs)
Appendix
A. Useful Facts from Algebra and Calculus
B. Useful Facts from Probability Theory and Statistics
The examples might look oversimplistic but that's the point! My idea is that everyone is able to follow every line of the code, and run the experiments within a couple of minutes on almost any laptop or computer. My goal is to encourage people who are new to understand and play with deep generative models. More advanced users, on the other hand, could refresh their knowledge or build on top of that to quickly check their ideas. Either way, I hope the code will help everyone to join a fascinating journey on deep generative modeling!
In all examples, we used:
pytorch 1.7.0
numpy 1.17.2
matplotlib 3.1.1
scikit-learn 0.21.3
pytorch-model-summary 0.1.1
jupyter 1.0.0
All examples of implemented deep generative models are provided as jupyter notebooks. They can be find in the following folders:
mog
: an example of a mixture of Gaussians with equiprobable components or trainable component probabilities.arms
: an example of an autoregressive model with a causal convolutiona layer in 1D and transformers.flows
: an example of a flow-based model, namely, RealNVP with coupling layers and permutation layers, and IDFs (Integer Discrete Flows).vaes
: (i) an example of a Variational Auto-Encoder using fully-connected layers and a standard Gaussian prior, (ii) an example of various priors for VAEs, (iii) an example of a hierarchical VAE.ddgms
: an example of a Diffusion-based Deep Generative Model using a Gaussian forward diffusion with a fixed variace and a reverse diffusion parameterized by an MLP.sbgms
: (i) an example of a score model using the score matching method and an MLP-based score model, (ii) an example of an SDE-based diffusion model parameterized by an MLP, (iii) an example of a conditional flow matching model parameterized by an MLP.hybrid_modeling
: an example of a hybrid model using fully-connected layers and IDFs.ebms
: an example of an energy-based model parameterized by an MLP.gans
: an example of a GAN parameterized by MLPs.neural_compression
: an example of applying deep generative modeling to image neural compression.llms
: an example of a decoder-based transformer (an LLM; here we call it teenyGPT).
For teaching purposes, please find the following material:
assignments_examples
: three examples of assignments: (i) for ARMs, (ii) for VAEs, (iii) for a group assignment;figures
: for all teachers, if you need to prepare your lectures, you can find some useful figures here.
Please do not forget to cite the book as the source. Thank you!
If you use this code in any way, please refer to it by citing my book "Deep Generative Modeling":
- APA style:
Tomczak, J. M. (2024). Deep Generative Modeling. Springer Cham
- Bibtex:
@book{tomczak2024deep,
title={Deep Generative Modeling},
author={Tomczak, Jakub M},
publisher={Springer Cham},
year={2024}
}