Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In the reverse stage, a model is tasked at recovering the original input data by learning to gradually reverse the diffusion process, step by step. Diffusion models are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens, i.e. low speeds due to the high number of steps involved during sampling. This repository categorizes the papers about diffusion models, applied in computer vision, according to their target task. The classifcation is based on our survey Diffusion Models in Vision: A Survey
- Unconditional Generation
- Conditional Generation
- Text-to-Image generation
- Super-Resolution
- Image Editing
- Region Image Editing
- Inpainting
- Image-to-Image Translation
- Image Segmentation
- Multi-Task
- Medical Image-to-Image Translation
- Medical Image Generation
- Medical Image Segmentation
- Medical Image Anomaly Detection
- Video Generation
- Few-Shot Image Generation
- Counterfactual Explanations and Estimations
- Image Restoration
- Image Registration
- Adversarial Purification
- Semantic Image Generation
- Shape Generation and Completion
- Classification
- Point Cloud Generation
- Theoretical
- Deep unsupervised learning using non-equilibrium thermodynamics
- Denoising diffusion probabilistic models
- Improved techniques for training score-based generative models
- Adversarial score matching and improved sampling for image generation
- Maximum likelihood training of score-based diffusion models
- D2C: Diffusion-Decoding Models for Few-Shot Conditional Generation
- Diffusion Normalizing Flow
- Diffusion Schrodinger bridge with applications to score-based generative modeling
- Structured denoising diffusion models in discrete state-spaces
- Score-based generative modeling in latent space
- Improved denoising diffusion probabilistic models
- Denoising Diffusion Implicit Models
- Non-Gaussian denoising diffusion models
- Bilateral denoising diffusion models
- Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
- Noise estimation for generative diffusion models
- Gotta go fast when generating data with score-based models
- Learning to efficiently sample from diffusion probabilistic models
- Deep generative learning via Schrodinger bridge
- VAEs meet Diffusion Models: Efficient and High-Fidelity Generation
- Variational diffusion models
- Score-based generative modeling with critically-damped Langevin diffusion
- Tackling the generative learning trilemma with Denoising Diffusion GANs
- Heavy-tailed denoising score matching
- Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models
- Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality
- Truncated Diffusion Probabilistic Models
- Subspace Diffusion Generative Models
- Maximum Likelihood Training of Implicit Nonlinear Diffusion Models
- On Analyzing Generative and Denoising Capabilities of Diffusion-based Deep Generative Models
- Diffusion-GAN: Training GANs with Diffusion
- Accelerating Score-based Generative Models for High-Resolution Image Synthesis
- Diffusion models beat gans on image synthesis
- Classifier-Free Diffusion Guidance
- On Fast Sampling of Diffusion Probabilistic Models
- DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents
- Pseudo Numerical Methods for Diffusion Models on Manifolds
- Cascaded Diffusion Models for High Fidelity Image Generation
- High Fidelity Visualization of What Your Self-Supervised Representation Knows About
- Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives
- {Dynamic Dual-Output Diffusion Models
- Generating High Fidelity Data from Low-density Regions using Diffusion Models
- Perception Prioritized Training of Diffusion Models
- Elucidating the Design Space of Diffusion-Based Generative Models
- Progressive distillation for fast sampling of diffusion models
- Denoising Likelihood Score Matching for Conditional Score-based Data Generation
- On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models
- A Continuous Time Framework for Discrete Denoising Models
- DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
- Compositional Visual Generation with Composable Diffusion Models
- Vector quantized diffusion model for text-to-image synthesis
- Hierarchical text-conditional image generation with CLIP latents
- Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
- Fast Sampling of Diffusion Models with Exponential Integrator
- DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
- Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
- Text2Human: Text-Driven Controllable Human Image Generation
- DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
- Image super-resolution via iterative refinement
- Score-based Generative Neural Networks for Large-Scale Optimal Transport
- SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations
- Blended Latent Diffusion
- GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
- RePaint: Inpainting using Denoising Diffusion Probabilistic Models
- Palette: Image-to-Image Diffusion Models
- UNIT-DDPM: UNpaired Image Translation with Denoising Diffusion Probabilistic Models
- EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
- Pretraining is All You Need for Image-to-Image Translation
- VQBB: Image-to-image Translation with Vector Quantized Brownian Bridge
- The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models
- Label-Efficient Semantic Segmentation with Diffusion Models
- SegDiff: Image Segmentation with Diffusion Probabilistic Models
- Generative modeling by estimating gradients of the data distribution
- Score-Based Generative Modeling through Stochastic Differential Equations
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
- Learning Energy-Based Models by Diffusion Recovery Likelihood
- Conditional image generation with score-based diffusion models
- More control for free! Image synthesis with semantic diffusion guidance
- ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models
- Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
- High-Resolution Image Synthesis with Latent Diffusion Models
- Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
- Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
- DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation
- Understanding DDPM Latent Codes Through Optimal Transport
- Conditional Simulation Using Diffusion Schrödinger Bridges
- Retrieval-Augmented Diffusion Models
- Accelerating Diffusion Models via Early Stop of the Diffusion Process
- Diffusion Models as Plug-and-Play Priors
- Non-Uniform Diffusion Models
- Unsupervised Medical Image Translation with Adversarial Diffusion Models
- Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model
- Solving inverse problems in medical imaging with score-based generative models
- Score-based diffusion models for accelerated MRI
- Diffusion Models for Medical Anomaly Detection
- Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models
- AnoDDPM: Anomaly Detection With Denoising Diffusion Probabilistic Models Using Simplex Noise
- What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
- Video Diffusion Models
- Diffusion Probabilistic Modeling for Video Generation
- Flexible Diffusion Modeling of Long Videos
- Diffusion Models for Video Prediction and Infilling
- Diffusion Models for Counterfactual Explanations
- Diffusion Causal Models for Counterfactual Estimation