PyTorch implementation of a conditional diffusion model trained with classifier-free guidance.
The model generates 128×128 images conditioned on a fixed-length attribute vector.
Training is currently ongoing; intermediate results are provided for transparency.
The following composite image shows sampling progression during training. Each cell is a 4×4 grid (16 samples) generated using DDIM sampling with classifier-free guidance.
Sampling checkpoints shown (left → right, top → bottom): 15k, 30k, 45k, 60k, 75k, 90k, 105k, 117k, 129k steps
Random samples generated near the current training stage are shown above.

Model and training configuration:
- Architecture: UNet with residual blocks and self-attention
- Resolution: 128×128
- Input channels: 3 (RGB)
- Conditioning: fixed-length attribute vector
- Prediction type: v-prediction
- Attention applied at deeper UNet resolutions
- Base channels: 192
- Channel multipliers: [192, 384, 768, 768]
- Time embedding dimension: 512
- Conditioning injected via additive embedding
- Group Normalization + SiLU activations
- Timesteps: 1000
- Noise schedule: cosine schedule
- Objective: mean squared error on v-prediction
- Sampler: DDIM
- Sampling steps: 50
- Classifier-Free Guidance scale: 4.0
- Conditional dropout probability: 0.15
- Batch size: 4
- Optimizer: AdamW
- Learning rate: 1e-4
- Weight decay: 0.01
- Gradient clipping: 1.0
- Mixed precision training (AMP)
- EMA decay: 0.9999
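The core of the objective above — cosine noise schedule, v-prediction target, and conditional dropout for classifier-free guidance — can be sketched roughly as follows. This is a minimal illustration under the stated hyperparameters, not the repository's actual code; `training_step` and the `model(x_t, t, cond)` call signature are hypothetical names.

```python
import torch
import torch.nn.functional as F

def cosine_alpha_bar(t, s=0.008):
    """Cumulative signal level alpha_bar(t) for t in [0, 1] (cosine schedule)."""
    return torch.cos((t + s) / (1 + s) * torch.pi / 2) ** 2

def _expand(a, like):
    # reshape (B,) -> (B, 1, 1, 1) so it broadcasts over image tensors
    return a.view(-1, *([1] * (like.dim() - 1)))

def training_step(model, x0, cond, p_drop=0.15):
    """One step: MSE on the v-prediction target, with conditional dropout."""
    b = x0.size(0)
    t = torch.rand(b)                                 # continuous time in [0, 1)
    a_bar = _expand(cosine_alpha_bar(t), x0)
    noise = torch.randn_like(x0)
    # forward process: x_t = sqrt(a_bar) * x0 + sqrt(1 - a_bar) * eps
    x_t = a_bar.sqrt() * x0 + (1 - a_bar).sqrt() * noise
    # v-prediction target: v = sqrt(a_bar) * eps - sqrt(1 - a_bar) * x0
    v = a_bar.sqrt() * noise - (1 - a_bar).sqrt() * x0
    # classifier-free guidance training: zero out conditioning with prob 0.15
    keep = (torch.rand(b, 1) >= p_drop).float()
    v_pred = model(x_t, t, cond * keep)
    return F.mse_loss(v_pred, v)
```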
The training script includes:
- automatic checkpointing
- full RNG state restoration
- resumable training from latest.pt
- periodic DDIM sampling during training
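Full RNG restoration means a resumed run continues the exact noise and dropout sequence of the interrupted one. A sketch of what saving and restoring that state can look like is below (hypothetical helper names, assuming the Python/NumPy/PyTorch RNGs used here; the actual script's layout may differ):

```python
import random
import numpy as np
import torch

def save_training_checkpoint(path, model, ema_model, optimizer, scaler, step):
    """Training checkpoint: weights, EMA, optimizer, AMP scaler, RNG states."""
    torch.save({
        "step": step,
        "model": model.state_dict(),
        "ema": ema_model.state_dict(),
        "optimizer": optimizer.state_dict(),
        "scaler": scaler.state_dict(),
        "rng": {
            "python": random.getstate(),
            "numpy": np.random.get_state(),
            "torch": torch.get_rng_state(),
        },
    }, path)

def load_training_checkpoint(path, model, ema_model, optimizer, scaler):
    """Restore everything, including RNG streams, and return the step."""
    # weights_only=False: RNG states are plain pickled Python objects
    ckpt = torch.load(path, map_location="cpu", weights_only=False)
    model.load_state_dict(ckpt["model"])
    ema_model.load_state_dict(ckpt["ema"])
    optimizer.load_state_dict(ckpt["optimizer"])
    scaler.load_state_dict(ckpt["scaler"])
    random.setstate(ckpt["rng"]["python"])
    np.random.set_state(ckpt["rng"]["numpy"])
    torch.set_rng_state(ckpt["rng"]["torch"])
    return ckpt["step"]
```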
Two types of checkpoints are saved:

- Training checkpoints
  - Optimizer state
  - Gradient scaler
  - RNG states
  - Model weights
  - EMA weights
- Clean inference checkpoints
  - Model weights only
  - EMA weights only
Clean checkpoints are intended for inference and sampling.
Sampling is performed using DDIM with classifier-free guidance.
cd inference
python sample.py

Sampling configuration (CFG scale, number of steps, checkpoint paths) is defined directly in sample.py.
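Conceptually, a guided DDIM step for a v-prediction model combines a conditional and an unconditional prediction, then takes a deterministic update. The sketch below illustrates this under the cosine schedule; `ddim_sample` and the model call signature are assumptions for illustration, not the repo's API:

```python
import math
import torch

def _alpha_bar(t, s=0.008):
    """Cosine-schedule alpha_bar(t) for a scalar t in [0, 1]."""
    return math.cos((t + s) / (1 + s) * math.pi / 2) ** 2

@torch.no_grad()
def ddim_sample(model, cond, shape, steps=50, guidance=4.0):
    """Deterministic DDIM sampling (eta = 0) with classifier-free guidance."""
    x = torch.randn(shape)
    null_cond = torch.zeros_like(cond)   # "unconditional" = zeroed conditioning
    ts = [1.0 - i / steps for i in range(steps + 1)]
    for t, t_next in zip(ts[:-1], ts[1:]):
        tb = torch.full((shape[0],), t)
        # CFG: v = v_uncond + s * (v_cond - v_uncond)
        v_u = model(x, tb, null_cond)
        v_c = model(x, tb, cond)
        v = v_u + guidance * (v_c - v_u)
        # recover x0 and eps from the v-parameterisation
        a = _alpha_bar(t)
        x0 = math.sqrt(a) * x - math.sqrt(1 - a) * v
        eps = math.sqrt(1 - a) * x + math.sqrt(a) * v
        # DDIM update to the next (less noisy) timestep
        a_next = _alpha_bar(t_next)
        x = math.sqrt(a_next) * x0 + math.sqrt(1 - a_next) * eps
    return x
```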
To start or resume training:
python train_diffusion.py

All hyperparameters and paths are defined directly in the training script.
- Training is ongoing and expected to continue to 200k steps.
- Current samples are provided to document model progression.
- The code prioritizes clarity and reproducibility over abstraction.
License: MIT

