🎨 SyntheticGen

Mitigating Long-Tail Bias in LoveDA via Prompt-Controlled Diffusion Augmentation

Addressing class imbalance in remote sensing datasets through controlled synthetic generation

🌟 Overview

SyntheticGen tackles the long-tail class distribution of LoveDA by generating synthetic image–mask pairs with explicit control over class ratios: you specify exactly what proportion of each land-cover class should appear in the output.

✨ Highlights

  • Two-stage pipeline (sketched below): ratio-conditioned layout D3PM + ControlNet image synthesis.
  • Full or sparse ratio control (e.g., building:0.4); unspecified classes are completed from an estimated ratio prior.
  • Config-first workflow for reproducible experiments.
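
Schematically, sampling runs the two stages in sequence:

class ratios -> Stage A: D3PM layout sampler -> semantic layout (mask) -> Stage B: ControlNet -> synthetic image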

[Figure: example SyntheticGen results]

🚀 Quick Start

Installation

git clone https://github.com/Buddhi19/SyntheticGen.git
cd SyntheticGen
pip install -r requirements.txt

Generate Your First Synthetic Image–Mask Pair

python src/scripts/sample_pair.py \
  --config configs/sample_pair_ckpt40000_building0.4.yaml
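
As a rough illustration, a sampling config of this kind points at a trained checkpoint and sets target class ratios. The keys below are assumptions inferred from the config name and the CLI flags shown later, not the actual schema (see configs/ for that):

# Illustrative sampling-config sketch; key names are assumptions.
layout_checkpoint: "checkpoint-40000"
ratios:
  building: 0.4
save_dir: outputs/building0.4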

📚 Usage

Training Pipeline (Configs)

Stage A: Train Layout Generator (D3PM)

python src/scripts/train_layout_d3pm.py \
  --config configs/train_layout_d3pm_masked_sparse_80k.yaml

(Optional) Ratio Prior for Sparse Conditioning

python src/scripts/compute_ratio_prior.py \
  --config configs/compute_ratio_prior_loveda_train.yaml

This estimates empirical class-ratio statistics on the LoveDA training split, which are used to complete sparse ratio specifications (classes you leave unspecified).

Stage B: Train Image Generator (ControlNet)

python src/scripts/train_controlnet_ratio.py \
  --config configs/train_controlnet_ratio_loveda_1024.yaml

Inference / Sampling (Configs)

End-to-end sampling (layout -> image):

python src/scripts/sample_pair.py \
  --config configs/sample_pair_ckpt40000_building0.4.yaml

Override config parameters via CLI if needed:

python src/scripts/sample_pair.py \
  --config configs/sample_pair_ckpt40000_building0.4.yaml \
  --ratios "building:0.4,forest:0.3" \
  --save_dir outputs/custom_generation
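
To make the ratio syntax concrete, below is a minimal sketch of how a sparse specification could be parsed and completed from a class prior. The function and the prior values are hypothetical; the repo's actual parsing may differ.

# Hypothetical sketch: parse a sparse ratio string such as
# "building:0.4,forest:0.3" and fill the remaining probability
# mass from a class prior.
def parse_ratios(spec, prior):
    fixed = {}
    for item in spec.split(","):
        name, value = item.split(":")
        fixed[name.strip()] = float(value)

    remaining = 1.0 - sum(fixed.values())
    free = {k: v for k, v in prior.items() if k not in fixed}
    prior_mass = sum(free.values())

    # Spread the leftover mass over unspecified classes in
    # proportion to their prior frequency.
    ratios = dict(fixed)
    for name, p in free.items():
        ratios[name] = remaining * p / prior_mass
    return ratios

# LoveDA's seven classes; the prior values here are made up.
prior = {"background": 0.30, "building": 0.10, "road": 0.10,
         "water": 0.10, "barren": 0.10, "forest": 0.20, "agriculture": 0.10}
print(parse_ratios("building:0.4,forest:0.3", prior))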

⚙️ Configuration

All experiments are driven by YAML/JSON config files in configs/.

Task                  Script                                  Example Config
Layout Training       src/scripts/train_layout_d3pm.py        configs/train_layout_d3pm_masked_sparse_80k.yaml
Ratio Prior           src/scripts/compute_ratio_prior.py      configs/compute_ratio_prior_loveda_train.yaml
ControlNet Training   src/scripts/train_controlnet_ratio.py   configs/train_controlnet_ratio_loveda_1024.yaml
Sampling / Inference  src/scripts/sample_pair.py              configs/sample_pair_ckpt40000_building0.4.yaml

Config tips

  • Examples live in configs/.
  • To resume training, set resume_from_checkpoint: "checkpoint-XXXXX" in your config (see the sketch below).
  • Dataset roots and domains are centralized in configs; edit once, reuse everywhere.
  • CLI flags override config values for quick experiments.
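
For example, a training config that resumes from a checkpoint and centralizes dataset settings might look like this. Apart from resume_from_checkpoint, the key names are assumptions, not the actual schema:

# Illustrative training-config sketch; only resume_from_checkpoint
# is documented above, the other keys are assumptions.
data_root: /path/to/LoveDA
domains: [Urban, Rural]
resume_from_checkpoint: "checkpoint-40000"
output_dir: outputs/layout_d3pm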

📁 Data Format

LoveDA Dataset Structure

LoveDA/
  Train/
    Train/            # some releases include this extra nesting
      Urban/
        images_png/
        masks_png/
      Rural/
        images_png/
        masks_png/
    Urban/
      images_png/
      masks_png/
    Rural/
      images_png/
      masks_png/
  Val/
    ...

Generic Dataset Structure

your_dataset/
  images/
    image_001.png
  masks/
    image_001.png   # label map with matching stem
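
The only hard requirement implied above is that each image share a filename stem with its mask. A minimal sketch of that pairing logic (paths and extensions are illustrative):

# Sketch: pair images with masks by matching filename stem.
from pathlib import Path

def list_pairs(root):
    images = sorted(Path(root, "images").glob("*.png"))
    masks = {p.stem: p for p in Path(root, "masks").glob("*.png")}
    # Keep only images whose stem has a corresponding mask.
    return [(img, masks[img.stem]) for img in images if img.stem in masks]

pairs = list_pairs("your_dataset")
print(f"found {len(pairs)} image-mask pairs")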

📦 Pre-Generated Datasets

We provide the synthetic datasets used in the paper: https://drive.google.com/drive/folders/14cMpLTgvcLdXhRY0kGhFKpDRMvpok90h?usp=sharing


🧾 Outputs

  • Checkpoints include training_config.json and class_names.json.
  • Sampling writes image.png, layout.png, and metadata.json.
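
A quick sketch of loading one generated sample, assuming the three files above land in a single output directory (the directory layout and metadata keys are assumptions):

# Sketch: load one generated sample; paths are illustrative.
import json
from pathlib import Path
from PIL import Image

out = Path("outputs/custom_generation")   # hypothetical save_dir
image = Image.open(out / "image.png")
layout = Image.open(out / "layout.png")
metadata = json.loads((out / "metadata.json").read_text())
print(image.size, layout.size, sorted(metadata))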

📄 Citation

@misc{wijenayake2026mitigating,
      title={Mitigating Long-Tail Bias in LoveDA via Prompt-Controlled Diffusion Augmentation},
      author={Buddhi Wijenayake and Nichula Wasalathilake and Roshan Godaliyadda and Vijitha Herath and Parakrama Ekanayake and Vishal M. Patel},
      year={2026},
      eprint={2602.04749},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2602.04749}
}

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.


🙏 Acknowledgments

  • LoveDA dataset creators for high-quality annotated remote sensing data
  • Hugging Face Diffusers for diffusion model infrastructure
  • ControlNet authors for controllable generation
