Single-step Diffusion for Image Compression at Ultra-Low Bitrates

Chanung Park, Joo Chan Lee, and Jong Hwan Ko

[Paper(arxiv)]

Method Overview

The proposed codec encodes images into a latent representation and reconstructs them with a single diffusion denoising step, enabling ultra-low-bitrate operation with fast decoding. The core design uses VQ-Residual training to factorize the latent into a structural base code (capturing global geometry/structure) and a learned residual (capturing high-frequency details), providing a stable scaffold while restoring fine textures. During decoding, rate-aware noise modulation sets the denoising strength according to the target bitrate (bpp)—stronger at lower bpp and milder at higher bpp. With the bitrate-conditioned noise level fixed, the decoder performs one denoising pass, eliminating multi-step overhead while preserving perceptual quality at extremely low bpp. This design achieves compression performance comparable to state-of-the-art methods and delivers ~50× faster decoding than multi-step diffusion codecs, substantially improving the practicality of generative compression.

Setup

Our code is based on ResShift.

For installation:

git clone https://github.com/Freemasti/DiffO.git
conda env create -f environments.yaml

Running

Training

torchrun --standalone --nproc_per_node=1 --nnodes=1 main_diffusion_train_vq.py  --steps 2 --cfg_path configs/config.yaml --save_dir {save_path}

Inference

python inference.py -i {input_path} -o {output_path}  -r {reference_path} --scale 1 --steps 2  --ckpt weights/ema_model_60000.pth --vqckpt weights/vq_model_60000.pth.tar --GT_vqckpt weights/GT_vq_model_60000.pth.tar --config configs/config.yaml  --ddim --one_step

BibTeX

@article{park2025diffo,
  author    = {Park, Chanung and Lee, Joo Chan and Ko, Jong Hwan},
  title     = {Single-step Diffusion for Image Compression at Ultra-Low Bitrates},
  journal   = {arXiv preprint arXiv:2506.16572},
  year      = {2025},
}

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
__pycache__		__pycache__
assets		assets
basicsr		basicsr
configs		configs
datapipe		datapipe
ldm		ldm
models		models
taming		taming
utils		utils
vqcompress		vqcompress
README.md		README.md
environments.yaml		environments.yaml
evaluate.py		evaluate.py
hyper_encoder.py		hyper_encoder.py
inference.py		inference.py
main.py		main.py
main_diffusion_train_vq.py		main_diffusion_train_vq.py
sampler.py		sampler.py
trainer_LIC_e2e.py		trainer_LIC_e2e.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Single-step Diffusion for Image Compression at Ultra-Low Bitrates

Chanung Park, Joo Chan Lee, and Jong Hwan Ko

[Paper(arxiv)]

Method Overview

Setup

Running

Training

Inference

BibTeX

About

Uh oh!

Releases

Packages

Languages

Freemasti/DiffO

Folders and files

Latest commit

History

Repository files navigation

Single-step Diffusion for Image Compression at Ultra-Low Bitrates

Chanung Park, Joo Chan Lee, and Jong Hwan Ko

[Paper(arxiv)]

Method Overview

Setup

Running

Training

Inference

BibTeX

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages