
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models


Würstchen


What is this?

Würstchen is a new framework for training text-conditional models that moves the computationally expensive text-conditional stage into a highly compressed latent space. Common approaches use a single compression stage, while Würstchen adds a second stage for even more compression. In total, Stages A & B are responsible for compressing images, and Stage C learns the text-conditional part in the low-dimensional latent space. With that, Würstchen achieves a 42x compression factor while still reconstructing images faithfully, which makes training Stage C fast and computationally cheap. We refer to the paper for details.
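As a back-of-the-envelope illustration (assuming a 512×512 training resolution, which is not stated in this README), the 42x factor corresponds to the per-side reduction from pixel space down to the 12x12 latent grid that Stage C operates on:

```python
# Illustrative check of the ~42x compression factor.
# Assumption: a 512x512 input image is encoded by Stages A & B down to
# the 12x12 latent grid mentioned below (per-side ratio 512/12).
image_side = 512   # assumed input resolution (not stated in this README)
latent_side = 12   # Stage C latent resolution

factor = image_side / latent_side
print(f"per-side compression: ~{factor:.1f}x")  # ~42.7x
```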

Use Würstchen

You can use the model through the notebooks provided here. The Stage B notebook is for reconstruction only, and the Stage C notebook is for text-conditional generation. You can also try text-to-image generation on Google Colab.

Train your own Würstchen

Training Würstchen is considerably faster and cheaper than training other text-to-image models, as it operates in a much smaller 12x12 latent space. We provide training scripts for both Stage B and Stage C.
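To get a feel for why training in the 12x12 latent space is cheap, compare the number of spatial positions Stage C must model against pixel space (again assuming a 512×512 input, which is an assumption and not stated in this README):

```python
# Illustrative sketch: spatial positions in pixel space vs. the 12x12
# latent space Stage C trains in (the 512x512 input is an assumption).
pixel_positions = 512 * 512    # 262,144 positions per image
latent_positions = 12 * 12     # 144 positions per latent
print(pixel_positions // latent_positions)  # ~1820x fewer positions
```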

Download Models

| Model        | Download    | Parameters                                    | Conditioning |
|--------------|-------------|-----------------------------------------------|--------------|
| Würstchen v1 | Huggingface | 1B (Stage C) + 600M (Stage B) + 19M (Stage A) | CLIP-H-Text  |

Acknowledgment

Special thanks to Stability AI for providing compute for our research.
