Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models
A demo for the Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)
This Colab demonstrates
- Text to image generation
- "Style" transfer
- Image reconstruction from its CLIP embedding
What you can expect:
- Text to image generation for
a photo of a meteor streaking through the night sky, detailedlooking like this
- Its individual resolutions looking like this after generation:

- To showcase generation diversity, 4 generations of the
a beautiful photo of Antelope Canyon’s light beams, detailed
- Combining a source image of an SF skyline at night with the Van Gogh "The Starry Night"

- And finally reconstructing an image of Henry VIII from its CLIP embedding:

- Get a spectrum of a generated image:

If you find this useful and would you like to cite us, please use the following bibtex
@misc{fort2025directascentsynthesisrevealing,
title={Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models},
author={Stanislav Fort and Jonathan Whitaker},
year={2025},
eprint={2502.07753},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2502.07753},
}