(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
-
Updated
Nov 10, 2025 - Python
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
Normalizing Flow with Diffusion Prior Model (NFDPM)
image processing and generation stuff using python
Some of my Discord bots
Add a description, image, and links to the image-gen topic page so that developers can more easily learn about it.
To associate your repository with the image-gen topic, visit your repo's landing page and select "manage topics."