GitHub - localhd/Mini-DALLE3: Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

minidalle3.mp4

An experimental attempt to obtain the interactive and interleave text-to-image and text-to-text experience of DALL•E 3 and ChatGPT.

Try Yourself 🤗

Download the checkpoint and save it as following

checkpoints
   - models
   - sdxl_models

run the following commands, and you will get a gradio-based web demo.

export OPENAI_API_KEY="your key"
python -m minidalle3.web

TODO

Support generating image interleaved in the conversations.
Support generating multiple images at once.
Support selecting image.
Support refinement.
Support prompt refinement/variation.
Instruct tuned LLM/SD.

Citation

If you find this repo helpful, please consider citing us.

@misc{minidalle3,
    author={Lai, Zeqiang and Zhu, Xizhou and Dai, Jifeng and Qiao, Yu and Wang, Wenhai},
    title={Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models},
    year={2023},
    url={https://github.com/Zeqiang-Lai/Mini-DALLE3},
}

Acknowledgement

IP-Adapter • Stable Diffusion XL

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
assets		assets
minidalle3		minidalle3
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Try Yourself 🤗

TODO

Citation

Acknowledgement

About

Releases

Packages

Languages

localhd/Mini-DALLE3

Folders and files

Latest commit

History

Repository files navigation

Try Yourself 🤗

TODO

Citation

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages