@@ -21,8 +21,10 @@ API and command-line option may change frequently.***
 - [SD3/SD3.5](./docs/sd3.md)
 - [Flux-dev/Flux-schnell](./docs/flux.md)
 - [Chroma](./docs/chroma.md)
+- [Qwen Image](./docs/qwen_image.md)
 - Image Edit Models
 - [FLUX.1-Kontext-dev](./docs/kontext.md)
+- [Qwen Image Edit/Qwen Image Edit 2509](./docs/qwen_image_edit.md)
 - Video Models
 - [Wan2.1/Wan2.2](./docs/wan.md)
 - [PhotoMaker](https://github.com/TencentARC/PhotoMaker) support.
@@ -285,7 +287,7 @@ usage: ./bin/sd [arguments]
 
 arguments:
   -h, --help                         show this help message and exit
-  -M, --mode [MODE]                  run mode, one of: [img_gen, vid_gen, convert], default: img_gen
+  -M, --mode [MODE]                  run mode, one of: [img_gen, vid_gen, upscale, convert], default: img_gen
   -t, --threads N                    number of threads to use during computation (default: -1)
                                      If threads <= 0, then threads will be set to the number of CPU physical cores
   --offload-to-cpu                   place the weights in RAM to save VRAM, and automatically load them into VRAM when needed
@@ -296,11 +298,13 @@ arguments:
   --clip_g                           path to the clip-g text encoder
   --clip_vision                      path to the clip-vision encoder
   --t5xxl                            path to the t5xxl text encoder
+  --qwen2vl                          path to the qwen2vl text encoder
+  --qwen2vl_vision                   path to the qwen2vl vit
   --vae [VAE]                        path to vae
   --taesd [TAESD_PATH]               path to taesd. Using Tiny AutoEncoder for fast decoding (low quality)
   --control-net [CONTROL_PATH]       path to control net model
   --embd-dir [EMBEDDING_PATH]        path to embeddings
-  --upscale-model [ESRGAN_PATH]      path to esrgan model. Upscale images after generate, just RealESRGAN_x4plus_anime_6B supported by now
+  --upscale-model [ESRGAN_PATH]      path to esrgan model. For img_gen mode, upscale images after generate, just RealESRGAN_x4plus_anime_6B supported by now
   --upscale-repeats                  Run the ESRGAN upscaler this many times (default 1)
   --type [TYPE]                      weight type (examples: f32, f16, q4_0, q4_1, q5_0, q5_1, q8_0, q2_K, q3_K, q4_K)
                                      If not specified, the default is the type of the weight file
@@ -449,6 +453,7 @@ These projects use `stable-diffusion.cpp` as a backend for their image generatio
 - [Local Diffusion](https://github.com/rmatif/Local-Diffusion)
 - [sd.cpp-webui](https://github.com/daniandtheweb/sd.cpp-webui)
 - [LocalAI](https://github.com/mudler/LocalAI)
+- [Neural-Pixel](https://github.com/Luiz-Alcantara/Neural-Pixel)
 
 ## Contributors
 
@@ -463,6 +468,7 @@ Thank you to all the people who have already contributed to stable-diffusion.cpp
 ## References
 
 - [ggml](https://github.com/ggerganov/ggml)
+- [diffusers](https://github.com/huggingface/diffusers)
 - [stable-diffusion](https://github.com/CompVis/stable-diffusion)
 - [sd3-ref](https://github.com/Stability-AI/sd3-ref)
 - [stable-diffusion-stability-ai](https://github.com/Stability-AI/stablediffusion)
@@ -473,4 +479,4 @@ Thank you to all the people who have already contributed to stable-diffusion.cpp
 - [generative-models](https://github.com/Stability-AI/generative-models/)
 - [PhotoMaker](https://github.com/TencentARC/PhotoMaker)
 - [Wan2.1](https://github.com/Wan-Video/Wan2.1)
-- [Wan2.2](https://github.com/Wan-Video/Wan2.2)
+- [Wan2.2](https://github.com/Wan-Video/Wan2.2)