Add SEGA for FLUX #3

Marlon154 · 2024-11-04T14:34:11Z

What does this PR do?

This PR adds SEGA for FLUX, enabling users to guide the diffusion process by applying editing prompts.

* Add `remote_decode` to `remote_utils` * test dependency * test dependency * dependency * dependency * dependency * docstrings * changes * make style * apply * revert, add new options * Apply style fixes * deprecate base64, headers not needed * address comments * add license header * init test_remote_decode * more * more test * more test * skeleton for xl, flux * more test * flux test * flux packed * no scaling * -save * hunyuanvideo test * Apply style fixes * init docs * Update src/diffusers/utils/remote_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * comments * Apply style fixes * comments * hybrid_inference/vae_decode * fix * tip? * tip * api reference autodoc * install tip --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…eneration model (huggingface#10626) * Update EasyAnimate V5.1 * Add docs && add tests && Fix comments problems in transformer3d and vae * delete comments and remove useless import * delete process * Update EXAMPLE_DOC_STRING * rename transformer file * make fix-copies * make style * refactor pt. 1 * update toctree.yml * add model tests * Update layer_norm for norm_added_q and norm_added_k in Attention * Fix processor problem * refactor vae * Fix problem in comments * refactor tiling; remove einops dependency * fix docs path * make fix-copies * Update src/diffusers/pipelines/easyanimate/pipeline_easyanimate_control.py * update _toctree.yml * fix test * update * update * update * make fix-copies * fix tests --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Fix SD2.X clip single file load projection_dim Infer projection_dim from the checkpoint before loading from pretrained, override any incorrect hub config. Hub configuration for SD2.X specifies projection_dim=512 which is incorrect for SD2.X checkpoints loaded from civitai and similar. Exception was previously thrown upon attempting to load_model_dict_into_meta for SD2.X single file checkpoints. Such LDM models usually require projection_dim=1024 * convert_open_clip_checkpoint use hidden_size for text_proj_dim * convert_open_clip_checkpoint, revert checkpoint[text_proj_key].shape[1] -> [0] values are identical --------- Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update pipeline_animatediff.py * Update pipeline_animatediff_controlnet.py * Update pipeline_animatediff_sparsectrl.py * Update pipeline_animatediff_video2video.py * Update pipeline_animatediff_video2video_controlnet.py --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update ip_adapter.py * Update ip_adapter.py * Update ip_adapter.py * Update ip_adapter.py * Update ip_adapter.py * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac>

* initial comit * fix empty cache * fix one more * fix style * update device functions * update * update * Update src/diffusers/utils/testing_utils.py Co-authored-by: hlky <hlky@hlky.ac> * Update src/diffusers/utils/testing_utils.py Co-authored-by: hlky <hlky@hlky.ac> * Update src/diffusers/utils/testing_utils.py Co-authored-by: hlky <hlky@hlky.ac> * Update tests/pipelines/controlnet/test_controlnet.py Co-authored-by: hlky <hlky@hlky.ac> * Update src/diffusers/utils/testing_utils.py Co-authored-by: hlky <hlky@hlky.ac> * Update src/diffusers/utils/testing_utils.py Co-authored-by: hlky <hlky@hlky.ac> * Update tests/pipelines/controlnet/test_controlnet.py Co-authored-by: hlky <hlky@hlky.ac> * with gc.collect * update * make style * check_torch_dependencies * add mps empty cache * add changes * bug fix * enable on xpu * update more cases * revert * revert back * Update test_stable_diffusion_xl.py * Update tests/pipelines/stable_diffusion/test_stable_diffusion.py Co-authored-by: hlky <hlky@hlky.ac> * Update tests/pipelines/stable_diffusion/test_stable_diffusion.py Co-authored-by: hlky <hlky@hlky.ac> * Update tests/pipelines/stable_diffusion/test_stable_diffusion_img2img.py Co-authored-by: hlky <hlky@hlky.ac> * Update tests/pipelines/stable_diffusion/test_stable_diffusion_img2img.py Co-authored-by: hlky <hlky@hlky.ac> * Update tests/pipelines/stable_diffusion/test_stable_diffusion_img2img.py Co-authored-by: hlky <hlky@hlky.ac> * Apply suggestions from code review Co-authored-by: hlky <hlky@hlky.ac> * add test marker --------- Co-authored-by: hlky <hlky@hlky.ac>

* feat: support non-diffusers lumina2 LoRAs. * revert ipynb changes (but I don't know why this is required ☹️) * empty --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>

) * Fix seed initialization to handle args.seed = 0 correctly * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
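The seed fix above concerns a common Python pitfall: `0` is falsy, so a truthiness check skips seeding when the user passes `--seed 0`. The sketch below illustrates the general pattern only. The function names are hypothetical and the training script's actual code is not shown in this PR page.

```python
import random
from typing import Optional


def seed_if_truthy(seed: Optional[int]) -> bool:
    """Buggy pattern: `if seed:` treats 0 like None and skips seeding."""
    if seed:
        random.seed(seed)
        return True
    return False


def seed_if_not_none(seed: Optional[int]) -> bool:
    """Fixed pattern: only a missing seed (None) skips seeding."""
    if seed is not None:
        random.seed(seed)
        return True
    return False
```

With `seed=0`, the first helper reports that no seeding happened, while the second seeds the generator as requested.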

…hs` is passed in a distributed training env (huggingface#10973) * updated train_dreambooth_lora to fix the LR schedulers for `num_train_epochs` in distributed training env * fixed formatting * remove trailing newlines * fixed style error
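For readers unfamiliar with the scheduler issue named above, here is a minimal sketch of how step counts could be derived when only `num_train_epochs` is given in a multi-process run. It assumes the dataloader is sharded evenly across processes and that the prepared scheduler advances once per process on each optimizer step. The function and parameter names are illustrative and are not taken from the training script.

```python
import math
from typing import Tuple


def scheduler_step_counts(
    num_batches: int,
    num_processes: int,
    gradient_accumulation_steps: int,
    num_train_epochs: int,
    lr_warmup_steps: int,
) -> Tuple[int, int]:
    """Return (training steps, warmup steps) to hand to the LR scheduler."""
    # Each process sees only its shard of the dataloader.
    batches_per_process = math.ceil(num_batches / num_processes)
    # Optimizer updates per epoch on one process.
    updates_per_epoch = math.ceil(batches_per_process / gradient_accumulation_steps)
    # Assumption: the scheduler is stepped num_processes times per update,
    # so both totals are scaled by the process count.
    num_training_steps = num_train_epochs * updates_per_epoch * num_processes
    num_warmup_steps = lr_warmup_steps * num_processes
    return num_training_steps, num_warmup_steps
```

Computing the counts from the unsharded dataloader length instead would overestimate the number of updates, so the schedule would not reach its end within the requested epochs.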

…ce#11369) * Add stochastic sampling to FlowMatchEulerDiscreteScheduler This PR adds stochastic sampling to FlowMatchEulerDiscreteScheduler based on Lightricks/LTX-Video@b1aeddd ltx_video/schedulers/rf.py * Apply style fixes * Use config value directly * Apply style fixes * Swap order * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>

…e#11281) * initial commit * initial commit * initial commit * initial commit * initial commit * initial commit * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com> * move prompt embeds, pooled embeds outside * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: hlky <hlky@hlky.ac> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: hlky <hlky@hlky.ac> * fix import * fix import and tokenizer 4, text encoder 4 loading * te * prompt embeds * fix naming * shapes * initial commit to add HiDreamImageLoraLoaderMixin * fix init * add tests * loader * fix model input * add code example to readme * fix default max length of text encoders * prints * nullify training cond in unpatchify for temp fix to incompatible shaping of transformer output during training * smol fix * unpatchify * unpatchify * fix validation * flip pred and loss * fix shift!!! * revert unpatchify changes (for now) * smol fix * Apply style fixes * workaround moe training * workaround moe training * remove prints * to reduce some memory, keep vae in `weight_dtype` same as we have for flux (as it's the same vae) https://github.com/huggingface/diffusers/blob/bbd0c161b55ba2234304f1e6325832dd69c60565/examples/dreambooth/train_dreambooth_lora_flux.py#L1207 * refactor to align with HiDream refactor * refactor to align with HiDream refactor * refactor to align with HiDream refactor * add support for cpu offloading of text encoders * Apply style fixes * adjust lr and rank for train example * fix copies * Apply style fixes * update README * update README * update README * fix license * keep prompt2,3,4 as None in validation * remove reverse ode comment * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * vae offload 
change * fix text encoder offloading * Apply style fixes * cleaner to_kwargs * fix module name in copied from * add requirements * fix offloading * fix offloading * fix offloading * update transformers version in reqs * try AutoTokenizer * try AutoTokenizer * Apply style fixes * empty commit * Delete tests/lora/test_lora_layers_hidream.py * change tokenizer_4 to load with AutoTokenizer as well * make text_encoder_four and tokenizer_four configurable * save model card * save model card * revert T5 * fix test * remove non diffusers lumina2 conversion --------- Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* 1. add pre-computation of prompt embeddings when custom prompts are used as well 2. save model card even if model is not pushed to hub 3. remove scheduler initialization from code example - not necessary anymore (it's now if the base model's config) 4. add skip_final_inference - to allow to run with validation, but skip the final loading of the pipeline with the lora weights to reduce memory reqs * pre encode validation prompt as well * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_hidream.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * pre encode validation prompt as well * Apply style fixes * empty commit * change default trained modules * empty commit * address comments + change encoding of validation prompt (before it was only pre-encoded if custom prompts are provided, but should be pre-encoded either way) * Apply style fixes * empty commit * fix validation_embeddings definition * fix final inference condition * fix pipeline deletion in last inference * Apply style fixes * empty commit * layers * remove readme remarks on only pre-computing when instance prompt is provided and change example to 3d icons * smol fix * empty commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…s in pipelines during torch.compile() (huggingface#11085) * test for better torch.compile stuff. * fixes * recompilation and graph break. * clear compilation cache. * change to modeling level test. * allow running compilation tests during nightlies.

* enable group_offload cases and quanto cases on XPU Signed-off-by: YAO Matrix <matrix.yao@intel.com> * use backend APIs Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* enable test_layerwise_casting_memory cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* enable gguf test cases on XPU Signed-off-by: YAO Matrix <matrix.yao@intel.com> * make SD35LargeGGUFSingleFileTests::test_pipeline_inference pas Signed-off-by: root <root@a4bf01945cfe.jf.intel.com> * make FluxControlLoRAGGUFTests::test_lora_loading pass Signed-off-by: Yao Matrix <matrix.yao@intel.com> * polish code Signed-off-by: Yao Matrix <matrix.yao@intel.com> * Apply style fixes --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: root <root@a4bf01945cfe.jf.intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: root <root@a4bf01945cfe.jf.intel.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* Fixing missing provider options argument * Adding if else for provider options * Apply suggestions from code review Co-authored-by: YiYi Xu <yixu310@gmail.com> * Apply style fixes * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: Uros Petkovic <urpektov@amd.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* enable unidiffuser cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix a typo Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>

Marlon154 added the enhancement New feature or request label Nov 4, 2024

Marlon154 self-assigned this Nov 4, 2024

Marlon154 force-pushed the main branch from 9418745 to b0c8973 on January 16, 2025 08:50

hlky and others added 27 commits March 2, 2025 17:10

Update VAE Decode endpoints (huggingface#10939)

54043c3

[chore] fix-copies to flux pipelines (huggingface#10941)

4aaa0d2

fix-copies went uncaught it seems.

[Tests] Remove more encode prompts tests (huggingface#10942)

7513162

* fix-copies went uncaught it seems. * remove more unneeded encode_prompt() tests * Revert "fix-copies went uncaught it seems." This reverts commit eefb302. * empty

Add Example of IPAdapterScaleCutoffCallback to Docs (huggingface#10934)

982f9b3

* Add example of Ip-Adapter-Callback. * Add image links from HF Hub.

Update pipeline_cogview4.py (huggingface#10944)

f92e599

Fix redundant prev_output_channel assignment in UNet2DModel (huggingf…

8f15be1

…ace#10945)

Update evaluation.md (huggingface#10938)

cc22058

* Update evaluation.md * Update docs/source/en/conceptual/evaluation.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

[Quantization] support pass MappingType for TorchAoConfig (huggingfac…

11d8e3c

…e#10927) * [Quantization] support pass MappingType for TorchAoConfig * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

Fix the missing parentheses when calling is_torchao_available in quan…

dcd77ce

…tization_config.py. (huggingface#10961) Update quantization_config.py
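The missing-parentheses bug named in this commit title is easy to reproduce in plain Python: a function object is always truthy, so a check that forgets to call the function passes even when the backend is absent. The sketch below uses a hypothetical stand-in, `is_backend_available`, rather than the real `is_torchao_available`.

```python
def is_backend_available() -> bool:
    """Hypothetical availability check that reports the backend as missing."""
    return False


def check_without_call() -> bool:
    # Bug: refers to the function object itself, which is always truthy.
    return bool(is_backend_available)


def check_with_call() -> bool:
    # Fix: call the function and use its return value.
    return bool(is_backend_available())
```

Only the second helper reflects the real availability of the backend.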

[LoRA] Support Wan (huggingface#10943)

3ee899f

* update * refactor image-to-video pipeline * update * fix copied from * use FP32LayerNorm

feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for …

66bf7ea

…SDXL (huggingface#10951) * feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for SDXL * make style make quality

[Docs] CogView4 comment fix (huggingface#10957)

a74f02f

* Update pipeline_cogview4.py * Use GLM instead of T5 in doc

update check_input for cogview4 (huggingface#10966)

24c062a

fix

Add VAE Decode endpoint slow test (huggingface#10946)

08f74a8

[flux lora training] fix t5 training bug (huggingface#10845)

e031caf

* fix t5 training bug * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

use style bot GH Action from huggingface_hub (huggingface#10970)

fbf6b85

use style bot GH action from hfh Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

[tests] fix tests for save load components (huggingface#10977)

6e2a93d

fix tests

Fix loading OneTrainer Flux LoRA (huggingface#10978)

b150276

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

ishan-modi and others added 30 commits April 21, 2025 09:56

[Refactor] Minor Improvement for import utils (huggingface#11161)

f59df3b

* update * update * addressed PR comments * update --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>

Update modeling imports (huggingface#11129)

f108ad8

update

[HiDream] move deprecation to 0.35.0 (huggingface#11384)

448c72a

up

Update README_hidream.md (huggingface#11386)

026507c

Small change requirements_sana.txt to requirements_hidream.txt

Fix group offloading with block_level and use_stream=True (huggingfac…

6cef71d

…e#11375) * fix * add tests * add message check

[train_dreambooth_flux] Add LANCZOS as the default interpolation mode…

4b60f4b

… for image resizing (huggingface#11395)

[Feature] Added Xlab Controlnet support (huggingface#11249)

a4f9c3c

update

Kolors additional pipelines, community contrib (huggingface#11372)

b4be422

* Kolors additional pipelines, community contrib --------- Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>

Fix Flux IP adapter argument in the pipeline example (huggingface#11402)

7986834

Fix Flux IP adapter argument in the example IP-Adapter example had a wrong argument. Fix `true_cfg` -> `true_cfg_scale`

[BUG] fixed WAN docstring (huggingface#11226)

e8312e7

update

Fix typos in strings and comments (huggingface#11407)

f00a995

[train_dreambooth_lora.py] Set LANCZOS as default interpolation mode …

bd96a08

…for resizing (huggingface#11421) * Set LANCZOS as default interpolation mode for resizing * [train_dreambooth_lora.py] Set LANCZOS as default interpolation mode for resizing

[tests] fix import. (huggingface#11434)

0e3f271

fix import.

[train_text_to_image] Better image interpolation in training scripts …

b3b04fe

…follow up (huggingface#11426) * Update train_text_to_image.py * update

[train_text_to_image_lora] Better image interpolation in training scr…

3da98e7

…ipts follow up (huggingface#11427) * Update train_text_to_image_lora.py * update_train_text_to_image_lora

[Hi-Dream LoRA] fix bug in validation (huggingface#11439)

0ac1d5b

remove unnecessary pipeline moving to cpu in validation Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

Set LANCZOS as the default interpolation for image resizing in Contro…

58431f1

…lNet training (huggingface#11449) Set LANCZOS as the default interpolation for image resizing

Raise warning instead of error for block offloading with streams (hug…

8fe5a14

…gingface#11425) raise warning instead of error

enable marigold_intrinsics cases on XPU (huggingface#11445)

60892c5

Signed-off-by: Yao Matrix <matrix.yao@intel.com>

torch.compile fullgraph compatibility for Hunyuan Video (huggingfac…

c865115

…e#11457) udpate

enable consistency test cases on XPU, all passed (huggingface#11446)

fbe2fe5

Signed-off-by: Yao Matrix <matrix.yao@intel.com>
