Deprecate `upcast_vae` in SDXL based pipelines by DN6 · Pull Request #12619 · huggingface/diffusers

DN6 · 2025-11-10T07:55:36Z

What does this PR do?

Removed the type cast in the AutoencoderKL Decoder in this PR in order to address graph breaks. The PR breaks some Mellon Nodes and default SDXL inference. The problem is that the casting is used to address an issue in the base SDXL VAE, which overflows when running in FP16.

The overflow issue has been fixed for a while and almost all finetunes use the fixed VAE, so this logic that conditionally upcasts the upsample layers of the VAE is no longer needed. The memory savings from this selective layer casting in also quite minimal, so we can safely upcast all layers in the VAE if needed.

This PR

Deprecates the upcast_vae method in the SDXL based pipelines that selectively upcasts only the upsample layers of the decoder in favour of upcasting the entire VAE if upcasting is required.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2025-11-10T08:03:55Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

DN6 · 2025-11-11T05:51:48Z

@bot /style

github-actions · 2025-11-11T05:52:12Z

Style fix runs successfully without any file modified.

DN6 · 2025-11-12T17:54:06Z

Gentle ping @yiyixuxu

sayakpaul

Just one question. We should also add a test so that we can remove the upcast_vae() method after the depcrecation cycle is complete.

sayakpaul · 2025-11-14T05:54:33Z

examples/community/lpw_stable_diffusion_xl.py

-            self.vae.post_quant_conv.to(dtype)
-            self.vae.decoder.conv_in.to(dtype)
-            self.vae.decoder.mid_block.to(dtype)
+        deprecate("`upcast_vae` is deprecated")


We should also guide the users about what they should do instead of using upcast_vae(). We should also set the version number of thel lib after which this should be removed.

Hmm I assumed that no one would be using this method standalone. But looks like there are a few instances where this method is called directly.
https://github.com/search?q=%22pipe.upcast_vae()%22+language:Python&type=code

Will update the deprecation message and handle the casting inside the method itself for backwards compatiblity.

This reverts commit 7390638.

This reverts commit 21a03f9.

jiqing-feng · 2025-12-02T05:52:34Z

Please let me know if you have any progress or anything I can help. Thanks!

sayakpaul

I left two questions. Once resolved, we should merge.

sayakpaul · 2025-12-03T07:34:35Z

examples/community/lpw_stable_diffusion_xl.py

    # Copied from diffusers.pipelines.stable_diffusion.pipeline_stable_diffusion_upscale.StableDiffusionUpscalePipeline.upcast_vae
    def upcast_vae(self):
-        dtype = self.vae.dtype
+        deprecate("upcast_vae", "1.0.0", "`upcast_vae` is deprecated. Please use `pipe.vae.to(torch.float32)`")


(nit): I think this warrants a better message. We could link to the reasoning being provided in the OP of this PR:

Suggested change

deprecate("upcast_vae", "1.0.0", "`upcast_vae` is deprecated. Please use `pipe.vae.to(torch.float32)`")

deprecate("upcast_vae", "1.0.0", "`upcast_vae` is deprecated. Please use `pipe.vae.to(torch.float32)`. For more details, please refer to: https://github.com/huggingface/diffusers/pull/12619#issue-3606633695.")

sayakpaul · 2025-12-03T07:36:48Z

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_upscale.py

        return latents

    def upcast_vae(self):
-        dtype = self.vae.dtype


Hmm, should the deprecation be not called from here, too? I am seeing some methods where the deprecation call exists and some where it doesn't (for example, this one). Is this expected?

update

21a03f9

DN6 requested a review from yiyixuxu November 10, 2025 07:55

update

7390638

sayakpaul mentioned this pull request Nov 11, 2025

stable_diffusion_xl_img2img failed due to dtype missmatch #12632

Closed

sayakpaul reviewed Nov 14, 2025

View reviewed changes

DN6 added 7 commits November 14, 2025 17:30

Revert "update"

9aea015

This reverts commit 7390638.

Revert "update"

2cf6dd1

This reverts commit 21a03f9.

update

799cf8d

update

5307ae2

update

9d1f757

update

d0b66ad

Merge branch 'main' into sdxl-vae-fix

b2e62d9

sayakpaul approved these changes Dec 3, 2025

View reviewed changes

update

68eee98

DN6 merged commit 1908c47 into main Dec 3, 2025
32 of 35 checks passed

asomoza mentioned this pull request Dec 16, 2025

Fix SDXL VAE decode latents dtype mismatch on non-MPS #12847

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deprecate `upcast_vae` in SDXL based pipelines#12619

Deprecate `upcast_vae` in SDXL based pipelines#12619
DN6 merged 10 commits intomainfrom
sdxl-vae-fix

DN6 commented Nov 10, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Nov 10, 2025

Uh oh!

DN6 commented Nov 11, 2025

Uh oh!

github-actions bot commented Nov 11, 2025 •

edited

Loading

Uh oh!

DN6 commented Nov 12, 2025

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul Nov 14, 2025

Uh oh!

DN6 Nov 14, 2025

Uh oh!

jiqing-feng commented Dec 2, 2025

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul Dec 3, 2025

Uh oh!

sayakpaul Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	deprecate("upcast_vae", "1.0.0", "`upcast_vae` is deprecated. Please use `pipe.vae.to(torch.float32)`")
	deprecate("upcast_vae", "1.0.0", "`upcast_vae` is deprecated. Please use `pipe.vae.to(torch.float32)`. For more details, please refer to: https://github.com/huggingface/diffusers/pull/12619#issue-3606633695.")

Conversation

DN6 commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Nov 10, 2025

Uh oh!

DN6 commented Nov 11, 2025

Uh oh!

github-actions bot commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DN6 commented Nov 12, 2025

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

DN6 Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

jiqing-feng commented Dec 2, 2025

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

sayakpaul Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

DN6 commented Nov 10, 2025 •

edited

Loading

github-actions bot commented Nov 11, 2025 •

edited

Loading