refactor: estimate working vae memory during encode/decode #8438
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
Extract logic to estimate working memory to utility functions. Then, instead of requiring the estimate to be passed as an arg to all encode calls (e.g. static method
ImageToLatentsInvocation.encode()
), do the estimation within the encode calls. This makes it impossible to not handle the memory estimation when doing VAE stuff.Related Issues / Discussions
Also fixes an issue introduced in #8414 where some calls to VAE encode didn't pass estimate working memory: https://discord.com/channels/1020123559063990373/1149506274971631688/1405975664359903403
QA Instructions
I manually tested all code paths that do VAE stuff:
Merge Plan
n/a
Checklist
What's New
copy (if doing a release after this PR)