ops: limit return of requants (Fix perf of some fp8 dynamic_vram workflows) by rattus128 · Pull Request #12506 · Comfy-Org/ComfyUI

rattus128 · 2026-02-17T15:48:14Z

This check was far too broad, and the dtype is not a reliable indicator of wanting the requant (QT returns the compute dtype as its dtype). So explicitly plumb whether fp8mm wants the requant or not.

Example Test Conditions:

Windows, RTX5060, 32GB RAM, --fast dynamic_vram
WAN2.1 fp8 scaled 14B + Lora

Before:

Requested to load WAN21
0 models unloaded.
Model WAN21 prepared for dynamic VRAM loading. 13630MB Staged. 1053 patches attached.
100%|████████████████████████████████████████████████████████████████████████████████████| 4/4 [03:38<00:00, 54.52s/it]
Requested to load WanVAE
Model WanVAE prepared for dynamic VRAM loading. 242MB Staged. 0 patches attached.
Prompt executed in 250.19 seconds

After:

Requested to load WAN21
0 models unloaded.
Model WAN21 prepared for dynamic VRAM loading. 13630MB Staged. 1053 patches attached.
100%|████████████████████████████████████████████████████████████████████████████████████| 4/4 [03:16<00:00, 49.10s/it]
Requested to load WanVAE
Model WanVAE prepared for dynamic VRAM loading. 242MB Staged. 0 patches attached.
Prompt executed in 229.95 seconds
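
For reference, the relative improvement can be worked out from the two logs above. This is a small sketch that uses only the numbers printed in those logs; the helper name `reduction_pct` is made up for the example.

```python
# Figures copied from the "Before" and "After" logs.
before_s_per_it, after_s_per_it = 54.52, 49.10
before_total_s, after_total_s = 250.19, 229.95


def reduction_pct(before: float, after: float) -> float:
    """Percentage reduction going from `before` to `after`."""
    return 100.0 * (before - after) / before


print(f"per-iteration: {reduction_pct(before_s_per_it, after_s_per_it):.1f}% faster")
print(f"whole prompt:  {reduction_pct(before_total_s, after_total_s):.1f}% faster")
# per-iteration: 9.9% faster
# whole prompt:  8.1% faster
```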

ops: limit return of requants

ee2ac7f

rattus128 requested review from Kosinkadink, comfyanonymous and guill as code owners February 17, 2026 15:48

comfyanonymous merged commit 58dcc97 into Comfy-Org:master Feb 17, 2026
12 checks passed

comfyanonymous merged 1 commit into Comfy-Org:master from rattus128:prs/dynamic-vram-fixes/dont-requant
