Scavallari fp8 support #10868

andompesta · 2025-11-25T00:27:42Z

this PR aims at solving some bug related to FP8 such as:

scaling factor allocation to the correct device
adding support for casting to QuantizeTensor
proper Fp8 scaling factor implementation
enable FP8 mixedprecision for text-encoders

…ctor

comfyanonymous · 2025-11-25T01:01:12Z

comfy/quant_ops.py

        scale = scale.to(device=tensor.device, dtype=torch.float32)

-        tensor_scaled = tensor * (1.0 / scale).to(tensor.dtype)
+        tensor_fp32 = tensor.to(torch.float32)


casting to fp32 here causes some pretty large slowdowns that make the fp8 ops as slow as 16 bit.

Kosinkadink · 2025-11-26T01:44:54Z

comfy modified this PR in #10872 and merged it in, so closing this PR

andompesta added 3 commits November 25, 2025 01:22

fix fp8 mixed-precision loading issue

db730ee

add generic support for quantize tensor casting and proper scaling fa…

b95c05d

…ctor

enable test encoders to load Fp8 mixed precision

277035b

andompesta requested a review from Kosinkadink as a code owner November 25, 2025 00:27

comfyanonymous reviewed Nov 25, 2025

View reviewed changes

comfyanonymous mentioned this pull request Nov 25, 2025

Cleanup and fix issues with text encoder quants. #10872

Merged

Kosinkadink closed this Nov 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Scavallari fp8 support #10868

Scavallari fp8 support #10868

Uh oh!

andompesta commented Nov 25, 2025

Uh oh!

comfyanonymous Nov 25, 2025

Uh oh!

Kosinkadink commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Scavallari fp8 support #10868

Scavallari fp8 support #10868

Uh oh!

Conversation

andompesta commented Nov 25, 2025

Uh oh!

comfyanonymous Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

Kosinkadink commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants