Skip to content

Add MXFP8 Support and Consolidated stable scaled mm API#26

Merged
comfyanonymous merged 11 commits intoComfy-Org:mainfrom
contentis:scaled_mm_v2
Jan 24, 2026
Merged

Add MXFP8 Support and Consolidated stable scaled mm API#26
comfyanonymous merged 11 commits intoComfy-Org:mainfrom
contentis:scaled_mm_v2

Conversation

@contentis
Copy link
Contributor

Updated scaled mm API

  • With torch 2.10 scaled_mm was introduced as stable API
  • Added wrapper to pipe through the stble API if the torch version is sufficient, fallback to experimental _scaled_mm otherwise (current behaviour)

MXFP8

  • Add MXFP8 (Microscaling FP8) block quantization support with E8M0 (power-of-2) scales and 32-element blocks, targeting SM >= 10.0 (Blackwell)
  • Introduce TensorCoreMXFP8Layout with full operator dispatch for mm, addmm, and linear

@comfyanonymous comfyanonymous merged commit 7019af5 into Comfy-Org:main Jan 24, 2026
14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants