Re-land the PR of "Add INT8 SDPA path for CPU" #2215
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2215. Note: links to docs will display an error until the docs builds have completed. ✅ No failures as of commit 65f7d50 with merge base 96aec6a. This comment was automatically generated by Dr. CI and updates every 15 minutes.
from torchao.prototype.inductor.fx_passes.int8_sdpa_fusion import _int8_sdpa_init
from torchao.utils import TORCH_VERSION_AT_LEAST_2_7

use_cpp_avx512 = os.getenv("USE_AVX512", "0") == "1"
This feels wrong:

- If the user didn't set the flag during the build phase but only for testing, will it cause a CI failure?
- If the user built the custom op but didn't enable this flag for testing, will the tests just be skipped?

One way that comes to mind is to check whether this custom op has been registered to the CPU dispatch key correctly, for example torch._C._dispatch_dump("torchao::qscaled_dot_product"). Feel free to explore if you have any better idea.
Thanks for the suggestion, replaced with "CPU" in torch._C._dispatch_dump("torchao::qscaled_dot_product").
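For reference, a minimal sketch of how this gate might look in the tests (the op name comes from this thread; the helper and test-class names are illustrative):

```python
# Sketch only: gate the CPU tests on whether the custom op was actually
# built and registered; helper and test names here are illustrative.
import unittest

import torch


def cpu_kernel_registered(op: str = "torchao::qscaled_dot_product") -> bool:
    """True if the op has a kernel registered for the CPU dispatch key."""
    try:
        return "CPU" in torch._C._dispatch_dump(op)
    except RuntimeError:
        # The op was never registered at all, e.g. the C++ extension
        # was not compiled into this build.
        return False


@unittest.skipUnless(cpu_kernel_registered(), "INT8 SDPA CPU kernel not available")
class TestInt8SDPA(unittest.TestCase):
    ...
```

This skips cleanly when the kernel is absent and runs whenever it is registered, independent of which env flags were set at build time, which addresses both questions above.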
self.device, enabled=enable_autocast, dtype=torch.bfloat16
),
):
    _int8_sdpa_init()
For registering the custom pass, could we follow the suggestion in pytorch/pytorch#153532 (comment)?
Thanks and modified!
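For reference, a rough sketch of registering a custom Inductor post-grad pass, assuming the linked comment refers to Inductor's CustomGraphPass hook; the class name and pass body are illustrative, not this PR's actual code:

```python
import torch
from torch._inductor import config
from torch._inductor.custom_graph_pass import CustomGraphPass, get_hash_for_files


class Int8SDPAPass(CustomGraphPass):
    def __call__(self, graph: torch.fx.Graph) -> None:
        # Rewrite matched INT8 SDPA subgraphs in place (illustrative stub).
        ...

    def uuid(self) -> bytes:
        # Key the FX graph cache on this file so cached artifacts are
        # invalidated whenever the pass implementation changes.
        return get_hash_for_files((__file__,))


# Inductor runs this callable after its own post-grad passes.
config.post_grad_custom_post_pass = Int8SDPAPass()
```

Implementing uuid() is what lets the pass coexist with FX-graph caching, which is the usual sticking point with ad-hoc pass registration.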
test/test_ops.py (Outdated)
compute_max_diff,
)

use_cpp_avx512 = os.getenv("USE_AVX512", "0") == "1"
Ditto
Thanks for the suggestion, replaced with "CPU" in torch._C._dispatch_dump("torchao::qscaled_dot_product").
setup.py (Outdated)
@@ -55,6 +55,10 @@ def read_version(file_path="version.txt"):
    and platform.system() == "Darwin"
)

use_cpp_avx512 = os.getenv("USE_AVX512", "0") == "1"
This name might not be intuitive. This flag actually decides whether or not the CPP kernels are built.
Thanks. Changed to use_cpp_kernels.
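To illustrate, the renamed flag might be wired up in setup.py roughly like this; the env-var name comes from the diff above, while the gated source path is hypothetical:

```python
import os

# The name now describes what the flag does: build the C++ CPU kernels.
use_cpp_kernels = os.getenv("USE_AVX512", "0") == "1"

# Hypothetical use: only add the C++ sources when the flag is set.
cpp_sources = ["torchao/csrc/cpu/int8_sdpa.cpp"] if use_cpp_kernels else []
```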
Force-pushed from 080576b to 164a8ff (compare).
Force-pushed from 164a8ff to 65f7d50 (compare).
wheel build looks good
Re-land #1372.
Based on the original PR, there are two main modifications:
- Rename scaled_dot_product_int8 to qscaled_dot_product, in order to reuse the API for future FP8 SDPA support.
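A minimal sketch of the rename at a call site (arguments elided, since the op's full signature is not shown in this thread):

```python
import torch

# Formerly: torch.ops.torchao.scaled_dot_product_int8
q_sdpa_op = torch.ops.torchao.qscaled_dot_product
```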