[Inductor] Fix consolidating _scaled_mm into mm template TMA error #150686

PaulZhang12 · 2025-04-04T16:10:15Z

Summary: The previous diff broke a few tests that did not run on internal or GH CI (T220169086); this change fixes them. Based on other examples, the {% if %} block is only supposed to support autotuned parameters (constexpr) and should not be used for locals.
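For readers unfamiliar with the distinction, here is a minimal sketch of why a render-time branch cannot depend on a kernel local. It does not use Inductor's or Jinja's real APIs: `render_kernel`, `compile_kernel`, and the doubling step are hypothetical stand-ins, and `USE_FAST_ACCUM` is only an illustrative parameter name. The assumption is that a template conditional is resolved while the kernel source is being generated, so it can only see values supplied at that point.

```python
# Hypothetical stand-in for a templating step; NOT Inductor's real API.
def render_kernel(params: dict, flag: str) -> str:
    lines = [
        "def kernel(a, b):",
        "    acc = a * b  # a local: it only exists once the kernel runs",
    ]
    # Render-time branch (the role {% if %} plays in a template):
    # it can only consult values present in `params` right now.
    if params.get(flag):
        lines.append("    acc = acc * 2  # placeholder for an alternate code path")
    lines.append("    return acc")
    return "\n".join(lines)


def compile_kernel(params: dict, flag: str):
    # Turn the generated source into a callable for demonstration purposes.
    namespace = {}
    exec(render_kernel(params, flag), namespace)
    return namespace["kernel"]


# Branching on a render-time parameter selects different generated code.
with_flag = compile_kernel({"USE_FAST_ACCUM": True}, "USE_FAST_ACCUM")
without_flag = compile_kernel({"USE_FAST_ACCUM": False}, "USE_FAST_ACCUM")

# Branching on a kernel local ("acc") finds nothing at render time,
# so the alternate path is dropped without any error.
on_local = compile_kernel({"USE_FAST_ACCUM": True}, "acc")
```

In this sketch the branch keyed on `acc` is dropped without complaint, which is the kind of failure that is easy to miss unless a test exercises that exact configuration.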

Test Plan: buck test 'fbcode//mode/opt' fbcode//caffe2/test/inductor:fp8 -- --exact 'caffe2/test/inductor:fp8 - test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_True (caffe2.test.inductor.test_fp8.TestFP8Lowering)'

Reviewed By: NikhilAPatel

Differential Revision: D72460516

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

pytorch-bot · 2025-04-04T16:10:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150686

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 0c678a5 with merge base f443035:

NEW FAILURE - The following job has failed:

pull / cuda12.4-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu) (gh)
REGRESSION: benchmark ('symint_sum_loop', 'compile_time_instruction_count') failed, actual result 11125036684 is 168.59% higher than expected 4142000000 ±+1.50% if this is an expected regression, please update the expected results.

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

inductor / unit-test / linux-jammy-cpu-py3.9-gcc11-inductor / test (inductor_avx2, 1, 2, linux.10xlarge.avx2) (gh) (trunk failure)
'Test'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-04-04T16:10:27Z

This pull request was exported from Phabricator. Differential Revision: D72460516

facebook-github-bot · 2025-04-04T21:54:16Z

@pytorchbot merge -i

(Initiating merge automatically since Phabricator Diff has merged, merging with -i because oss signals were bypassed internally)

pytorchmergebot · 2025-04-04T21:56:07Z

Merge started

Your change will be merged while ignoring the following 2 checks: pull / cuda12.4-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu), inductor / unit-test / linux-jammy-cpu-py3.9-gcc11-inductor / test (inductor_avx2, 1, 2, linux.10xlarge.avx2)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

…ytorch#150686)

Summary: The previous diff broke a few tests that didn't run on internal or GH CI: T220169086, this fixes that issue. The {% if } block is only supposed to support autotuned parameters (constexpr), and should not be used for locals based on other examples.

Test Plan: buck test 'fbcode//mode/opt' fbcode//caffe2/test/inductor:fp8 -- --exact 'caffe2/test/inductor:fp8 - test_tensorwise_scaling_bfloat16_shape_16,32,32_has_bias_False_use_fast_accum_True_persistent_matmul_True (caffe2.test.inductor.test_fp8.TestFP8Lowering)'

Reviewed By: NikhilAPatel

Differential Revision: D72460516

Pull Request resolved: pytorch#150686

Approved by: https://github.com/eellison, https://github.com/NikhilAPatel

pytorch-bot bot added ciflow/inductor module: inductor labels Apr 4, 2025

facebook-github-bot added the fb-exported label Apr 4, 2025

PaulZhang12 added the topic: not user facing topic category label Apr 4, 2025

eellison approved these changes Apr 4, 2025

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Apr 4, 2025

NikhilAPatel self-assigned this Apr 4, 2025

NikhilAPatel approved these changes Apr 4, 2025

pytorchmergebot added the merging label Apr 4, 2025

pytorchmergebot added the Merged label Apr 4, 2025

pytorchmergebot closed this in 2a2ddff Apr 4, 2025

pytorchmergebot removed the merging label Apr 4, 2025
