[Torch] Support mask mod arguments in flex_attention HOP by keshavvinayak01 · Pull Request #4570 · llvm/torch-mlir

keshavvinayak01 · 2026-05-13T14:40:32Z

In traced/exported FX graphs, Dynamo represents mask_mod closure captures for the flex_attention HOP as the final mask_mod_other_buffers argument. Previously, torch.hop_flex_attention modelled only the mask function symbol, so those captures could not be represented in Torch IR.
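As a rough illustration of what a "closure capture" means here, the sketch below builds a mask_mod that closes over a buffer and then lists that captured value using plain Python introspection. It is not how Dynamo lifts captures internally. The helper names `closure_captures` and `make_mask_mod` are hypothetical, and a Python list stands in for a tensor.

```python
def closure_captures(fn):
    """Return {name: value} for every free variable captured by fn."""
    cells = fn.__closure__ or ()
    names = fn.__code__.co_freevars
    return {name: cell.cell_contents for name, cell in zip(names, cells)}


def make_mask_mod(doc_ids):
    # doc_ids is captured by the closure; in a traced graph a value like
    # this is what would have to travel as an explicit extra buffer.
    def mask_mod(b, h, q_idx, kv_idx):
        # Allow attention only between positions in the same document.
        return doc_ids[q_idx] == doc_ids[kv_idx]

    return mask_mod


mask_mod = make_mask_mod([0, 0, 1, 1])  # list used as a stand-in for a tensor
captures = closure_captures(mask_mod)
```

Here `captures` holds the single captured buffer under the name `doc_ids`, which is the kind of value the description says ends up in mask_mod_other_buffers.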

This adds trailing variadic mask_mod_other_buffers operands to torch.hop_flex_attention and prints them with explicit mask_mod_other_buffers(...) syntax. The FX importer forwards only exported mask_mod_other_buffers values into those operands. score_mod_other_buffers are named and rejected with an explicit unsupported error instead of being silently dropped.
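The importer behaviour described above can be sketched in plain Python. This is a minimal sketch, not the code in fx_importer.py. It assumes that the HOP argument list ends with score_mod_other_buffers followed by mask_mod_other_buffers, and that only a non-empty score_mod_other_buffers is rejected. The function name `select_mask_mod_buffers` is hypothetical.

```python
from typing import Any, Sequence, Tuple


def select_mask_mod_buffers(hop_args: Sequence[Any]) -> Tuple[Any, ...]:
    """Pick the values to forward as trailing mask_mod_other_buffers operands.

    Assumed layout (an assumption of this sketch, not taken from the PR):
    (..., score_mod_other_buffers, mask_mod_other_buffers).
    """
    if len(hop_args) < 2:
        raise ValueError("expected trailing score_mod/mask_mod buffer tuples")

    score_mod_other_buffers = tuple(hop_args[-2])
    mask_mod_other_buffers = tuple(hop_args[-1])

    # Fail loudly instead of silently dropping unsupported captures.
    if score_mod_other_buffers:
        raise NotImplementedError(
            "flex_attention: score_mod_other_buffers are not supported "
            f"(got {len(score_mod_other_buffers)} value(s))"
        )
    return mask_mod_other_buffers


# Placeholder strings stand in for FX nodes / values.
example_args = ("q", "k", "v", "score_mod", "block_mask", 1.0, {}, (), ("buf0", "buf1"))
forwarded = select_mask_mod_buffers(example_args)
```

Naming the rejected argument in the error message is the point of the design: a user whose score_mod captures a tensor sees why the import failed rather than getting IR that quietly ignores the capture.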

zjgarvey

I'm mostly concerned about the silently dropped arg.

Would you also be willing to add verifier logic that at least checks the operand count against the mask_mod_fn arity (or something similarly meaningful to cover the new case)?
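One way to picture the requested check is the sketch below, written in Python for illustration only (a real verifier would live in the dialect's C++). It assumes the mask function takes four index arguments (b, h, q_idx, kv_idx) followed by one argument per extra buffer. That arity convention and the name `verify_mask_mod_operands` are assumptions, not taken from the PR.

```python
from typing import Optional

# Assumption: a mask function takes (b, h, q_idx, kv_idx) before any buffers.
BASE_MASK_MOD_ARITY = 4


def verify_mask_mod_operands(
    mask_fn_num_inputs: int, num_mask_mod_other_buffers: int
) -> Optional[str]:
    """Return an error message if the counts disagree, else None."""
    expected = BASE_MASK_MOD_ARITY + num_mask_mod_other_buffers
    if mask_fn_num_inputs != expected:
        return (
            f"mask_mod_fn takes {mask_fn_num_inputs} input(s) but "
            f"{expected} expected ({BASE_MASK_MOD_ARITY} indices + "
            f"{num_mask_mod_other_buffers} mask_mod_other_buffers)"
        )
    return None


ok = verify_mask_mod_operands(mask_fn_num_inputs=5, num_mask_mod_other_buffers=1)
bad = verify_mask_mod_operands(mask_fn_num_inputs=4, num_mask_mod_other_buffers=1)
```

Under these assumptions `ok` is None, while `bad` carries a diagnostic that names both counts, which makes a mismatch between the op's trailing operands and the referenced function easy to spot.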

Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>

keshavvinayak01

Thanks for the reviews, please check again.

keshavvinayak01 force-pushed the flex-mask-mod-args branch from 2520a91 to 6bec667 Compare May 13, 2026 14:47

keshavvinayak01 marked this pull request as ready for review May 13, 2026 15:26

keshavvinayak01 requested review from Groverkss, IanWood1, MaheshRavishankar, rsuderman, sjain-stanford and sommerlukas May 13, 2026 15:26

keshavvinayak01 force-pushed the flex-mask-mod-args branch 2 times, most recently from 5ff3fba to 819aea0 Compare May 13, 2026 15:35

rkayaith reviewed May 14, 2026

Comment thread python/torch_mlir/extras/fx_importer.py Outdated

Comment thread python/torch_mlir/extras/fx_importer.py Outdated

Comment thread include/torch-mlir/Dialect/Torch/IR/TorchOps.td Outdated

Comment thread test/Dialect/Torch/ops.mlir Outdated

rkayaith requested a review from zjgarvey May 14, 2026 16:22

zjgarvey requested changes May 14, 2026

Comment thread python/torch_mlir/extras/fx_importer.py

Comment thread python/torch_mlir/extras/fx_importer.py Outdated

Comment thread include/torch-mlir/Dialect/Torch/IR/TorchOps.td

[Torch] Support mask mod arguments in flex_attention HOP

e780e23

Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>

keshavvinayak01 force-pushed the flex-mask-mod-args branch from 819aea0 to e780e23 Compare May 18, 2026 11:35

[Torch] Fix flex_attention formatting

adbfa1b

Signed-off-by: Keshav Vinayak Jha <keshavvinayakjha@gmail.com>

keshavvinayak01 commented May 18, 2026

keshavvinayak01 requested review from rkayaith and zjgarvey and removed request for rkayaith May 18, 2026 12:59

keshavvinayak01 wants to merge 2 commits into llvm:main from iree-org:flex-mask-mod-args

3 participants
