Add rock.blockwise_load_tile to encapsulate the logic of loads #1988

dhernandez0 · 2025-09-15T15:51:10Z

Motivation

This PR encapsulates the logic for loading from memory (including the store to LDS and the load from LDS) so that the same logic can be reused for attention.

Technical Details

Make the LDS inputs optional for BlockwiseGemmAccelOp.
Add GemmLoadTileType to describe the loading behaviors (Default, DoubleBuffer, BypassLDS).
Add a new BlockwiseLoadTileOp to encapsulate the loading logic.
Add BlockwiseLoadTileToThreadwisePass to lower BlockwiseLoadTileOp to the desired loading behavior (based on GemmLoadTileType).
ToBlockwise: refactor gemm and attention to use common functions, and use BlockwiseLoadTileOp to encapsulate loading for gemm.
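
As a rough illustration of how a lowering could branch on GemmLoadTileType, here is a minimal standalone C++ sketch. It is not the rocMLIR implementation: the enum is a hypothetical stand-in for the attribute, `usesLds` and `stagesFor` are made-up helpers, and the stage names are placeholders. The only facts taken from this PR are the three type names and that loading involves a read from memory, a store to LDS and a load from LDS. How DoubleBuffer differs from Default is not modeled.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for the GemmLoadTileType attribute named in this PR.
enum class GemmLoadTileType { Default, DoubleBuffer, BypassLDS };

// Hypothetical helper: assumes only BypassLDS skips LDS, as its name suggests.
bool usesLds(GemmLoadTileType type) {
  return type != GemmLoadTileType::BypassLDS;
}

// Hypothetical helper: lists the data-movement stages a tile load might
// expand to. Stage names are placeholders, not real rock ops.
std::vector<std::string> stagesFor(GemmLoadTileType type) {
  if (!usesLds(type)) {
    // Assumed: the tile goes from global memory straight to registers.
    return {"global_read"};
  }
  // Assumed for both Default and DoubleBuffer: read from global memory,
  // store to LDS, then load from LDS. Any buffering difference between
  // the two types is deliberately left out of this sketch.
  return {"global_read", "lds_write", "lds_read"};
}
```

Under these assumptions, `stagesFor(GemmLoadTileType::BypassLDS)` yields a single stage, which is consistent with the LDS inputs of BlockwiseGemmAccelOp becoming optional.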

Test Plan

Add tests for the new pass.
Add a test for ToBlockwise (as it has changed significantly).
Compare performance with the develop branch to verify that this PR introduces no slowdown.

Test Result

Tests pass

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

mlir/include/mlir/Dialect/Rock/IR/RockOps.td

mlir/include/mlir/Dialect/Rock/Passes.td

mlir/test/Dialect/Rock/loadtile_to_threadwise_lowering.mlir

mlir/test/Dialect/Rock/toblockwise_gemm_accel_lowering.mlir

justinrosner · 2025-10-06T14:59:15Z

mlir/include/mlir/Dialect/Rock/IR/RockOps.td

-                                          RockGemmFeaturesInterface>]>,
-      Arguments<(ins MemRefOf<LdsBufferTypes>:$matrixA,
-          MemRefOf<LdsBufferTypes>:$matrixB, I32Attr:$inMPerThread,
+    : Rock_Op<"blockwise_gemm_accel", [AttrSizedOperandSegments,


Do we want this to also implement MemoryEffectsOpInterface?

That's a good point. GemmOp and ThreadwiseAccelGemmOp do implement it; I'm not sure why this one doesn't. I'll do that.

BlockwiseGemmOp implements it as well, so this looks like a bug.

justinrosner · 2025-10-06T15:02:28Z

mlir/lib/Dialect/Rock/Transforms/BlockwiseLoadTileToThreadwise.cpp

+//
+//===----------------------------------------------------------------------===//
+//
+// This pass lowers `rock.blockwise_load_tile` to rock.threadwise_* ops.


Can we have a more detailed description of what this pass does here?

pabloantoniom · 2025-10-07T09:54:05Z

Code LGTM! Did you check that there is no performance variation?

dhernandez0 · 2025-10-09T09:24:12Z

Code LGTM! Did you check that there is no performance variation?

Yes, I just finished doing that. Most of the tier1 gemm list problem configs generate the exact same assembly. About 18% of the problem configs show minor assembly changes but no performance difference.

dhernandez0 requested a review from causten as a code owner September 15, 2025 15:51

dhernandez0 self-assigned this Sep 15, 2025

dhernandez0 mentioned this pull request Sep 15, 2025

Use rock.blockwise_load_tile for attention #1980

Open

1 task

Base automatically changed from remove_reversegrid to develop September 15, 2025 16:08

dhernandez0 requested review from justinrosner and umangyadav September 15, 2025 16:08

dhernandez0 force-pushed the 1947-part1 branch from 878b9bd to 94053d5 Compare September 15, 2025 16:14

dhernandez0 force-pushed the 1947-part1 branch 2 times, most recently from 9479ede to 5d9b627 Compare September 23, 2025 11:21

dhernandez0 requested a review from pabloantoniom September 23, 2025 11:26

pabloantoniom reviewed Sep 25, 2025

mlir/include/mlir/Dialect/Rock/IR/RockOps.td (outdated)

pabloantoniom reviewed Sep 25, 2025

mlir/include/mlir/Dialect/Rock/Passes.td

pabloantoniom reviewed Sep 25, 2025

mlir/include/mlir/Dialect/Rock/IR/RockOps.td

pabloantoniom reviewed Oct 3, 2025

mlir/include/mlir/Dialect/Rock/IR/RockOps.td

dhernandez0 force-pushed the 1947-part1 branch 2 times, most recently from bf499a0 to 0308ca1 Compare October 6, 2025 09:35

justinrosner reviewed Oct 6, 2025

dhernandez0 force-pushed the 1947-part1 branch from 6f08a8c to 86578df Compare October 7, 2025 12:44

dhernandez0 force-pushed the 1947-part1 branch from 86578df to 2c72ac3 Compare October 13, 2025 08:52

pabloantoniom approved these changes Oct 13, 2025

Add rock.blockwise_load_tile to encapsulate the logic of how we load from device memory

85413bb

dhernandez0 force-pushed the 1947-part1 branch from 2c72ac3 to 85413bb Compare October 13, 2025 09:29

Merge branch 'develop' into 1947-part1

ce241a2

dhernandez0 merged commit a9de961 into develop Oct 14, 2025
8 of 16 checks passed

dhernandez0 deleted the 1947-part1 branch October 14, 2025 07:43
