Parameter Sweep for Attention #1830

Merged
49 commits merged from attentionSweeps into develop on Jun 22, 2025

Conversation

@dorde-antic (Contributor) commented May 9, 2025

Implement a parameter sweep for attention (attentionSweeps.py) that tests combinations of input shapes and perfConfigs for attention and finds potential bugs. Adjust parameterSweeps.py so that attentionSweeps.py can reuse its methods.

Resolves https://github.com/ROCm/rocMLIR-internal/issues/1800
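
For context, the sweep boils down to sampling random combinations of attention shapes and perf configs and feeding each one to the MLIR driver. The sketch below only illustrates that approach and is not the actual attentionSweeps.py code; the class, field names, driver flags, and value ranges are all hypothetical.

```python
# Hypothetical sketch of the sweep idea; AttentionConfiguration, its fields,
# the driver flags, and the sampled value ranges are illustrative only.
import random
from dataclasses import dataclass
from typing import List

@dataclass
class AttentionConfiguration:
    batch: int
    seqLen: int
    headDim: int
    dataType: str
    perfConfig: str

    def generateMlirDriverArgs(self) -> List[str]:
        # Flag names here are placeholders for whatever the driver expects.
        return ['-t', self.dataType,
                '--batch', str(self.batch),
                '--seq-len', str(self.seqLen),
                '--head-dim', str(self.headDim),
                '--perf-config', self.perfConfig]

def sampleConfigs(samples: int, seed: int = 0) -> List[AttentionConfiguration]:
    """Randomly sample attention configurations to test."""
    rng = random.Random(seed)
    configs = []
    for _ in range(samples):
        configs.append(AttentionConfiguration(
            batch=rng.choice([1, 2, 4, 8]),
            seqLen=rng.choice([64, 128, 256, 512, 1024]),
            headDim=rng.choice([32, 64, 128]),
            dataType=rng.choice(['f16', 'f32']),
            perfConfig=rng.choice(['configA', 'configB'])))  # placeholder perf configs
    return configs
```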

@dorde-antic changed the title from Attention sweeps to Parameter Sweep for Attention on May 9, 2025
@dorde-antic marked this pull request as ready for review on May 15, 2025 13:09
@dorde-antic requested a review from causten as a code owner on May 15, 2025 13:09

codecov bot commented May 15, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #1830      +/-   ##
===========================================
- Coverage    78.52%   77.99%   -0.52%     
===========================================
  Files          100      100              
  Lines        29907    30057     +150     
  Branches      4452     4656     +204     
===========================================
- Hits         23482    23442      -40     
- Misses        4590     4601      +11     
- Partials      1835     2014     +179     
Flag      Coverage Δ
mfma      77.99% <ø> (-0.52%) ⬇️
navi3x    77.99% <ø> (?)
navi4x    77.99% <ø> (?)

Flags with carried forward coverage won't be shown.

see 34 files with indirect coverage changes


Copilot AI (Contributor) left a comment:

Pull Request Overview

This PR introduces a parameter sweep for attention-based kernels to test various input shapes and performance configuration combinations and to uncover potential bugs.

  • Implements a parameter sweep generator for testing attention configurations.
  • Adds asynchronous test runners and logging for failing configurations (sketched after this list).
  • Provides a command-line interface for configuring the sweep.
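
As a rough illustration of the second bullet (a hedged sketch only; runOneConfig, the log file name, and the command format are assumptions, not the PR's actual code), the runner can fan sampled configurations out across worker threads and log any combination that fails:

```python
# Hypothetical sketch of parallel test running with failure logging; the
# helper names and log format are illustrative, not the PR's actual code.
import concurrent.futures
import logging
import subprocess
from typing import List, Tuple

logging.basicConfig(filename='failing_configs.log', level=logging.INFO)

def runOneConfig(cmd: List[str]) -> Tuple[List[str], bool]:
    """Run one sampled configuration and report whether it succeeded."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    return cmd, result.returncode == 0

def runSweep(commands: List[List[str]], jobs: int) -> None:
    """Run all sampled configurations in parallel, logging the failing ones."""
    with concurrent.futures.ThreadPoolExecutor(max_workers=jobs) as pool:
        for cmd, ok in pool.map(runOneConfig, commands):
            if not ok:
                logging.info('FAILED: %s', ' '.join(cmd))
```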
Comments suppressed due to low confidence (1)

mlir/utils/performance/attentionSweeps.py:66

  • [nitpick] Ensure that attribute names in generateMlirDriverArgs are consistent with those set in AttentionConfiguration (e.g., 'dataType' vs 'dtype' and 'numCU' vs 'numCu'). Consistent naming helps avoid confusion and potential runtime errors.
'-t', self.dataType,
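
To make the nitpick concrete (a minimal hypothetical sketch, not the code under review): define each attribute once and read it back under exactly the same spelling, so a mismatch like self.dtype vs self.dataType or self.numCu vs self.numCU cannot surface as an AttributeError in the middle of a sweep.

```python
# Hypothetical sketch of the naming concern; the flag spellings are illustrative.
from dataclasses import dataclass
from typing import List

@dataclass
class AttentionConfiguration:
    dataType: str  # set under this exact name ...
    numCU: int     # ... and with this exact capitalization

    def generateMlirDriverArgs(self) -> List[str]:
        # Read the attributes back under the same names they were defined with;
        # self.dtype or self.numCu would only fail when this method runs.
        return ['-t', self.dataType, '--num-cu', str(self.numCU)]
```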

@dorde-antic requested a review from dhernandez0 on June 16, 2025 20:32
parser.add_argument('--quiet', action='store_true')
parser.add_argument('--jobs', type=int, default=os.cpu_count())
parser.add_argument('--mlir-build-dir', type=str, required=True)
parser.add_argument('--samples', type=int, default=1000)
Contributor:

I wonder if we should use a time limit instead of a number of samples? Different machines will take different amounts of time for 1k samples. @umangyadav

@dorde-antic (Contributor, Author) replied on Jun 19, 2025:

> I wonder if we should use a time limit instead of a number of samples? Different machines will take different amounts of time for 1k samples. @umangyadav @dhernandez0

There is an option to add a timeout to the CI function that would call attentionSweeps.py, and that can be used independently of the number of samples.
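
For what it's worth, a wall-clock budget could also live inside the sweep script itself, so it stops cleanly instead of being killed by a CI timeout. A minimal sketch, assuming a hypothetical --time-limit flag on top of the existing options (the flag does not exist in the current script):

```python
# Hypothetical sketch: stop sampling once a wall-clock budget is exhausted.
# --time-limit is illustrative only; attentionSweeps.py does not expose it.
import argparse
import os
import time

parser = argparse.ArgumentParser()
parser.add_argument('--quiet', action='store_true')
parser.add_argument('--jobs', type=int, default=os.cpu_count())
parser.add_argument('--mlir-build-dir', type=str, required=True)
parser.add_argument('--samples', type=int, default=1000)
parser.add_argument('--time-limit', type=float, default=None,
                    help='optional wall-clock budget in seconds')
args = parser.parse_args()

deadline = (time.monotonic() + args.time_limit) if args.time_limit else None
for i in range(args.samples):
    if deadline is not None and time.monotonic() >= deadline:
        print(f'Time limit reached after {i} samples; exiting cleanly.')
        break
    # ... run the i-th sampled configuration here ...
```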

@dorde-antic (Contributor, Author):

Maybe void parameterSweep(...) in CI could be changed to something like:

void parameterSweep(String CONFIG, String codepath, String sweepType = "parameter") {
    timeout(time: 300, activity: true, unit: 'MINUTES') {
        dir('build') {
            if (sweepType == "attention") {
                sh """python3 ./bin/attentionSweeps.py"""
            } else {
                sh """python3 ./bin/parameterSweeps.py -j 5 ${CONFIG}"""
            }
        }
    }
}

It should be time-limited then, I think...

and the call in the stage would be something like:

stage("Parameter Sweep") {
                        steps {
                            script {
                         // ...
                                    ]) {
                                        parameterSweep("conv_structure", "${CODEPATH}")
                                        parameterSweep("perf_config", "${CODEPATH}")
                                        parameterSweep("", "${CODEPATH}", "attention")
                                    }
                                }
                            }
                        }       
                    }

Contributor:

But I think if it times out, it will make CI fail?

@dhernandez0 (Contributor) left a comment:

LGTM, thanks for creating this script, great work!

@dorde-antic (Contributor, Author) commented Jun 21, 2025

TODO (in follow-up tickets/PRs):

  • Add a return value to initializeDataTypesAttention() in perfRunner so that it can be used easily in other modules (see the sketch below).
  • Invoke initializeDataTypesAttention() in attentionSweeps.py.
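
A minimal sketch of the first TODO item (the body of initializeDataTypesAttention() in perfRunner is not shown in this PR, so the list contents below are assumptions): return the data types instead of only populating module state, so attentionSweeps.py can import and call it directly.

```python
# Hypothetical sketch of the follow-up; the actual data types kept in
# perfRunner may differ.
def initializeDataTypesAttention():
    dataTypes = ['f16', 'f32']  # assumed set of attention data types
    return dataTypes

# attentionSweeps.py could then reuse it directly:
#   from perfRunner import initializeDataTypesAttention
#   dataTypes = initializeDataTypesAttention()
```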

@dorde-antic merged commit 2c251fd into develop on Jun 22, 2025
15 of 22 checks passed
@dorde-antic deleted the attentionSweeps branch on June 22, 2025 21:06