Skip to content

[ET-VK] Integrate axis mapping into naive matrix multiplication shaders #5277

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 5 commits into from

Conversation

SS-JIA
Copy link
Contributor

@SS-JIA SS-JIA commented Sep 11, 2024

Stack from ghstack (oldest at bottom):

Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

Performance impact

Running the matrix multiplication operator benchmark, we can observe the following results:

commit matmul_naive_texture3d_float linear_naive_texture3d_float
master 6.53645 6.98834
this commit 6.61293 6.34905

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: D62518403

## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Sep 11, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5277

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit c315784 with merge base 9845019 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 11, 2024
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D62518403

SS-JIA added a commit that referenced this pull request Sep 11, 2024
## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)

ghstack-source-id: 242059485
Pull Request resolved: #5277
…ation shaders"

## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)

[ghstack-poisoned]
SS-JIA added a commit that referenced this pull request Sep 12, 2024
Pull Request resolved: #5277

## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

ghstack-source-id: 242137430
@exported-using-ghexport

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D62518403

…ation shaders"

## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D62518403

…ation shaders"

## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D62518403

…ation shaders"

## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact


Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D62518403

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 31e652d.

@SS-JIA SS-JIA deleted the gh/SS-JIA/73/head branch January 24, 2025 19:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants