[ET-VK] Integrate axis mapping into naive matrix multiplication shaders #5277
## Context

Give similar treatment as #5223 to integrate axis mapping into the naive matrix multiplication shaders. As with the previous diff, code cleanup is performed as well to consolidate shaders and improve code readability.

## Performance impact

Running the matrix multiplication operator benchmark, we can observe the following results:

| commit      | matmul_naive_texture3d_float | linear_naive_texture3d_float |
|-------------|------------------------------|------------------------------|
| master      | 6.53645                      | 6.98834                      |
| this commit | 6.61293                      | 6.34905                      |

Evidently, accounting for axis mapping did not have any significant adverse impact on shader latency.

Differential Revision: [D62518403](https://our.internmc.facebook.com/intern/diff/D62518403/)
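To put the "no significant adverse impact" claim in numbers, here is a minimal sketch that computes the relative change between the two rows of the benchmark table above. The helper name `relative_change` is hypothetical, and the sketch assumes the reported values are latencies where lower is better; the description does not state the unit.

```python
def relative_change(before: float, after: float) -> float:
    """Return the change from `before` to `after` as a percentage of `before`."""
    return (after - before) / before * 100.0


# (master, this commit) values copied from the benchmark table.
results = {
    "matmul_naive_texture3d_float": (6.53645, 6.61293),
    "linear_naive_texture3d_float": (6.98834, 6.34905),
}

# Positive means the value went up after this commit, negative means it went down.
changes = {
    name: relative_change(before, after)
    for name, (before, after) in results.items()
}

for name, pct in changes.items():
    print(f"{name}: {pct:+.2f}%")
```

Under that assumption, the matmul shader moves by roughly +1.17% and the linear shader by roughly -9.15%. Whether either difference exceeds run-to-run noise cannot be judged from a single pair of measurements.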
This pull request was exported from Phabricator. Differential Revision: D62518403
This pull request has been merged in 31e652d.
Stack from ghstack (oldest at bottom):
`virtual_transpose()` to `vTensor` for no copy transposition #5353