Arm backend: Cherry-pick tosa patch to speed up large models #15592

ArmRyan · 2025-11-05T11:40:35Z

Only affects models larger than 2 GB.

Change-Id: Id5ec79eecc4f1ae8d864e35ed13b34bbf761d30b

cc @freddan80 @per @zingo @oscarandersson8218 @digantdesai

* Only affects models >2gb Signed-off-by: Ryan O'Shea <ryan.oshea3@arm.com> Change-Id: Id5ec79eecc4f1ae8d864e35ed13b34bbf761d30b

pytorch-bot · 2025-11-05T11:40:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15592

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Cancelled Job, 6 Unrelated Failures

As of commit 9650adc with merge base 964515c ():

CANCELLED JOB - The following job was cancelled. Please retry:

pull / test-binary-size-linux-gcc / linux-job (gh)

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-setup-linux-gcc / linux-job (gh) (similar failure)
##[error]The operation was canceled.
Test CUDA Builds / test-models-cuda (add) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-openvino-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest / windows / windows-job (gh) (trunk failure)
backends/xnnpack/test/ops/test_conv1d.py::TestConv1d::test_qs8_conv1d_batchnorm_seq
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
backends/xnnpack/test/ops/test_conv1d.py::TestConv1d::test_qs8_conv1d_batchnorm_seq
trunk / unittest-release / windows / windows-job (gh) (trunk failure)
backends/xnnpack/test/ops/test_conv1d.py::TestConv1d::test_qs8_conv1d_batchnorm_seq

This comment was automatically generated by Dr. CI and updates every 15 minutes.

zingo · 2025-11-05T14:17:31Z

@SS-JIA here is a change bumping TOSA slightly that you might also want to test. We think it will still work if you prefer to keep the old version internally; this change should only be needed for larger models that are not yet in the test flow.

ArmRyan · 2025-11-05T14:45:17Z

@SS-JIA here is a change bumping TOSA slightly that you might also want to test. We also think it will work if you keep the old version internally; this change should only be needed for larger models that are not yet in the test flow.

This is just a two-line change in an existing function. It doesn't add anything new or change any behaviour; it just reserves memory correctly. Nothing changes functionally, and there are no new files or structural changes, so it won't affect anything else.
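The patch itself is not shown in this thread, so the following is only a generic sketch of why reserving memory up front can matter for very large buffers. It is not the TOSA code; `pack_growing` and `pack_reserved` are hypothetical names made up for illustration. Both functions produce identical bytes, but the second sizes its buffer once instead of letting it grow chunk by chunk.

```python
def pack_growing(chunks):
    """Append each chunk to a buffer that starts empty and grows on demand."""
    buf = bytearray()
    for chunk in chunks:
        # Growing may reallocate and copy the data already written.
        buf += chunk
    return bytes(buf)


def pack_reserved(chunks):
    """Size the buffer once up front, then copy each chunk into place."""
    total = sum(len(chunk) for chunk in chunks)
    buf = bytearray(total)  # single allocation of the final size
    view = memoryview(buf)
    offset = 0
    for chunk in chunks:
        view[offset:offset + len(chunk)] = chunk
        offset += len(chunk)
    return bytes(buf)


# Hypothetical sample data: the output is the same either way.
sample = [b"ab", b"", b"cde"]
same_output = pack_growing(sample) == pack_reserved(sample)
```

Because the output is byte-for-byte the same, a change of this kind would be expected to alter only allocation behaviour, which is consistent with the description above of a change with no functional effect.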

YufengShi-dudu

It has been tested internally and shows a significant speedup on large models.


Arm backend: Cherry-pick tosa patch to speed up large models

9650adc


ArmRyan requested review from YufengShi-dudu and per November 5, 2025 11:40

ArmRyan requested a review from digantdesai as a code owner November 5, 2025 11:40

ArmRyan added the `partner: arm`, `ciflow/trunk`, and `release notes: arm` labels Nov 5, 2025

meta-cla bot added the `CLA Signed` label Nov 5, 2025

YufengShi-dudu approved these changes Nov 5, 2025


SS-JIA approved these changes Nov 5, 2025


zingo merged commit 149e23d into pytorch:main Nov 5, 2025
297 of 306 checks passed

abhinaykukkadapu pushed a commit to abhinaykukkadapu/executorch that referenced this pull request Nov 6, 2025

Arm backend: Cherry-pick tosa patch to speed up large models (pytorch…

cfc6d0f

…#15592) * Only affects models >2gb Signed-off-by: Ryan O'Shea <ryan.oshea3@arm.com>
