Qualcomm AI Engine Direct - Reduce redundant observers #6351

winskuo-quic · 2024-10-18T05:27:30Z

Summary

Current observer flow checks min/max/axis values to determine whether 2 observer is same. If same, it would only keep 1 observer. Providing these values helps observers to properly compare with other nodes observer, which helps reducing the number of observers, which further helps reducing time to quantize the model.

Before:

After:

pytorch-bot · 2024-10-18T05:27:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6351

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit 66c3e44 with merge base f7e26d7 ():

NEW FAILURES - The following jobs have failed:

pull / test-binary-size-linux-gcc / linux-job (gh)
RuntimeError: Command docker exec -t 2faba9fe80a8bee8d16129f0ca36a9513b804626405921577955092fc9a1f52a /exec failed with exit code 1
pull / unittest-arm / linux-job (gh)
RuntimeError: Command docker exec -t 4890673c7d2edb7af556617151b6d6c90e49a7c81896eaad373e5c6b218da57a /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

winskuo-quic · 2024-10-18T05:29:09Z

Hi @cccclai,
As discussed previously, we want to reduce the numbers of observers to reduce AOT time.
This PR should be able to reduce some observers.
Please have a look.
Thanks

cccclai

Looks good, thank you

facebook-github-bot · 2024-10-18T19:50:00Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

cccclai · 2024-11-05T21:57:23Z

Hi sorry I miss merging this PR, mind rebasing again?

winskuo-quic · 2024-11-06T00:58:09Z

Hi sorry I miss merging this PR, mind rebasing again?

All good!
I have just rebased.
Thanks

facebook-github-bot · 2024-11-06T16:31:57Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

digantdesai · 2024-11-07T22:24:12Z

Seems like there are still merge conflicts, might need another rebase?

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 18, 2024

cccclai approved these changes Oct 18, 2024

View reviewed changes

Qualcomm AI Engine Direct - Reduce redundant observers

66c3e44

winskuo-quic force-pushed the dev1/winskuo/reduce_redundant_observer branch from e6024c9 to 66c3e44 Compare November 6, 2024 00:57

digantdesai merged commit cb2a0e7 into pytorch:main Nov 7, 2024
38 of 41 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qualcomm AI Engine Direct - Reduce redundant observers #6351

Qualcomm AI Engine Direct - Reduce redundant observers #6351

winskuo-quic commented Oct 18, 2024

pytorch-bot bot commented Oct 18, 2024 •

edited

Loading

winskuo-quic commented Oct 18, 2024

cccclai left a comment

facebook-github-bot commented Oct 18, 2024

cccclai commented Nov 5, 2024

winskuo-quic commented Nov 6, 2024

facebook-github-bot commented Nov 6, 2024

digantdesai commented Nov 7, 2024

Qualcomm AI Engine Direct - Reduce redundant observers #6351

Qualcomm AI Engine Direct - Reduce redundant observers #6351

Conversation

winskuo-quic commented Oct 18, 2024

Summary

pytorch-bot bot commented Oct 18, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6351

❌ 2 New Failures

winskuo-quic commented Oct 18, 2024

cccclai left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Oct 18, 2024

cccclai commented Nov 5, 2024

winskuo-quic commented Nov 6, 2024

facebook-github-bot commented Nov 6, 2024

digantdesai commented Nov 7, 2024

pytorch-bot bot commented Oct 18, 2024 •

edited

Loading