Skip to content

Qualcomm AI Engine Direct - Support tile op for different I/O rank #10054

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

Conversation

DannyYuyang-quic
Copy link
Collaborator

Summary:

  • Support if the rank of input tensor is less than the rank of output tensor.
  • make_quantizer kwargs alignment.
  • Remove module.eval() since calling eval() is not supported for exported models.

Test plan

python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedOperator.test_qnn_backend_expand -s ${device_id} -H ${host_id} -m ${soc} -b build-android

Copy link

pytorch-bot bot commented Apr 10, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10054

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures

As of commit a825fd1 with merge base c9c5481 (image):

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 10, 2025
@DannyYuyang-quic
Copy link
Collaborator Author

@pytorchbot label "release notes: qualcomm"

@pytorch-bot pytorch-bot bot added the release notes: qualcomm Changes to the Qualcomm backend delegate label Apr 10, 2025
Copy link
Contributor

@cccclai cccclai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, thank you! Curious which model did you use when identifying this issue.

@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@DannyYuyang-quic
Copy link
Collaborator Author

Looks good, thank you! Curious which model did you use when identifying this issue.

I encountered this issue while enabling the image encoder of the LLaVA model.

@DannyYuyang-quic
Copy link
Collaborator Author

@cccclai , I'd like to refine this PR a bit. Could you please wait for me to update this PR before merging?

Summary:

- Support if the rank of input tensor is less than the rank of output tensor.
- make_quantizer kwargs alignment.
- Remove module.eval() since calling eval() is not supported for exported models.
@DannyYuyang-quic DannyYuyang-quic force-pushed the dev1/danny/support_tile_op_with_diff_IO_rank branch from fc5c30b to a825fd1 Compare April 10, 2025 06:56
@DannyYuyang-quic
Copy link
Collaborator Author

@cccclai , I'd like to refine this PR a bit. Could you please wait for me to update this PR before merging?

Done, an equivalent approach but more concise.

@facebook-github-bot
Copy link
Contributor

@cccclai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cccclai cccclai merged commit 7f3b382 into pytorch:main Apr 10, 2025
87 of 90 checks passed
This was referenced Apr 10, 2025
manuelcandales added a commit to manuelcandales/executorch-1 that referenced this pull request Apr 11, 2025
Summary: forward fix for pytorch#10054

Differential Revision: D72834413
@manuelcandales manuelcandales mentioned this pull request Apr 11, 2025
cccclai added a commit that referenced this pull request Apr 11, 2025
Fix the lint error from #10054
kirklandsign pushed a commit that referenced this pull request Apr 11, 2025
…10054)

Summary:

- Support if the rank of input tensor is less than the rank of output
tensor.
- make_quantizer kwargs alignment.
- Remove module.eval() since calling eval() is not supported for
exported models.


### Test plan
``` bash
python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedOperator.test_qnn_backend_expand -s ${device_id} -H ${host_id} -m ${soc} -b build-android
```
kirklandsign pushed a commit that referenced this pull request Apr 11, 2025
Fix the lint error from #10054
keyprocedure pushed a commit to keyprocedure/executorch that referenced this pull request Apr 21, 2025
…ytorch#10054)

Summary:

- Support if the rank of input tensor is less than the rank of output
tensor.
- make_quantizer kwargs alignment.
- Remove module.eval() since calling eval() is not supported for
exported models.


### Test plan
``` bash
python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedOperator.test_qnn_backend_expand -s ${device_id} -H ${host_id} -m ${soc} -b build-android
```
keyprocedure pushed a commit to keyprocedure/executorch that referenced this pull request Apr 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. release notes: qualcomm Changes to the Qualcomm backend delegate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants