Qualcomm AI Engine Direct - add cli tool for QNN artifacts #4731

haowhsu-quic · 2024-08-15T11:24:34Z

Summary:

- CLI tool for deploying a precompiled model library / context binary onto the ExecuTorch runtime
- refactor & minor fixes

pytorch-bot · 2024-08-15T11:24:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4731

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 43371c1 with merge base 54f8932:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

haowhsu-quic · 2024-08-15T12:07:26Z

Hi @cccclai, this PR adds a CLI tool that helps convert pre-generated QNN artifacts (*.so / *.bin) into a .pte program. We think this will let users engage with Qualcomm AIHUB more smoothly.
Please have a look, thank you!

cccclai

lgtm. Just one question related to the ai hub models

cccclai · 2024-08-17T21:31:01Z

examples/qualcomm/qaihub_scripts/utils/README.md

@@ -0,0 +1,102 @@
+# CLI Tool for Compile / Deploy Pre-Built QNN Artifacts
+
+An easy-to-use tool for generating / executing .pte program from pre-built model libraries / context binaries from Qualcomm AI Engine Direct. Tool is verified with [host environement](../../../../docs/source/build-run-qualcomm-ai-engine-direct-backend.md#host-os).


Is it generic for all models from ai hub?

haowhsu-quic · 2024-08-18

Yes, QNN-related artifacts from AIHUB are delivered in .so format. Only large generative AI models are shipped as context binaries.
Both can be converted into a .pte program with this tool.
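
As a rough illustration of the distinction in the reply above, a helper could tell the two artifact kinds apart by file extension. This is only a sketch: the thread does not show the tool's actual interface, so the function names and the mapping below are hypothetical and are based solely on the extensions mentioned here (.so, .bin, .pte).

```python
from pathlib import Path

# Hypothetical mapping, taken only from the extensions named in this thread.
ARTIFACT_KINDS = {
    ".so": "model library",
    ".bin": "context binary",
}


def classify_qnn_artifact(path: str) -> str:
    """Return the artifact kind implied by the file extension."""
    suffix = Path(path).suffix.lower()
    if suffix not in ARTIFACT_KINDS:
        raise ValueError(f"unsupported artifact extension: {suffix!r}")
    return ARTIFACT_KINDS[suffix]


def output_pte_name(path: str) -> str:
    """Derive an illustrative .pte file name from the artifact's file name."""
    return Path(path).with_suffix(".pte").name
```

Either kind of input maps to a single .pte output name, which mirrors the point that both formats go through the same tool.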

Qualcomm AI Engine Direct - add cli tool for QNN artifacts

43371c1

Summary: - cli tool for deploying precompiled model library / context bin onto executorch runtime - refactor & minor fixes

facebook-github-bot added the CLA Signed label Aug 15, 2024

manuelcandales requested a review from cccclai August 15, 2024 16:05

manuelcandales assigned cccclai Aug 15, 2024

manuelcandales added the module: qnn label Aug 15, 2024

cccclai approved these changes Aug 17, 2024

kirklandsign merged commit 4c06907 into pytorch:main Aug 19, 2024
35 checks passed
