Skip to content

Conversation

huydhn
Copy link
Contributor

@huydhn huydhn commented Jul 17, 2024

After #4203, the app is now uploaded to artifact/tiktoken_${{ matrix.tiktoken }} instead with tiktoken can be either ON or OFF. https://github.com/pytorch/executorch/blob/main/examples/demo-apps/android/LlamaDemo/README.md#alternative-2-build-from-local-machine mentions that tiktoken is only for Llama3. So, we can export it later in another archive like https://ossci-assets.s3.amazonaws.com/executorch-android-llama2-7b.zip when this is updated to run Llama3.

Note that the test is still unstable because the app runs OOM on S22 (#3344)

@huydhn huydhn requested review from guangy10 and kirklandsign July 17, 2024 05:59
Copy link

pytorch-bot bot commented Jul 17, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/4289

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 13f29f0 with merge base 1d7d71d (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 17, 2024
@huydhn huydhn marked this pull request as ready for review July 17, 2024 06:36
@facebook-github-bot
Copy link
Contributor

@huydhn has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@guangy10
Copy link
Contributor

Thanks for the quick fix @huydhn. I think it's better to temporarily disable/remove it from this android job. This android job runs on the PR and should only be used for quick validation of the android demo app build and run with a tiny model. It also runs in the GitHub action to prompt commit from main to stable, the instability would affect the stable branch health and release as we would prefer doing branch-cut off stable if possible. So the better place is to run the benchmarking job in a non-critical path.

I have #4287 to remove/disable the uploading and benchmarking portion off the android job
I also have #4288 starting refactor the android job to build different flavors of android demo apps, which will be used for benchmarking later. After merged those, I will start re-enable the benchmarking job for various of supported LLMs in a non-critical path.

# mentions that tiktoken is only for Llama3. So, we can export it later in another archive
# like https://ossci-assets.s3.amazonaws.com/executorch-android-llama2-7b.zip when this is
# updated to run Llama3
tiktoken: [OFF]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the reason of #4203 updating this to include tiktoken tokenizer is to ensure the demo app can build for Llama3. So as mentioned that this job should focus on validating different flavors of android demo app builds. If we remove tiktoken here, there will be a tcc regression.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, this covers only llama2 without tiktoken atm. To cover llama3, for example, we would need to zip the exported model and the tokenizer it uses and upload the archive onto S3, i.e. https://github.com/pytorch/executorch/blob/main/.github/workflows/android.yml#L122. Then the archive can be passed to the job as its extra-data parameter to be copied to the device.

@huydhn huydhn marked this pull request as draft July 18, 2024 18:00
@huydhn huydhn closed this Jul 23, 2024
@huydhn huydhn deleted the fix-android-app-s3-path branch October 13, 2024 02:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants