Experiment with private rooted Pixel 3 devices #10192
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10192
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit e0b834c with merge base 4559a61.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
id-token: write
contents: read
with:
  models: ${{ inputs.models }}
@guangy10 @kirklandsign I'd appreciate it if you could pass me the list of models and benchmark configs you want to cover here. We only have 2 devices there, though, so I think we should reduce the number of models we cover for now.
Let's pick one non-genAI model (mv3, xnnpack_q8) and one genAI model (llama3.2-1b, spinquant & qlora)
mv3,meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8,meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8
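For illustration, the comma-separated list above would be fed through the `models` input shown in the diff. A minimal sketch of the calling job, assuming the benchmark workflow is a reusable workflow (the workflow path and job name are assumptions, not taken from this PR):

```yaml
# Sketch only: the workflow path and job name are hypothetical.
jobs:
  benchmark:
    uses: ./.github/workflows/android-perf.yml  # hypothetical path
    permissions:
      id-token: write
      contents: read
    with:
      models: mv3,meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8,meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8
      devices: google_pixel_3_private_rooted
```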
contents: read
with:
  models: ${{ inputs.models }}
  devices: google_pixel_3_private_rooted
Unfortunately, we don't have data points for the same type of device in the public pool, so we can't do a direct comparison. We would mainly rely on the new data from the private, rooted devices and compare those across different runs.
I recall @kirklandsign mentioned that metric instability is frequently seen on nightly batched jobs, while on-demand runs with only 1-2 models are much more stable.
@huydhn In this new schedule, and to gather more data points, we may want to run this workflow more often (every 2 hours, maybe? That should leave enough time to cool down from the previous run).
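An every-2-hours cadence would be expressed with a cron trigger; a sketch (the minute offset of 0 is an arbitrary assumption):

```yaml
# Sketch: run on an every-2-hours cadence.
# GitHub Actions cron schedules are evaluated in UTC.
on:
  schedule:
    - cron: "0 */2 * * *"  # at minute 0 of every 2nd hour
```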
@huydhn Do we need to merge this PR? I thought we could reuse the existing workflow instead of creating a new fork.
This uses the existing workflow but gives us the flexibility to use a different schedule and default inputs.
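The "thin fork" pattern described here — a new caller workflow that reuses the existing benchmark workflow but pins its own schedule and defaults — might look like the sketch below. The file name, workflow path, cadence, and default model are all assumptions for illustration, not the PR's actual values:

```yaml
# Hypothetical caller workflow: same underlying benchmark workflow,
# but with its own schedule and default inputs. Names and paths are
# assumptions.
name: android-perf-private-devices
on:
  schedule:
    - cron: "0 */2 * * *"  # assumed cadence
  workflow_dispatch:
    inputs:
      models:
        description: Comma-separated list of models to benchmark
        required: false
        type: string

jobs:
  benchmark:
    uses: ./.github/workflows/android-perf.yml  # hypothetical path
    permissions:
      id-token: write
      contents: read
    with:
      models: ${{ inputs.models || 'mv3' }}  # assumed default
      devices: google_pixel_3_private_rooted
```

This keeps the benchmark logic in one place; the fork only owns the trigger and its defaults.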
I missed this in #10192 for the scheduled workflow case, so it wrongly falls back to `llama`.

### Testing

https://github.com/pytorch/executorch/actions/runs/14481092309. The results show up on HUD correctly now: https://hud.pytorch.org/benchmark/llms?startTime=Wed%2C%2009%20Apr%202025%2000%3A31%3A41%20GMT&stopTime=Wed%2C%2016%20Apr%202025%2000%3A31%3A41%20GMT&granularity=day&lBranch=update-default-private-devices&lCommit=9fa6a49250fdbd96e8f7a5f4765bc6b32b41b6ce&rBranch=update-default-private-devices&rCommit=9fa6a49250fdbd96e8f7a5f4765bc6b32b41b6ce&repoName=pytorch%2Fexecutorch&benchmarkName=&modelName=All%20Models&backendName=All%20Backends&modeName=All%20Modes&dtypeName=All%20DType&deviceName=Google%20Pixel%203%20(Rooted)%20(Android%2012)&archName=All%20Platforms
I'm creating a new experimental workflow to run benchmarks on private rooted Pixel 3 devices. We only have 2 devices there, though, so I think we should reduce the number of models we cover for now.

### Testing

https://github.com/pytorch/executorch/actions/runs/14465442714
Same as #10192, but this is for the private iPhone 15 devices. I will need to follow up with another PR to get the device pool name/id and maybe the device id.

### Testing

https://hud.pytorch.org/benchmark/llms?startTime=Wed%2C%2016%20Apr%202025%2004%3A22%3A32%20GMT&stopTime=Wed%2C%2023%20Apr%202025%2004%3A22%3A32%20GMT&granularity=day&lBranch=apple-private-devices&lCommit=be72291c9b643581fedafaf8711d07fd5542b62d&rBranch=apple-private-devices&rCommit=be72291c9b643581fedafaf8711d07fd5542b62d&repoName=pytorch%2Fexecutorch&benchmarkName=&modelName=All%20Models&backendName=All%20Backends&modeName=All%20Modes&dtypeName=All%20DType&deviceName=All%20Devices&archName=All%20Platforms