Cache PyTorch source builds to reduce CI time #1500

ashay · 2022-10-17T16:21:45Z

This PR contains two patches that reduce the time spent in out-of-tree
builds that reference the PyTorch source code from about 90 minutes to
about 15 minutes.

ci: cache PyTorch source builds

This patch reduces the time spent in regular CI builds by caching
PyTorch source builds. Specifically, this patch:

Makes CI lookup the cache entry for the PyTorch commit hash in
pytorch-version.txt
If lookup was successful, CI fetches the previously-generated WHL
file into the build_tools/python/wheelhouse directory
CI sets the TM_PYTORCH_INSTALL_WITHOUT_REBUILD variable to true
The build_libtorch.sh script then uses the downloaded WHL file
instead of rebuilding PyTorch

ci: warm up PyTorch source cache during daily RollPyTorch action

This patch makes the RollPyTorch action write the updated WHL file to
the cache, so that it can be later retrieved by CI that runs for each
PR. We deliberately add the caching step to the end of the action since
the RollPyTorch action never needs to read from the cache, although
executing this step earlier in the process should not cause problems
either.

This patch reduces the time spent in regular CI builds by caching PyTorch source builds. Specifically, this patch: 1. Makes CI lookup the cache entry for the PyTorch commit hash in pytorch-version.txt 2. If lookup was successful, CI fetches the previously-generated WHL file into the build_tools/python/wheelhouse directory 3. CI sets the `TM_PYTORCH_INSTALL_WITHOUT_REBUILD` variable to `true` 4. The build_libtorch.sh script then uses the downloaded WHL file instead of rebuilding PyTorch

This patch makes the RollPyTorch action write the updated WHL file to the cache, so that it can be later retrieved by CI that runs for each PR. We deliberately add the caching step to the end of the action since the RollPyTorch action never needs to read from the cache, although executing this step earlier in the process should not cause problems either.

ashay · 2022-10-17T16:23:27Z

Here is the first run, which took 2 hours to finish. The second run finished in 15 minutes.

Caches don't seem to be shared across branches, so this CI run will perform a full build of PyTorch, but subsequent CI runs should be faster.

powderluv

magical. thanks

powderluv · 2022-10-18T05:02:28Z

Only other piece we can do is tar.gz the LLVM build so OOT CI can just download that and we done in a min or two. We can check the submodule SHA for LLVM.

ashay · 2022-10-18T05:41:46Z

tar.gz the LLVM build so OOT CI can just download that

That’s really neat idea! I’ll try it out in a separate PR.

* ci: cache PyTorch source builds This patch reduces the time spent in regular CI builds by caching PyTorch source builds. Specifically, this patch: 1. Makes CI lookup the cache entry for the PyTorch commit hash in pytorch-version.txt 2. If lookup was successful, CI fetches the previously-generated WHL file into the build_tools/python/wheelhouse directory 3. CI sets the `TM_PYTORCH_INSTALL_WITHOUT_REBUILD` variable to `true` 4. The build_libtorch.sh script then uses the downloaded WHL file instead of rebuilding PyTorch * ci: warm up PyTorch source cache during daily RollPyTorch action This patch makes the RollPyTorch action write the updated WHL file to the cache, so that it can be later retrieved by CI that runs for each PR. We deliberately add the caching step to the end of the action since the RollPyTorch action never needs to read from the cache, although executing this step earlier in the process should not cause problems either.

ashay added 2 commits October 17, 2022 09:16

ashay requested a review from powderluv October 17, 2022 16:21

powderluv approved these changes Oct 18, 2022

View reviewed changes

ashay merged commit a9942f3 into main Oct 18, 2022

ashay deleted the ashay/pytorch-cache branch October 18, 2022 05:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache PyTorch source builds to reduce CI time #1500

Cache PyTorch source builds to reduce CI time #1500

ashay commented Oct 17, 2022

ashay commented Oct 17, 2022 •

edited

Loading

powderluv left a comment

powderluv commented Oct 18, 2022

ashay commented Oct 18, 2022

Cache PyTorch source builds to reduce CI time #1500

Cache PyTorch source builds to reduce CI time #1500

Conversation

ashay commented Oct 17, 2022

ashay commented Oct 17, 2022 • edited Loading

powderluv left a comment

Choose a reason for hiding this comment

powderluv commented Oct 18, 2022

ashay commented Oct 18, 2022

ashay commented Oct 17, 2022 •

edited

Loading