[ci] Try to use GCP Bazel cache for more jobs #20836

jwnrt · 2024-01-15T14:58:21Z

Summary

This change lets more jobs read from the cache, and also allows more jobs to write to the cache when running for the master branch.

It's being tested in this Azure run which will write to the cache. A second run will need to be triggered to see if we actually read from the cache.

Details

This change:

Switches usage of ./bazelisk.sh to ci/bazelisk.sh in pipelines.
- This script connects Bazel to our GCP bucket containing a Bazel cache.
- This allows CI to read from the cache.
- If the GCP_BAZEL_CACHE_KEY variable is set, it also writes to the cache.
Adds downloads for the file containing cache write credentials to more jobs.
- This file is stored encrypted in Azure and cannot be downloaded by PRs from forks.
- The location to this file is automatically stored in the GCP_BAZEL_CACHE_KEY_SECUREFILEPATH environment variable.
- The file is only downloaded for the master branch, however I have changed this for this test run to show that the cache can be written to.
- In summary: having this downloaded allows the master branch to write to the cache.

Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

pamaury · 2024-01-15T17:49:21Z

Thanks @jwnrt for looking at this, this looks like a great idea. It seems that now we will have many instances of

  - task: DownloadSecureFile@1
    condition: eq(variables['Build.SourceBranchName'], 'master')
    name: GCP_BAZEL_CACHE_KEY
    inputs:
      secureFile: "bazel_cache_gcp_key.json"

which is quite error-prone and very specific to our workflow. What do you think about having a yaml file that we can include so we can do instead something like:

- template: ci/use-bazel-remove-cache.yaml

and maybe add a parameter to specific whether access is just read, or read-write?

Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

jwnrt · 2024-01-16T09:18:45Z

I've extracted this step into a template but haven't added an r/rw parameter since these credentials are only for writing. The GCP bucket is always read if using the ci/bazelisk.sh.

I've tried to make this clearer by giving the template the name load-bazel-cache-write-creds.yml so it's more obvious they just for writing.

jwnrt · 2024-01-16T09:26:47Z

Oh also I should mention that even with this change, I don't think the Bazel cache is working in CI at all.

Before this PR, the sw_build job was set up correctly to use the cache (it had the GCP credentials to write and it was using ci/bazelisk.sh to read) but as far as I can tell has always rebuilt.

I tried to set up a local environment to read from the cache too, but no success. These are the (convoluted) steps I used (thanks to @nbdd0121 for helping with this):

Added the following to ~/.bazelrc (and ~root/.bazelrc for good measure) to replicate what ci/bazelisk.sh builds:

build --remote_cache=https://storage.googleapis.com/opentitan-bazel-cache
build --remote_upload_local_results=false
build --remote_default_exec_properties=OSVersion="Ubuntu 20.04.6 LTS"

Ensured my local environment variables matched CI for those set in rules/nonhermetic.bzl:
- Set HOME="/root" - this is what the CI uses and the HOME env var is non-hermetic, so may have influenced the cache?
- Set PATH=/tools/verilator/v4.210/bin:/tools/verible/bin:/root/.local/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin which is what sw_build uses - we extracted this by running env in this PR's CI run.
- $XILINXD_LICENSE_FILE already matched, and the other vars were unset.
Execute the same build command that sw_build does but using ./bazelisk.sh instead of ci/bazelisk.sh to ensure our own .bazelrc is used.

We did not observe the cache being used at all. There must be something else contributing to the cache that we didn't account for.

Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

azure-pipelines.yml

pamaury · 2024-01-17T15:39:18Z

ci/bazelisk.sh

@@ -15,12 +15,13 @@ echo "Running bazelisk in $(pwd)."

 # An additional bazelrc must be synthesized to specify precisely how to use the
 # GCP bazel cache.
+GCP_CREDS_FILE="$GCP_BAZEL_CACHE_KEY_SECUREFILEPATH"


I am assuming that the _SECUREFILEPATH is added by DownloadSecureFile to the name? It's not really clear in the documentation https://learn.microsoft.com/en-us/azure/devops/pipelines/tasks/reference/download-secure-file-v1?view=azure-pipelines

DownloadSecureFile will store the path to the file in a variable called $(GCP_BAZEL_CACHE_KEY.secureFilePath), and Azure also has a rule that you can access $(AZURE_VARIABLES.WITH.SCOPES) variables through environment variables by replacing .s with _s

pamaury

Thanks @jwnrt, this looks good to me. I am no bash expert though so I might have missed something.

jwnrt · 2024-01-17T15:42:42Z

Thanks for reviewing, I asked @nbdd0121 to look over the Bash and we simplified it a bit more

These flags need to come _before_ the subcommand, i.e.: ``` bazel --bazelrc=... cquery # VALID bazel cquery --bazelrc=... # INVALID ``` which is done in this script by pushing flags that come before the subcommand name into an array and re-applying them in the correct order later. If these Bash arrays are confusing and you've come across this commit in a `git blame` then hopefully this will help: 1. Bash supports arrays defined using parentheses: `array=("foo" "bar")` 2. Arrays can't be passed as arguments to functions, they are expanded and get mixed into other arguments, so we use a global instead. 4. the syntax for expanding an array into its elements with correct quoting on each is: `"${array[@]}"`. Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

[ci] Use ci/bazelisk.sh script in CI workflow

2aa0bb3

Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

jwnrt force-pushed the ci-bazel-cache branch from aaa2801 to ecfc442 Compare January 15, 2024 15:40

jwnrt marked this pull request as ready for review January 15, 2024 15:42

jwnrt requested a review from rswarbrick as a code owner January 15, 2024 15:42

jwnrt force-pushed the ci-bazel-cache branch from ecfc442 to fb0f800 Compare January 15, 2024 15:52

jwnrt requested review from nbdd0121 and pamaury January 15, 2024 15:55

[ci] Add Bazel cache GCP key to more jobs

6c251f3

Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

jwnrt force-pushed the ci-bazel-cache branch from 68b134a to cfe7343 Compare January 16, 2024 09:15

[ci] Fix ci/bazelisk.sh printing to stdout

6bc6e82

Signed-off-by: James Wainwright <james.wainwright@lowrisc.org>

jwnrt force-pushed the ci-bazel-cache branch from cfe7343 to 6bc6e82 Compare January 16, 2024 09:30

jwnrt mentioned this pull request Jan 16, 2024

[ci] Bazel cache not being used #20844

Open

pamaury reviewed Jan 16, 2024

View reviewed changes

azure-pipelines.yml Show resolved Hide resolved

jwnrt force-pushed the ci-bazel-cache branch 3 times, most recently from d236951 to 73b8d77 Compare January 17, 2024 15:37

pamaury reviewed Jan 17, 2024

View reviewed changes

jwnrt force-pushed the ci-bazel-cache branch from 73b8d77 to 6704287 Compare January 17, 2024 15:39

nbdd0121 approved these changes Jan 17, 2024

View reviewed changes

pamaury approved these changes Jan 17, 2024

View reviewed changes

jwnrt force-pushed the ci-bazel-cache branch from 6704287 to b92c555 Compare January 17, 2024 15:46

jwnrt merged commit 4d88b95 into lowRISC:master Jan 17, 2024
32 checks passed

pamaury mentioned this pull request Feb 29, 2024

Backport ci changes #21754

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ci] Try to use GCP Bazel cache for more jobs #20836

[ci] Try to use GCP Bazel cache for more jobs #20836

jwnrt commented Jan 15, 2024 •

edited

Loading

pamaury commented Jan 15, 2024

jwnrt commented Jan 16, 2024

jwnrt commented Jan 16, 2024 •

edited

Loading

pamaury Jan 17, 2024

jwnrt Jan 17, 2024 •

edited

Loading

pamaury left a comment

jwnrt commented Jan 17, 2024

[ci] Try to use GCP Bazel cache for more jobs #20836

[ci] Try to use GCP Bazel cache for more jobs #20836

Conversation

jwnrt commented Jan 15, 2024 • edited Loading

Summary

Details

pamaury commented Jan 15, 2024

jwnrt commented Jan 16, 2024

jwnrt commented Jan 16, 2024 • edited Loading

pamaury Jan 17, 2024

Choose a reason for hiding this comment

jwnrt Jan 17, 2024 • edited Loading

Choose a reason for hiding this comment

pamaury left a comment

Choose a reason for hiding this comment

jwnrt commented Jan 17, 2024

jwnrt commented Jan 15, 2024 •

edited

Loading

jwnrt commented Jan 16, 2024 •

edited

Loading

jwnrt Jan 17, 2024 •

edited

Loading