[ci] Support running re-execution benchmark with arbitrary version of Firewood #4650

Elvis339 · 2025-12-03T17:43:06Z

Why this should be merged

Enables Firewood to track performance over time by running C-Chain reexecution benchmarks with custom Firewood builds. This establishes the infrastructure for catching performance regressions before they reach production.

ava-labs/firewood#1494

How this works

Firewood triggers the existing C-Chain reexecution workflow via GitHub API with optional firewood and libevm parameters
The workflow passes these to the benchmark task which uses polyrepo to clone, build (via Nix), and configure Firewood
Runs C-Chain reexecution benchmark with the custom build
Optionally uploads results as artifact for Firewood to download and track

The same functionality is available locally for development:

nix develop
./scripts/run_task.sh c-chain-reexecution-firewood-101-250k FIREWOOD_REF=abc123

Changes

Add LIBEVM_REF/FIREWOOD_REF support to reexecute-cchain-range-with-copied-data task
Update composite action and workflows to pass these inputs
Remove redundant c-chain-reexecution-benchmark-firewood.yml workflow
Remove build_firewood.sh script (replaced by polyrepo)
Add documentation to C-Chain Re-Execution README

How this was tested

gh workflow run "C-Chain Re-Execution Benchmark w/ Container" \
  --ref es/enable-firewood-dev-workflow \
  -f task=c-chain-reexecution-firewood-101-250k \
  -f firewood-ref=v0.0.15 \
  -f runner=avalanche-avalanchego-runner-2ti \
  -f timeout-minutes=60

Need to be documented in RELEASES.md?

No

Copilot

Pull request overview

This PR establishes infrastructure for tracking Firewood performance over time by enabling C-Chain reexecution benchmarks with custom Firewood builds. The workflow can be triggered from the Firewood repository with either published versions (for quick testing) or branch/commit references (for comprehensive testing with source builds).

Key changes:

Adds a reusable workflow that accepts Firewood version/branch/commit as input
Implements intelligent build strategy: uses go get for published versions, builds from source for branches/commits
Creates build script for compiling Firewood FFI from source using Nix

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

File	Description
`.github/workflows/c-chain-reexecution-benchmark-firewood.yml`	New workflow that orchestrates benchmark execution with custom Firewood builds and uploads results as artifacts
`graft/coreth/scripts/build_firewood.sh`	Shell script to clone, build, and optionally integrate Firewood FFI from source
`.github/workflows/c-chain-reexecution-benchmark-container.yml`	Removes container configuration (unrelated cleanup)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

graft/coreth/scripts/build_firewood.sh

.github/workflows/c-chain-reexecution-benchmark-container.yml

graft/coreth/scripts/build_firewood.sh

Taskfile.yml

.github/workflows/c-chain-reexecution-benchmark-firewood.yml

…/avalanchego into es/enable-firewood-dev-workflow

…ution benchmarks Integrate firewood/libevm dependency overrides into existing workflows using polyrepo, eliminating the need for a separate firewood workflow. - Add LIBEVM_REF/FIREWOOD_REF to reexecute-cchain-range-with-copied-data - Update composite action and workflows to pass inputs - Remove redundant firewood workflow and build_firewood.sh script

…/avalanchego into es/enable-firewood-dev-workflow

tests/reexecute/c/README.md

maru-ava · 2025-12-15T14:00:08Z

Functionally this looks ok. Doc question and lint failure will need to be addressed before merge. Also, in addition to the command that you ran to verify that this is working, please include a passing test run that resulted from that command (this is general best practice when manual testing is required).

maru-ava · 2025-12-15T16:19:13Z

I guess the 10 input params is an indication that this remote execution can't be much more than a stop-gap pending firewood migration to a monorepo?

Elvis339 · 2025-12-15T16:38:51Z

I guess the 10 input params is an indication that this remote execution can't be much more than a stop-gap pending firewood migration to a monorepo?

I wasn't aware of the 10 input param limit first time hitting this constraint.
As I see it, we have two options:

Keep as stop-gap - Accept the current limitations until Firewood is grafted into the monorepo
Move orchestration to Firewood remove this from avalanchego and have polyrepo orchestrate the benchmarks from Firewood's side (as we discussed offline)

That said, both options are effectively stop-gaps. (1) is temporary until Firewood joins the monorepo, and (2) would still require cross-repo coordination that becomes unnecessary once Firewood is grafted.

If you're okay with keeping (1) as a stop-gap for now with a GH issue to track the long-term solution, that would unblock the Firewood team to start collecting performance metrics sooner.

…d runs

maru-ava · 2025-12-15T16:48:58Z

If you're okay with keeping (1) as a stop-gap for now with a GH issue to track the long-term solution, that would unblock the Firewood team to start collecting performance metrics sooner.

I'm fine with that, as per my use of the term 'stop-gap' - a temporary solution pending a better one.

Elvis339 · 2025-12-15T17:01:40Z

Executed:

gh workflow run "C-Chain Re-Execution Benchmark w/ Container" \
  --ref es/enable-firewood-dev-workflow \
  -f task=c-chain-reexecution-firewood-101-250k \
  -f firewood-ref=v0.0.15 \
  -f runner=avalanche-avalanchego-runner-2ti \
  -f timeout-minutes=60

Result: https://github.com/ava-labs/avalanchego/actions/runs/20240259592

maru-ava

The comments I made on the container workflow also apply to the container one.

.github/workflows/c-chain-reexecution-benchmark-container.yml

maru-ava · 2025-12-15T17:14:16Z

.github/workflows/c-chain-reexecution-benchmark-container.yml

      runner:
        description: 'Runner to execute the benchmark. Input to the runs-on field of the job.'
        required: true
-      push-post-state:


I see that the job definitions have been updated to use a matrix field instead of this import parameter, what are the implications of this change?

The main implication is that push-post-state can't be set from manual workflow_dispatch anymore you'd have to add it to the JSON config file instead.

Scheduled runs still work fine since they read from the JSON anyway.

We had to make this trade-off because GitHub has a 10 input limit for workflow_dispatch, and we needed room for firewood-ref and libevm-ref. Figured those are more useful for manual runs (testing custom versions), while push-post-state is typically used in scheduled/automated runs anyway.

If you ever need to manually push state after a run, you can still do it with:

task export-dir-to-s3 SRC=/path/to/current-state DST=s3://bucket/destination

@aaronbuchwald Thoughts?

maru-ava · 2025-12-15T17:14:53Z

.github/workflows/c-chain-reexecution-benchmark-container.yml

+          push-post-state: ${{ matrix.push-post-state || '' }}
          runner_name: ${{ matrix.runner }}
+      - name: Upload benchmark results
+        if: inputs.firewood != ''


Maybe add a ~~command~~ comment here indicating why a value for this parameter should prevent uploading?

Added it here:

avalanchego/.github/actions/c-chain-reexecution-benchmark/action.yml

Line 196 in a89c44e

# Skip when using custom firewood - Firewood downloads the artifact and

but forgot to add it to concrete implementations, thanks for pointing it out.

d428155 87b3d4c

.github/workflows/c-chain-reexecution-benchmark-container.yml

…ref and firewood-ref

Elvis339 · 2025-12-15T17:57:49Z

Re: #4650 (comment)

Why are we using this image in the first place? I get that it wasn't working for firewood, but it would be good to know why it's ok to stop using it.

The image was originally used because it was minimal and we didn't want to maintain our own ubuntu base was missing sudo and the initial ARC implementation needed some baseline. SREs and Infra have been iterating on it since then.
As for why it's ok to remove - I ran into io_uring not being available when trying to build Firewood (which uses io_uring for async I/O). Removed the container and it worked. The self-hosted runners already have all the dependencies we need, so the container was just adding constraints without much benefit for our use case.

.github/workflows/c-chain-reexecution-benchmark-container.yml

maru-ava

I'm ok with this, but I think @aaronbuchwald will need to confirm that these changes are in keeping with his expectations.

ci(c-chain-reexecution-firewood)

d0155dd

Elvis339 self-assigned this Dec 3, 2025

Copilot AI review requested due to automatic review settings December 3, 2025 17:43

Elvis339 requested review from a team and aaronbuchwald as code owners December 3, 2025 17:43

Elvis339 added the ci This focuses on changes to the CI process label Dec 3, 2025

github-project-automation bot added this to avalanchego Dec 3, 2025

Elvis339 mentioned this pull request Dec 3, 2025

Track Firewood Performance via AvalancheGo Reexecution Benchmarks ava-labs/firewood#1494

Open

Copilot AI reviewed Dec 3, 2025

View reviewed changes

graft/coreth/scripts/build_firewood.sh Outdated Show resolved Hide resolved

graft/coreth/scripts/build_firewood.sh Outdated Show resolved Hide resolved

lint

e851903

Elvis339 requested review from joshua-kim and maru-ava as code owners December 3, 2025 18:02

Elvis339 commented Dec 3, 2025

View reviewed changes

.github/workflows/c-chain-reexecution-benchmark-container.yml Show resolved Hide resolved

Merge branch 'master' into es/enable-firewood-dev-workflow

49b7fc3

Elvis339 mentioned this pull request Dec 4, 2025

ci(perf): Track Firewood Performance via AvalancheGo Benchmarks ava-labs/firewood#1493

Open

RodrigoVillar reviewed Dec 5, 2025

View reviewed changes

graft/coreth/scripts/build_firewood.sh Outdated Show resolved Hide resolved

Merge branch 'master' into es/enable-firewood-dev-workflow

2feeda4

Elvis339 requested a review from RodrigoVillar December 8, 2025 16:23

maru-ava reviewed Dec 9, 2025

View reviewed changes

Elvis339 added 4 commits December 9, 2025 19:45

ci(firewood-benchmark): try go get first, fall back to Nix on failure

6f21231

Merge branch 'es/enable-firewood-dev-workflow' of github.com:ava-labs…

b3cc8ed

…/avalanchego into es/enable-firewood-dev-workflow

refactor(build-firewood): move default config to script, simplify task

e437658

ci(firewood-benchmark): test workflow without the commit step

15b52ee

Elvis339 requested a review from maru-ava December 9, 2025 18:10

Elvis339 and others added 4 commits December 9, 2025 22:10

Merge branch 'master' into es/enable-firewood-dev-workflow

999c3f4

Merge branch 'master' into es/enable-firewood-dev-workflow

34a331f

Merge branch 'es/enable-firewood-dev-workflow' of github.com:ava-labs…

e0e784b

…/avalanchego into es/enable-firewood-dev-workflow

Elvis339 requested a review from StephenButtolph as a code owner December 12, 2025 17:12

fix(c-chain-reexecution): add nix build env vars for self-hosted runners

cab333e

maru-ava reviewed Dec 15, 2025

View reviewed changes

tests/reexecute/c/README.md Outdated Show resolved Hide resolved

lint(c-chain-reexecution): set 10 input params per gh workflow limit

c131420

Elvis339 added 2 commits December 15, 2025 20:30

docs

f79c8d5

Merge branch 'master' into es/enable-firewood-dev-workflow

c4561be

ci(c-chain-reexecution): skip benchmark comparison for custom firewoo…

a89c44e

…d runs

maru-ava reviewed Dec 15, 2025

View reviewed changes

ci(c-chain-reexecution): rename libevm and firewood inputs to libevm-…

d428155

…ref and firewood-ref

Elvis339 requested a review from a team as a code owner December 15, 2025 17:49

docs

87b3d4c

Merge branch 'master' into es/enable-firewood-dev-workflow

6f5502b

maru-ava reviewed Dec 15, 2025

View reviewed changes

.github/workflows/c-chain-reexecution-benchmark-container.yml Outdated Show resolved Hide resolved

maru-ava reviewed Dec 15, 2025

View reviewed changes

.github/workflows/c-chain-reexecution-benchmark-container.yml Outdated Show resolved Hide resolved

ci(c-chain-reexecution): clarify firewood-ref skips benchmark comparison

6493598

maru-ava approved these changes Dec 15, 2025

View reviewed changes

[ci] Support running re-execution benchmark with arbitrary version of Firewood #4650

Are you sure you want to change the base?

[ci] Support running re-execution benchmark with arbitrary version of Firewood #4650

Conversation

Elvis339 commented Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why this should be merged

How this works

Changes

How this was tested

Need to be documented in RELEASES.md?

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maru-ava commented Dec 15, 2025

Uh oh!

maru-ava commented Dec 15, 2025

Uh oh!

Elvis339 commented Dec 15, 2025

Uh oh!

maru-ava commented Dec 15, 2025

Uh oh!

Elvis339 commented Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

maru-ava left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

maru-ava Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

Elvis339 Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

maru-ava Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

maru-ava Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Elvis339 Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Elvis339 commented Dec 15, 2025

Uh oh!

Uh oh!

Uh oh!

maru-ava left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Elvis339 commented Dec 3, 2025 •

edited

Loading

Elvis339 commented Dec 15, 2025 •

edited

Loading

maru-ava Dec 15, 2025 •

edited

Loading

Elvis339 Dec 15, 2025 •

edited

Loading