tools: Add a tool to aid with running benchmarks #1454
Conversation
README.md
> Finally, for an especially large binary, we link the chromium web browser with debug info.
> See [benchmarks/ryzen-9955hx.md](benchmarks/ryzen-9955hx.md) for benchmark results.
While the benchmark content seems adequate, I believe the README should also include key benchmark results (particularly those related to link times), as before. Many new users of Wild will have the question "What makes it so superior to existing linkers?", so having an answer readily available in plain sight would be valuable. (In that regard, I think moving "What's working?", "What isn't yet supported?" and the benchmarks to the top of the README would be a good idea too, though these changes may not fall within the scope of this PR.)
Yeah, I agree. A large number of benchmarks is fine, but then we should have a TL;DR section summarizing the results a bit.
```rust
markdown.push_str(&format!(
    "\n\n",
    benchmark.config.name
));
```
I think there should be text with the name of the scenario before the images. That would allow you to quickly search if you knew what you were looking for.
While we're at it, some rough table of contents would be nice too.
Fully agree with both suggestions! Regarding the latter: I would also include a ratio (%) comparison for easier orientation. Digesting the speed-up from the pictures alone takes some mental capacity :)
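The two suggestions above could be sketched roughly like this (not the PR's actual code; `Benchmark`, `Config`, and the image-filename format are assumptions based on the names visible in the diff, and the anchor format follows GitHub's usual heading-slug convention):

```rust
// Hypothetical sketch: emit a searchable heading before each benchmark
// image, plus a matching table-of-contents entry that links to it.

struct Config {
    name: String,
}

struct Benchmark {
    config: Config,
}

/// GitHub-style heading anchor: lowercased, spaces replaced with '-'.
fn heading_anchor(name: &str) -> String {
    name.to_lowercase().replace(' ', "-")
}

fn push_benchmark_section(markdown: &mut String, benchmark: &Benchmark) {
    // Heading first, so readers can Ctrl-F the scenario name.
    markdown.push_str(&format!("## {}\n\n", benchmark.config.name));
    // Image filename derived from the scenario name (an assumption).
    markdown.push_str(&format!("![{0}]({0}.svg)\n\n", benchmark.config.name));
}

fn push_toc_entry(markdown: &mut String, benchmark: &Benchmark) {
    markdown.push_str(&format!(
        "- [{0}](#{1})\n",
        benchmark.config.name,
        heading_anchor(&benchmark.config.name)
    ));
}
```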
I've just prototyped the % change thing. Initially I made the baseline be whatever was slowest (or largest for memory). I didn't really like that though because it sometimes was LLD, sometimes was Mold and if GNU ld was included, then that was the baseline. I contemplated making LLD always be the baseline, but in the end decided to make the last benchmark (which is the latest version of Wild) be the baseline. That means that the other linkers are generally reported as positive percentage increases - except for ripgrep-memory where GNU ld uses less memory than Wild. You can see the results here:
https://github.com/davidlattimore/wild/blob/a9ed091c45244d5ca05928b02448144281c343ac/benchmarks/ryzen-9955hx.md
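The baseline scheme described above amounts to something like the following sketch (illustrative only; the real tool's code may differ). The last entry is the latest Wild, so other linkers generally come out as positive percentage increases:

```rust
// Report each result as a percentage change relative to a fixed baseline:
// the last entry in the list, which is the latest version of Wild.
// Positive values mean slower (or more memory) than the baseline.
fn percent_vs_baseline(values: &[f64]) -> Vec<f64> {
    let baseline = *values.last().expect("need at least one result");
    values
        .iter()
        .map(|v| (v - baseline) / baseline * 100.0)
        .collect()
}

// e.g. percent_vs_baseline(&[2.0, 1.5, 1.0]) yields [100.0, 50.0, 0.0]
```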
```diff
 bumpalo-herd = "0.1.2"
 bytesize = "2.0.0"
-clap = { version = "4.0.0", features = ["derive"] }
+clap = { version = "4.1.0", features = ["derive"] }
```
I may be a bit too pedantic here, but is there any reason to update the version here?
This seems to be due to the minimal-versions check: https://github.com/davidlattimore/wild/actions/runs/20982989095/job/60311647318
Yep, that's it exactly. With this change, Wild no longer builds if we use clap version 4.0.0, so I bumped the minimum version. 4.1.0 is three years old, so it's unlikely that anyone would want to use an older version together with any of our crates.
Can we rely on

Generally speaking, I really like the graphs, and especially the improvement that has been achieved across Wild's releases. It's an even better investment for the future, as the time series will only become more interesting.
Thanks for the feedback, folks! I've added a few interesting benchmarks back to the README, and we now link directly to each benchmark page from it.

The newly added percentages at the bottom of the charts show how each benchmark result differs from the latest version of Wild. Using the latest version of Wild as the baseline makes sense, since it lets us easily compare it with previous versions of Wild or with other linkers. I added spaces to make numbers over 1000 more readable. I didn't want different charts to use different units, because it would be easy to miss that the unit had changed.

I've added a Raspberry Pi benchmarks page and restored one Raspberry Pi benchmark to the main README. I also added confidence intervals, which show how confident we are in the mean that we computed for each benchmark. Where a confidence interval is too large, we could reduce it by running more iterations of that benchmark.
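The confidence intervals mentioned above could be computed along these lines (a rough sketch using the normal approximation, 1.96 standard errors; the tool's actual statistics may differ, and the sample values in the test are made up):

```rust
// 95% confidence interval for a benchmark's mean, via the normal
// approximation: mean ± 1.96 * (sample stddev / sqrt(n)).
// Running more iterations increases n, which shrinks the half-width.
// Requires at least two samples (Bessel's correction divides by n - 1).
fn mean_with_ci95(samples: &[f64]) -> (f64, f64) {
    let n = samples.len() as f64;
    assert!(n >= 2.0, "need at least two samples");
    let mean = samples.iter().sum::<f64>() / n;
    // Sample variance with Bessel's correction, then the standard error.
    let var = samples.iter().map(|s| (s - mean).powi(2)).sum::<f64>() / (n - 1.0);
    let half_width = 1.96 * (var / n).sqrt();
    (mean, half_width) // interval is mean ± half_width
}
```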
This PR is now just the tool. The updated benchmarks are now part of the PR to release 0.8.0: #1456
When I try to run benchmarks with this tool, it doesn't seem to set paths correctly. I've patched it with
It's supposed to be a file. Perhaps delete the directory?
I perhaps should have mentioned that this tool is really intended for doing bulk runs of benchmarks, i.e. running all the benchmarks for a release. For one-off, single-program benchmarks, I'd still recommend using poop or hyperfine.
Ah, that makes sense. I was thrown off by the fact that the PathBuf formatting adds a trailing slash to the path in the command:

which fails with stat 9.9 on my system:

Edit: never mind, I messed up, so the above part isn't relevant. The tool also doesn't create the file beforehand, so I wonder if it would be better to just check the parent directory of `args.tmp` instead.
Ah, that makes sense. I added the call to `stat` after I'd been running the tool for a while, so the output file would have already existed for me. Yes, checking the parent directory sounds like the thing to do. You're welcome to send a PR to fix it if you'd like; otherwise, I'll do it.
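The fix discussed here might look something like the following (a sketch, not the tool's actual code; `check_output_path` and the error messages are hypothetical, and `args.tmp` is the PathBuf from the thread):

```rust
use std::path::Path;

// Since the output file need not exist yet, validate its parent
// directory rather than the path itself, and reject directories.
fn check_output_path(tmp: &Path) -> Result<(), String> {
    if tmp.is_dir() {
        return Err(format!(
            "{} is a directory; expected a file path",
            tmp.display()
        ));
    }
    match tmp.parent() {
        // A bare filename like "out.txt" has an empty parent, meaning
        // the current directory, which is fine.
        Some(parent) if parent.as_os_str().is_empty() || parent.is_dir() => Ok(()),
        Some(parent) => Err(format!(
            "parent directory {} does not exist",
            parent.display()
        )),
        None => Err("path has no parent directory".to_string()),
    }
}
```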