Introduce Yul builtin handles for its dialects #15520

Merged
@clonker merged 5 commits into develop from builtin_handles on Oct 21, 2024

Conversation

@clonker (Member) commented Oct 17, 2024

  • removes fixedFunctionNames from yul::Dialect
  • introduces the yul::BuiltinHandle struct, which acts as a reference to a builtin function
  • modifies yul::EVMDialect's handling of builtins and verbatims so that fetching a function from a handle is O(1); looking up a handle from a builtin name (string identifier) goes through an unordered_map
  • the dialect's builtin function resolves a handle to the corresponding builtin function, and a new function findBuiltin tries to determine the handle corresponding to a builtin's string identifier (a rough sketch of this scheme follows below)
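For illustration, a minimal sketch of such a handle-based lookup. The names and member layout below are invented for the example and are not the actual libyul types, which differ in detail:

#include <cstddef>
#include <optional>
#include <string>
#include <unordered_map>
#include <vector>

// Illustrative only: a builtin description and a handle that is essentially
// an index into the dialect's builtin table.
struct BuiltinFunction
{
    std::string name;
    // ... arguments, return values, side effects, etc.
};

struct BuiltinHandle
{
    std::size_t id;
};

class ExampleDialect
{
public:
    // O(1): the handle is used directly as an index.
    BuiltinFunction const& builtin(BuiltinHandle const& _handle) const
    {
        return m_functions[_handle.id];
    }

    // The name-based lookup goes through an unordered_map and yields a handle, if any.
    std::optional<BuiltinHandle> findBuiltin(std::string const& _name) const
    {
        if (auto it = m_builtinsByName.find(_name); it != m_builtinsByName.end())
            return BuiltinHandle{it->second};
        return std::nullopt;
    }

private:
    std::vector<BuiltinFunction> m_functions;
    std::unordered_map<std::string, std::size_t> m_builtinsByName;
};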

@clonker changed the title from "Builtin handles" to "Introduce Yul builtin handles for its dialects" on Oct 17, 2024
@clonker requested a review from cameel on October 17, 2024, 12:31
@cameel (Member) left a comment

Thanks for splitting this. This was much easier to get through in one sitting.

I didn't find any significant problems. Just some readability and assert tweaks. Once those are addressed, we can merge.

But please also check the external.sh benchmark numbers between this and 0.8.28 to see if there's any impact on performance. Would also be interesting to see if removing the helper map would have any effect on those numbers.

@clonker force-pushed the builtin_handles branch 2 times, most recently from e13af0e to ae538de on October 18, 2024, 08:13
@clonker (Member, Author) commented Oct 18, 2024

But please also check the external.sh benchmark numbers between this and 0.8.28 to see if there's any impact on performance. Would also be interesting to see if removing the helper map would have any effect on those numbers.

Removing the helper map made things consistently worse so I am not even showing it here. Here's the comparison of the PR branch to develop:

(screenshot: benchmark runtime comparison, PR branch vs develop)

I let both benchmark scripts run twice; the percentages are with respect to the best runtime per benchmark. So the branch seems to improve things consistently. I am a bit surprised; I really expected to see a (hopefully non-drastic) worsening of timings. But I'll take it. I'd then actually expect another runtime improvement once the builtins have their own AST element. Based on the reliability of my gut feeling, that should make things worse again :P

I also made a yolo version of the builtin(handle) implementation by putting it into the header and writing it as follows:

BuiltinFunctionForEvm const& builtin(BuiltinHandle const& _handle) const override
{
    // no bounds asserts: index straight into the function tables
    return isVerbatimHandle(_handle) ? *m_verbatimFunctions[_handle.id] : *m_functions[_handle.id - verbatimIDOffset];
}

Turns out that living dangerously doesn't gain you much in this case; the results are comparable to the version with asserts inside the compilation unit.

Generally I find that the benchmarks should be run a couple of times, and I also think that the per-second output may not be precise enough. Especially for the faster stuff like compiling uniswap, a one-second difference (which may very well be due to rounding and in reality much smaller) amounts to a (perhaps spurious) 1.5% difference in runtime.

@cameel previously approved these changes Oct 18, 2024
@cameel (Member) commented Oct 18, 2024

Removing the helper map made things consistently worse so I am not even showing it here.

Interesting. So the difference is actually big enough to show up in benchmarks? How much was it?

I let both benchmark scripts run twice,

Just a note: to get results faster, I'd recommend commenting out the legacy runs, and also sablier and seaport, since those stop at an arbitrary point due to StackTooDeep and may be unreliable for comparisons.

All of those are enabled by default, but they're not all equally useful :)

Turns out that living dangerously doesn't gain you much in this case; the results are comparable to the version with asserts inside the compilation unit.

I mean, asserts may look heavy, but if they don't fail, they boil down to one extra if. Unless the condition itself was heavy, that's exactly what I'd have expected here :)
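For comparison, the checked counterpart of the snippet above could look roughly like this. It reuses the invented names from that snippet and is only a sketch of the idea, not the actual implementation in the compilation unit:

BuiltinFunctionForEvm const& builtin(BuiltinHandle const& _handle) const override
{
    // Each assertion is a single branch on the happy path; it only throws
    // if the handle is out of range.
    if (isVerbatimHandle(_handle))
    {
        yulAssert(_handle.id < m_verbatimFunctions.size(), "Invalid verbatim handle.");
        return *m_verbatimFunctions[_handle.id];
    }
    yulAssert(_handle.id >= verbatimIDOffset, "Invalid builtin handle.");
    return *m_functions[_handle.id - verbatimIDOffset];
}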

Generally I find that the benchmarks should be run a couple of times

Definitely. We didn't want to overengineer them; they're just simple scripts. But I wouldn't mind if they were extended to allow running multiple times, with results averaged over those runs. I sometimes do that by hand, but it's really tedious.

I also think that the per-second output may not be precise enough. Especially for the faster stuff like compiling uniswap, a one-second difference (which may very well be due to rounding and in reality much smaller) amounts to a (perhaps spurious) 1.5% difference in runtime.

I'm not sure these numbers are accurate enough for sub-second values to matter. Showing them may be misleading. On my system I sometimes see variance between runs that can be counted in seconds, even for OZ, which takes only ~30 s. For example, when I was rerunning the benchmarks to get numbers for the release, I was quite surprised that the first run did not reproduce the gains I saw in the PR. Only after multiple runs did it average out to something closer. I think that multiple runs with averaged values would be a better way to address this.

From my experience with these benchmarks, reliably measuring a difference on the order of 1-2% is hard because there are other factors impacting the results that are potentially greater than that. It depends, though: some days I'm getting pretty consistent results between runs, while other days they vary widely. If the difference is this small, it may well not even be there. It's still better than what we were getting with benchmarking external tests in CI, where the variance was in the tens of percent, so I consider what we're getting here pretty good regardless :)

@clonker (Member, Author) commented Oct 21, 2024

Interesting. So the difference is actually big enough to show up in benchmarks? How much was it?

For openzeppelin it consistently took 17s vs 16s over three runs (although binning in whole seconds doesn't make this particularly meaningful); for uniswap it was (66s, 65s) instead of (62s, 64s); for eigenlayer (294s, 295s) instead of (292s, 283s). To make it more quantitative I should of course run the benchmark more often. Perhaps this is still just chance :)

I mean, asserts may look heavy, but if they don't fail, they boil down to one extra if. Unless the condition itself was heavy, that's exactly what I'd have expected here :)

Nice, good then! :)

Definitely. We didn't want to overengineer them; they're just simple scripts. But I wouldn't mind if they were extended to allow running multiple times, with results averaged over those runs. I sometimes do that by hand, but it's really tedious.

I'm not sure these numbers are accurate enough for sub-second values to matter. Showing them may be misleading.

Yes, the variance is quite large. Still, especially for averaging runs, it would be good to have more precise measurements. Then you might even see faster convergence to a meaningful mean, particularly on faster hardware. I haven't done it yet, but it would be interesting to see the distribution and how many samples you'd need for (e.g.) OZ to get a good grasp. Then it should also be possible to compare versions more reliably.
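To illustrate, a hypothetical standalone helper (invented for this comment, not anything that exists in the repository) could run a given command several times and print the mean and the standard error of the wall-clock time, which would make comparing two versions a bit more principled:

#include <chrono>
#include <cmath>
#include <cstdlib>
#include <iostream>
#include <string>
#include <vector>

int main(int argc, char* argv[])
{
    if (argc < 3 || std::atoi(argv[1]) < 2)
    {
        std::cerr << "Usage: bench-stats <runs (>= 2)> <command>\n";
        return 1;
    }
    int const runs = std::atoi(argv[1]);
    std::string const command = argv[2];

    // Time each run of the benchmark command.
    std::vector<double> samples;
    for (int i = 0; i < runs; ++i)
    {
        auto start = std::chrono::steady_clock::now();
        std::system(command.c_str());
        std::chrono::duration<double> elapsed = std::chrono::steady_clock::now() - start;
        samples.push_back(elapsed.count());
    }

    // Sample mean.
    double mean = 0.0;
    for (double s: samples)
        mean += s;
    mean /= samples.size();

    // Unbiased sample variance and the standard error of the mean.
    double variance = 0.0;
    for (double s: samples)
        variance += (s - mean) * (s - mean);
    variance /= samples.size() - 1;

    std::cout << "mean: " << mean << " s, standard error: "
              << std::sqrt(variance / samples.size()) << " s\n";
    return 0;
}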

From my experience with these benchmarks, reliably measuring a difference on the order of 1-2% is hard because there are other factors impacting the results that are potentially greater than that. It depends, though: some days I'm getting pretty consistent results between runs, while other days they vary widely.

Cosmic radiation :D Still, given enough samples, you should be able to see a difference in the mean at least :)

@clonker merged commit abc46f3 into develop on Oct 21, 2024 (73 checks passed)
@clonker deleted the builtin_handles branch on October 21, 2024, 14:04
@cameel (Member) commented Oct 21, 2024

Yes, the variance is quite large. Still, especially for averaging runs, it would be good to have more precise measurements.

I'd rather solve that by making the script do the work for us and average the runs internally. The current precision is already a result of my wanting to automate things more; I used to have to round the numbers by hand, since I do a lot of these benchmarks. And when I actually need to post the results, it's when the differences are measurable and extra decimal places are just noise.

We could also add an option for precision (perhaps even making higher precision the default). So far it just hasn't seemed necessary.
