Monomorphization: Optimize constants #6711

kripken · 2024-07-03T17:22:54Z

Previously the pass would monomorphize a call when we were sending more
refined types than the target expects. This generalizes the pass to also consider
the case where we send a constant in a parameter.

To achieve that, this refactors the pass to explicitly define the "call context",
which is the code around the call (inputs and outputs) that may end up leading
to optimization opportunities when combined with the target function. Also
add comments about the overall design + roadmap.

The existing test is mostly unmodified, and the diff there is smaller when
ignoring whitespace. We do "regress" those tests by adding more local.set
operations. That comes from the refactoring, which makes things a lot simpler:
to handle the general case of an operand that either has a refined type or is a
constant, we copy it inside the function, which works either way. This
"regression" is only in the testing version of the pass (the normal version
runs optimizations, which would remove that extra code).
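
As a rough analogy in plain C++ (not Binaryen IR), the idea of copying a constant operand into the monomorphized function might look like the sketch below. The functions `pick` and `pick_mode0` are hypothetical names made up for illustration; the local assignment plays the role of the extra local.set that the testing version of the pass leaves behind.

```cpp
#include <cstdint>

// Original function: `mode` is an ordinary runtime parameter.
int32_t pick(int32_t x, int32_t mode) {
  if (mode == 0) {
    return x + 1;
  }
  return x * 2;
}

// Hypothetical specialized copy for call sites of the form pick(x, 0).
// The constant operand is moved from the call site into the function body.
int32_t pick_mode0(int32_t x) {
  // Analogous to the added local.set: the constant is written to a local,
  // and the rest of the body is an unmodified copy of the original.
  int32_t mode = 0;
  if (mode == 0) {
    return x + 1;
  }
  return x * 2;
}
```

An optimizer run after this step would be expected to fold the branch on `mode` and drop the local, which is why the extra code only shows up when optimizations are not run.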

This also enables the pass when GC is disabled. Previously we only handled
refined types, so only GC could benefit. Add a test for MVP content
specifically to show we operate there as well.

tlively

Comments on the code, will look at tests in the morning.

tlively · 2024-07-09T21:04:18Z

src/ir/module-utils.cpp

+}
+
+std::unique_ptr<Function>
+copyFunctionWithoutAdd(Function* func,


Maybe it would be better to rename copyFunction to copyAndAddFunction as a preliminary change so this can just be copyFunction? Alternatively, we could keep only the behavior of copying without adding; it's not too burdensome for callers to call module.addFunction themselves.

A downside to copyAndAddFunction (or to keeping the name but dropping the add) is that then it would not be consistent with copyGlobal, copyTable, etc. which all do the natural thing and add.

It is the not-adding that is the slightly odd thing here, but this code really benefits from it, and it seems not that big a burden to have a public API for it I think.

We definitely want consistency. IMO it would be cleanest if none of the copyXXX functions did the add because then they would all be pure, but no need to block this PR on that.
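
To make the trade-off in this thread concrete, here is a minimal sketch of the two API shapes, using toy stand-in types (`ToyFunction`, `ToyModule`) that are assumptions for illustration and not Binaryen's real classes or signatures. The pure variant returns an owned copy, and the adding variant is a thin wrapper over it.

```cpp
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Toy stand-ins, not the real Binaryen types.
struct ToyFunction {
  std::string name;
  int bodySize;
};

struct ToyModule {
  std::vector<std::unique_ptr<ToyFunction>> functions;

  // Takes ownership of the function and returns a non-owning pointer to it.
  ToyFunction* addFunction(std::unique_ptr<ToyFunction> func) {
    functions.push_back(std::move(func));
    return functions.back().get();
  }
};

// Pure copy: the module is not touched, and the caller owns the result.
std::unique_ptr<ToyFunction> copyFunctionWithoutAdd(const ToyFunction& func,
                                                    std::string newName) {
  auto copy = std::make_unique<ToyFunction>(func);
  copy->name = std::move(newName);
  return copy;
}

// Copy and add: built on the pure variant, so the two cannot drift apart.
ToyFunction* copyAndAddFunction(const ToyFunction& func,
                                ToyModule& module,
                                std::string newName) {
  return module.addFunction(copyFunctionWithoutAdd(func, std::move(newName)));
}
```

With the pure variant, a caller can discard a copy that turns out not to be useful without ever having modified the module.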

tlively · 2024-07-09T21:07:51Z

src/passes/Monomorphize.cpp

+// The empirical approach significantly reduces the need for heuristics. For
+// example, rather than have a heuristic for "see if a constant parameter flows
+// into a conditional branch," we simply run the optimizer and let it optimize
+// that case. All other cases handled by the optimizer work as well, without
+// needing to specify them as heuristics, so this gets smarter as the optimizer
+// does.


What is the performance overhead of this approach? It would be interesting to know what percent of the work turns out to be not beneficial and thrown away.

Most of the work is thrown out, I'd bet, but how much depends on heuristics not yet implemented, like we could ignore functions over a certain size, etc.

Overall I think it is fine to waste work here: that's ok if it lets us find optimization opportunities not possible any other way (which I hope is the case 😄 but have not yet proven, though I do have particular use cases that I am confident about).

Fundamentally, rigid heuristics may save work but miss opportunities, while try-it-and-see can find more in return for throwing some work away.
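
The try-it-and-see approach discussed here can be sketched roughly as follows. This is a toy model under stated assumptions: a "function body" is just a list of per-instruction costs, and `measureCost` and `tryMonomorphize` are hypothetical names, not the pass's actual code.

```cpp
#include <functional>
#include <numeric>
#include <utility>
#include <vector>

// Toy "function body": one integer cost per instruction.
using ToyBody = std::vector<int>;

// Toy cost model: the sum of all instruction costs.
int measureCost(const ToyBody& body) {
  return std::accumulate(body.begin(), body.end(), 0);
}

// Copy the body, run the optimizer on the copy, and keep the result only if
// it is measurably cheaper. Otherwise the work is thrown away and `out` is
// left untouched.
bool tryMonomorphize(const ToyBody& original,
                     const std::function<ToyBody(ToyBody)>& optimize,
                     ToyBody& out) {
  ToyBody optimized = optimize(original);
  if (measureCost(optimized) < measureCost(original)) {
    out = std::move(optimized);
    return true;
  }
  return false;
}
```

No heuristic about why the optimizer might help appears anywhere in the sketch: the only decision is a before-and-after comparison, which is the property the design comment in the diff describes.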

tlively · 2024-07-10T02:27:02Z

src/passes/Monomorphize.cpp

+      }
+    }
+
+    return dropped == other.dropped;


Maybe check this before the operands because it is so cheap?

Good idea, done.
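
A minimal sketch of the reordering suggested above, using a toy `ToyCallContext` with integer operands (an assumption for illustration; the real context holds expressions): the single-flag comparison runs first so that mismatching contexts are rejected before the operand walk.

```cpp
#include <cstddef>
#include <vector>

// Toy stand-in for the call context; operands are plain integers here.
struct ToyCallContext {
  std::vector<int> operands;
  bool dropped = false;

  bool operator==(const ToyCallContext& other) const {
    // Cheapest check first: a single boolean comparison.
    if (dropped != other.dropped) {
      return false;
    }
    if (operands.size() != other.operands.size()) {
      return false;
    }
    // Most expensive check last: compare operands one by one.
    for (std::size_t i = 0; i < operands.size(); i++) {
      if (operands[i] != other.operands[i]) {
        return false;
      }
    }
    return true;
  }
};
```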

tlively · 2024-07-10T02:33:01Z

src/passes/Monomorphize.cpp

+  bool canBeMovedIntoContext(Expression* curr) {
+    // Constant numbers, funcs, strings, etc. can all be copied, so it is ok to
+    // add them to the context.
+    return Properties::isSingleConstantExpression(curr);


It looks like this returns false for a global.get of a constant global, but that might be good to include as well.

Good point, yeah, this could capture anything "copyable" really. I added a TODO.

tlively · 2024-07-10T02:37:01Z

src/passes/Monomorphize.cpp

+[[maybe_unused]] std::ostream& operator<<(std::ostream& o,
+                                          wasm::CallContext& context) {


If you name this dump() or something, it will be callable from a debugger, and therefore even more useful.

I never use a debugger myself, so I would never have thought of that 😄 Done.

tlively · 2024-07-10T02:42:30Z

src/passes/Monomorphize.cpp

+      // TODO: check for either a size decrease (always good) or a significant
+      //       speed increase (as a tiny one, in a huge function, can lead to
+      //       wasteful duplicated code)


Why is code size decreasing always good? Do we add the monomorphized function only when we can entirely replace the original function?

Good catch, yeah, this TODO makes more sense for the case we can entirely replace, which is not the case here. I revised it.

tlively · 2024-07-10T02:47:25Z

src/passes/Monomorphize.cpp

+    }
+
+    // The main body of the function is simply copied from the original.
+    auto* newBody = ExpressionManipulator::copy(func->body, wasm);


Haven't we already made a copy of the body when we copied the function? Why do we need another copy?

Good catch, this was a leftover from an older approach. Fixed.

tlively

LGTM!

kripken added 30 commits June 5, 2024 14:32

work

f6bd1b0

note

dd673f8

note

3232bae

Merge remote-tracking branch 'origin/main' into mono.moar

4517e55

Merge remote-tracking branch 'origin/main' into mono.moar

d47f4c2

comments

c876ff5

comments

f804c95

comments

6817d67

work

7c975c3

work

38488b9

work

0877c0a

work

bc47504

format

a8fbc5e

Merge remote-tracking branch 'origin/main' into mono.moar

a11b9d3

work

38ab29a

work

3339ef7

work

d978b55

work

52ad9f0

work

e428820

bad

996a95d

work

2aa99fa

work

235c5fd

work

9bb4af5

work

7ca4545

work

4d04aea

work

e26d753

work

e48ebb8

work

a669e47

work

1d598ef

work

aaacb45

kripken added 14 commits July 2, 2024 16:47

work

9c2d3fa

work

b4933b2

work

62fb4eb

work

4742228

work

8bb827f

fix

6bedb13

format

d24bb59

Merge remote-tracking branch 'origin/main' into mono.moar

cb50b7c

work

2091277

work

8ddbc3e

work

d781015

work

2aa8509

work

b4f6a41

work

b3fce26

kripken requested a review from tlively July 3, 2024 17:22

kripken added 3 commits July 3, 2024 11:16

fix

a3b8153

fix test

57da116

format

533e849

tlively reviewed Jul 10, 2024

kripken added 7 commits July 10, 2024 15:26

Merge remote-tracking branch 'myself/mono.moar' into mono.moar

decabfb

feedback: move cheaper check earlier

0a8e2ab

feedback: TODO for global.get etc.

649e8ad

feedback: rename debug method

b6cfd63

format

e8df956

feedback: improve TODO

d32d44a

feedback: remove second copy of function body

89b6bd4

tlively approved these changes Jul 11, 2024

kripken merged commit 5a1daf7 into WebAssembly:main Jul 11, 2024

kripken deleted the mono.moar branch July 11, 2024 17:31

gkdn mentioned this pull request Aug 31, 2024

stringconsts gkdn/binaryen#1

Closed
