[interpreter] Begin implementation of a new interpreter #7227

tlively · 2025-01-18T00:26:08Z

The current interpreter used in wasm-shell, the fuzzer, and
optimizations like precompute works by recursively walking the
expression tree and computing expression results as it goes. This kind
of recursive interpretation is not going to work for stack switching,
since stack switching requires stashing context away and restoring it
later. The recursive interpreter stores intermediate values on the
native stack and returns early to implement control flow, so there is no
way to suspend a computation and resume it later.

To support stack switching and support other use future interpreter use
cases such as running the full spec test suite and fuzzing multithreaded
programs, introduce a new interpreter that is not recursive and does not
store intermediate state that needs to persist beyond the execution of a
single instruction on the native stack. The new interpreter works by
iterating through instructions and visiting them one at a time in a
loop. The visitor pushes and pops values from a stack and signals
control flow via its return values. Control flow transfers are handled
by the main interpreter loop, so expressions are only visited when they
are actually executed. This design will not only support stack switching
and other features better than the old interpreter, but it will also
significantly decrease the amount of code in the interpreter.

In addition to the core interpreter loop, also lay out a skeleton of the
execution context for the new interpreter, including a call stack and
store. The code contains several TODOs describing how these runtime
structures will need to be extended to support interpreting the full
spec test suite, including the ability to interpret over multiple linked
instances at once.

Most of the actual interpretation of expressions is left as future work,
but the interpretation of Const expressions and i32.add is implemented
and tested in a new gtest file to demonstrate it working end-to-end. One
of the first milestones for the new interpreter will be getting real
spec tests running with it, at which point the gtest file can be
removed.

The current interpreter used in wasm-shell, the fuzzer, and optimizations like precompute works by recursively walking the expression tree and computing expression results as it goes. This kind of recursive interpretation is not going to work for stack switching, since stack switching requires stashing context away and restoring it later. The recursive interpreter stores intermediate values on the native stack and returns early to implement control flow, so there is no way to suspend a computation and resume it later. To support stack switching and support other use future interpreter use cases such as running the full spec test suite and fuzzing multithreaded programs, introduce a new interpreter that is not recursive and does not store intermediate state that needs to persist beyond the execution of a single instruction on the native stack. The new interpreter works by iterating through instructions and visiting them one at a time in a loop. The visitor pushes and pops values from a stack and signals control flow via its return values. Control flow transfers are handled by the main interpreter loop, so expressions are only visited when they are actually executed. This design will not only support stack switching and other features better than the old interpreter, but it will also significantly decrease the amount of code in the interpreter. In addition to the core interpreter loop, also lay out a skeleton of the execution context for the new interpreter, including a call stack and store. The code contains several TODOs describing how these runtime structures will need to be extended to support interpreting the full spec test suite, including the ability to interpret over multiple linked instances at once. Most of the actual interpretation of expressions is left as future work, but the interpretation of `Const` expressions and i32.add is implemented and tested in a new gtest file to demonstrate it working end-to-end. One of the first milestones for the new interpreter will be getting real spec tests running with it, at which point the gtest file can be removed.

src/interpreter/store.h

kripken · 2025-01-21T19:22:11Z

src/interpreter/store.h

+
+// TODO: generalize this so different users can override memory loads and
+// stores, etc.
+struct WasmStore {


This is "store" in the sense of the spec, I assume?

Yes. Unfortunately just calling it Store caused too many conflicts with our Store expression class.

kripken · 2025-01-21T19:22:25Z

src/interpreter/store.h

+  std::vector<Frame> callStack;
+
+  Frame& getFrame() {
+    assert(callStack.size());


Suggested change

assert(callStack.size());

assert(!callStack.empty());

kripken · 2025-01-21T19:23:29Z

src/interpreter/store.h

+    return callStack.back();
+  }
+
+  Literal pop() { return getFrame().pop(); }


From the name, I expected this to pop the stack frame, but it looks like it gets the stack frame and pops inside that. So it is popping from the value stack and not the call stack?

Right. push and pop are going to be the most common operations that affect the interpreter context, so everything has helpers to make them as easy as possible to use.

sgtm. might be worth a comment to avoid the possible confusion.

kripken · 2025-01-21T19:25:05Z

src/interpreter/interpreter.cpp

+ * limitations under the License.
+ */
+
+#include "interpreter.h"


Suggested change

#include "interpreter.h"

#include "interpreter/interpreter.h"

I believe we generally do includes from src. There is potential ambiguity if we also use local paths in the same dir.

kripken · 2025-01-21T19:27:17Z

src/interpreter/expression-iterator.h

+
+// TODO: This is a quick and dirty hack. We should implement a proper iterator
+// in ir/iteration.h that keeps only a vector of (Expression*, index) pairs to
+// find the current location in the epxression tree.


Suggested change

// find the current location in the epxression tree.

// find the current location in the expression tree.

kripken · 2025-01-21T19:30:30Z

src/interpreter/expression-iterator.h

+
+  // The list of remaining instructions in reverse order so we can pop from the
+  // back to advance the iterator.
+  std::vector<Expression*> exprs;


How will this work with loops etc.?

Generally, I was guessing this would use BinaryenIR itself. We could track the location of the frame pointer using a simple indexing, a vector of indexes basically,

[ index at toplevel, index at the higher level, index at a higher level still, ...]

So [1, 0] would mean we are at the second instruction at the toplevel, and the first of its children, etc. If we keep pointers each of the children then we can quickly traverse between them etc.

Oh, actually we have ChildIterator already. So a stack of ChildIterators could work?

Yep, that's what the TODO above the class is about. If for some reason we wanted to implement loops before implementing that better expression iterator, we would have to copy the vector-based iterator at each loop header and store it in a map on the frame. There are TODOs for those parts in Frame and the ExpressionIterator constructor. It's probably better to implement the better iterators first, though.

All other control flow transfers are forward edges, so they are very simple with the vector-based iterators. You just increment the iterator until you find the block matching the target.

Can we use ChildIterator for this iteration?

Yes, a stack of ChildIterators would work if we don't mind the memory overhead of materializing the children of each expression in the current path. Alternatively we could do something more specialized that only stores an Expression* and an Index for each expression in the path.

ChildIterator contains SmallVector<Expression**, 4> children;. In principle it could be reimplemented to contain only a single Expression* and an index.

But then each step would need to find the children again? That seems likely to be slower.

I might think otherwise if memory caching issues were possible, like it we were storing large amounts of such vectors. But typically they would be quite small?

Though, I am open to being convinced otherwise of course! And if that is faster, we can implement that in ChildIterator as an optimization?

Yeah, it might not matter, and if it does, improving ChildIterator would make sense.

I've updated the TODO to capture all of this.

kripken

Great start!

Building on top of #7227, i32.mul is implemented and tested.

Building on top of #7227, i32.sub is implemented and tested.

Building on top of #7227, i32.mul is implemented and tested.

Building on top of #7227, the following are implemented and tested: - f32.add - f32.sub - f32.mul - f32.div - f32.sqrt - f32.ceil - f32.floor - f32.trunc - f32.nearbyint

Building on top of #7227 , the following are implemented and tested: - f64.add - f64.sub - f64.mul - f64.div - f64.sqrt - f64.ceil - f64.floor - f64.trunc - f64.nearbyint

Building on top of #7227, the following are implemented and tested: - i64.add - i64.sub - i64.mul - i64.eq - i64.ltS - i64.ltU - i64.gtS - i64.gtU While here, I added in relevant cases that leveraged the code added for the above instructions. - i32.eq - f32.eq - f64.eq - i32.ltS - i32.ltU - i32.gtS - i32.ltU

Building on #7227, implemented and tested the following: - i32/i64.and - i32/i64.or - i32/i64.xor - i32/i64.shl - i32/i64.shrU - i32/i64.shrS - i32/i64.rotL - i32/i64.rotR

tlively requested review from kripken and ashleynh January 18, 2025 00:26

tlively mentioned this pull request Jan 18, 2025

Is it possible avoid c++ exeptions? #2917

Open

tlively added 2 commits January 20, 2025 11:05

Merge branch 'main' into new-interpreter

0a41bfe

Merge branch 'main' into new-interpreter

61715d3

kripken reviewed Jan 21, 2025

View reviewed changes

tlively added 2 commits January 21, 2025 20:29

address comments

26affe1

more childiterator comment

dafe3ff

kripken approved these changes Jan 22, 2025

View reviewed changes

tlively merged commit 09300e4 into main Jan 22, 2025
13 checks passed

tlively deleted the new-interpreter branch January 22, 2025 21:21

tlively mentioned this pull request Jan 31, 2025

How to get a call stack when wasm-ctor-eval stops? #7255

Open

This was referenced Feb 1, 2025

[Interpreter] i32.sub #7259

Merged

[Interpreter] i32.mul #7260

Merged

ashleynh added a commit that referenced this pull request Feb 1, 2025

[Interpreter] i32.mul (#7260)

3815594

Building on top of #7227, i32.mul is implemented and tested.

ashleynh added a commit that referenced this pull request Feb 1, 2025

[Interpreter] i32.sub (#7259)

6fe5103

Building on top of #7227, i32.sub is implemented and tested.

ashleynh mentioned this pull request Feb 3, 2025

[Interpreter] i32.mul #7268

Merged

ashleynh added a commit that referenced this pull request Feb 3, 2025

[Interpreter] i32.mul (#7268)

4d41559

Building on top of #7227, i32.mul is implemented and tested.

ashleynh mentioned this pull request Feb 25, 2025

[Interpreter] Float32 #7325

Merged

ashleynh added a commit that referenced this pull request Feb 25, 2025

[Interpreter] Float32 (#7325)

609bcec

Building on top of #7227, the following are implemented and tested: - f32.add - f32.sub - f32.mul - f32.div - f32.sqrt - f32.ceil - f32.floor - f32.trunc - f32.nearbyint

This was referenced Feb 25, 2025

[Interpreter] Float64 #7327

Merged

[Interpreter] I64 #7329

Merged

ashleynh added a commit that referenced this pull request Feb 26, 2025

[Interpreter] Float64 (#7327)

4ea373b

Building on top of #7227 , the following are implemented and tested: - f64.add - f64.sub - f64.mul - f64.div - f64.sqrt - f64.ceil - f64.floor - f64.trunc - f64.nearbyint

ashleynh mentioned this pull request Feb 27, 2025

[Interpreter] i32/i64 Bitwise Operators #7332

Merged

	#include "interpreter.h"
	#include "interpreter/interpreter.h"

	// find the current location in the epxression tree.
	// find the current location in the expression tree.

[interpreter] Begin implementation of a new interpreter #7227

[interpreter] Begin implementation of a new interpreter #7227

Uh oh!

Conversation

tlively commented Jan 18, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kripken left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!