Change an `assert_malformed` to `assert_invalid` #43

alexcrichton · 2025-04-11T15:00:43Z

Inspired by changes in bytecodealliance/wasm-tools#2134 and intended to reflect how the maximum page size is an artifact of validation, not binary parsing.

alexcrichton · 2025-04-11T15:03:55Z

There's some discussion about this here as well. One consequence of this change is that large values of the page size wouldn't be easily representable in the text format so printing an invalid module could be "weird"

rossberg · 2025-04-11T16:10:35Z

There is precedence for this in terms of load/store alignment, which also is represented in log2 in the binary format, while the text format in fact is specified to only allow for a u32 non-logarithmic value. And yet checking the max value is part of validation. Technically, that breaks the intended inter-convertibility between the two formats, which is annoying.

In principle, we could easily allow the text format to support the whole value range, but that would require every parser to effectively use bigints to parse logarithmic constants.

keithw · 2025-04-11T17:33:41Z

@rossberg Hmm, that hadn't been my understanding. In the spec main branch, a binary align of 32 or greater is assert_malformed in the binary format (tests at https://github.com/WebAssembly/spec/blob/main/test/core/align.wast#L890, and here's the require (I32.lt_u align 32l) in the reference interpreter binary decoder: https://github.com/WebAssembly/spec/blob/main/interpreter/binary/decode.ml#L223).

As far as I knew, any well-formed alignment in the binary format can currently be represented in the text format and vice versa. It would sort of be nice to keep that principle if possible?

fitzgen

LGTM but I haven't followed the malformed vs invalid discussions in detail, so I am hesitant to merge this until more folks sign off on this / there seems like there's consensus that this is what we want.

alexcrichton · 2025-04-15T22:05:14Z

Makes sense, although I'm not keen on championing this so I'll close this out.

rossberg · 2025-04-22T10:52:45Z

@keithw,

With Wasm 3.0 and multi-memory, we have repurposed the alignment immediate in the binary format as a bitfield, with only 6 bits used for the actual alignment. So any log value larger than 2^6 is now unrepresentable and hence malformed (like in the test you link). Any value between that and the natural alignment will be a validation error, however (see tests starting at https://github.com/WebAssembly/spec/blob/main/test/core/align.wast#L305, which are converted to binary by the harness).

The text format isn't quite consistent with that and still allows any u32 alignment value to go through parsing (and then result in a validation error). Perhaps we should change that.

Relevant spec links:

Binary format: https://wasm-dsl.github.io/spectec/core/binary/instructions.html#binary-memarg
Text format: https://wasm-dsl.github.io/spectec/core/text/instructions.html#text-memarg
Validation: https://wasm-dsl.github.io/spectec/core/valid/instructions.html#t-mathsf-xref-syntax-instructions-syntax-instr-memory-mathsf-load-n-mathsf-xref-syntax-instructions-syntax-sx-mathit-sx-x-xref-syntax-instructions-syntax-memarg-mathit-memarg

rossberg · 2025-04-22T11:08:55Z

Small correction: the tests you have linked do not actually overflow the binary rep. But they are in fact outdated. With 3.0/multi-memory, they have been changed to invalid:

https://github.com/WebAssembly/spec/blob/wasm-3.0/test/core/align.wast#L890

Change an assert_malformed to assert_invalid

4e41796

Inspired by changes in bytecodealliance/wasm-tools#2134 and intended to reflect how the maximum page size is an artifact of validation, not binary parsing.

alexcrichton mentioned this pull request Apr 11, 2025

wasmparser: detect "malformed" cases in parser alone (without validator) bytecodealliance/wasm-tools#2134

Merged

fitzgen approved these changes Apr 15, 2025

View reviewed changes

alexcrichton closed this Apr 15, 2025

alexcrichton deleted the patch-1 branch April 15, 2025 22:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change an `assert_malformed` to `assert_invalid` #43

Change an `assert_malformed` to `assert_invalid` #43

Uh oh!

alexcrichton commented Apr 11, 2025

Uh oh!

alexcrichton commented Apr 11, 2025

Uh oh!

rossberg commented Apr 11, 2025

Uh oh!

keithw commented Apr 11, 2025

Uh oh!

fitzgen left a comment

Uh oh!

alexcrichton commented Apr 15, 2025

Uh oh!

rossberg commented Apr 22, 2025 •

edited

Loading

Uh oh!

rossberg commented Apr 22, 2025

Uh oh!

Uh oh!

Change an assert_malformed to assert_invalid #43

Change an assert_malformed to assert_invalid #43

Uh oh!

Conversation

alexcrichton commented Apr 11, 2025

Uh oh!

alexcrichton commented Apr 11, 2025

Uh oh!

rossberg commented Apr 11, 2025

Uh oh!

keithw commented Apr 11, 2025

Uh oh!

fitzgen left a comment

Choose a reason for hiding this comment

Uh oh!

alexcrichton commented Apr 15, 2025

Uh oh!

rossberg commented Apr 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rossberg commented Apr 22, 2025

Uh oh!

Uh oh!

Change an `assert_malformed` to `assert_invalid` #43

Change an `assert_malformed` to `assert_invalid` #43

rossberg commented Apr 22, 2025 •

edited

Loading