Handle exactness in MinimizeRecGroups #7555

tlively · 2025-04-26T01:30:04Z

Treat rec groups differing only in the exactness of a reference as
different, but only when custom descriptors is enabled. When custom
descriptors is not enabled, exactness will be erased before the binary
is written, so if two minimized rec groups differed only in exactness,
they would in fact be written as the same rec group. This would change
the behavior of casts meant to differentiate between types in that rec
group, so it would be incorrect.

Use the standard utility rather than reimplementing type intersection. The new code is simpler, shorter, and properly supports exactness, avoiding an assertion failure in the added test case. The other functional change is that when one of the intersected heap types is bottom and the type GLB is a non-nullable reference to bottom, the result of the intersection is `None` where it was previously a `Cone`.

Previously doing e.g. `type.with(HeapType::none)` would cause an assertion failure if `type` was exact because `.with()` would only replace the heap type and exact references to basic heap types are disallowed. Rather than checking for and avoiding this error in all the callers, simply drop exactness when `.with()` is called with a basic heap type. This is reasonable behavior because the only alternative is never correct. Add a test that hits an assertion failure without this fix. AbstractTypeRefining replaces a defined type with `none` and the type updating utility does not check whether the new heap type is basic before doing the replacement.

When custom descriptors are disabled, validate that public types do not contain exact references. If they did, we would drop the exactness and change the identity of the public type during binary writing, which would be incorrect. This still allows internal usage of exact types without custom descriptors enabled, and it is up to the individual passes to ensure that the eventual erasing of exactness does not cause any problems.

Treat rec groups differing only in the exactness of a reference as different, but only when custom descriptors is enabled. When custom descriptors is not enabled, exactness will be erased before the binary is written, so if two minimized rec groups differed only in exactness, they would in fact be written as the same rec group. This would change the behavior of casts meant to differentiate between types in that rec group, so it would be incorrect.

kripken · 2025-04-28T16:11:19Z

Were we not thinking that we could first measure the benefit of lifting/optimizing/lowering exact types when exact types are not enabled, and only after that, consider the work to make all the transforms etc. be modulo exact types? I recall that was the direction we discussed but maybe we didn't fully agree? Or is this necessary for the measurement somehow? (but it seems like we could measure first, ignoring possible bugs like this, as an upper bound?)

tlively · 2025-04-28T17:19:09Z

Yes, I think that investigation makes sense, but this pass is special because its optimization could easily be partially or fully undone by any following type rewriting. Any type rewriting pass that tries to avoid undoing rec group minimization will have to share quite a bit of the conflict resolution logic with rec group minimization anyway, so there will be no way around updating the wasm-type-shape utilities to take features into account either way.

tlively · 2025-04-28T17:21:37Z

I'm generally in favor of ignoring bugs like this for now (and in particular I'm in favor of ignoring the class of bugs where a sequence of type optimizations followed by lowering causes conflicts), but the fuzzer found this bug fairly easily, so I think it's worth fixing.

kripken · 2025-04-28T20:20:45Z

Any type rewriting pass that tries to avoid undoing rec group minimization will have to share quite a bit of the conflict resolution logic with rec group minimization anyway

Wait, do we not expect rec group minimization to happen at the very end anyhow? (in which case no type rewriting happens later) What would be the benefit of minimizing rec groups in the middle?

tlively · 2025-04-28T22:12:48Z

Oh, you're thinking that first we would run the lowering pass to remove exactness and then afterward we would minimize rec groups? That means that the separate lowering pass would have to be run explicitly, which would require users to know to run it. It would also complicate validation, since whether allocations are expected to be exact or not would depend on whether lowering has happened or not.

kripken · 2025-04-28T22:25:11Z

It does seem simpler to lower first and then minify.

One option would be to require the lowering to be done explicitly, but if we minify in -O3 then we can just put the lowering earlier?

Alternatively, if users are expected to explicitly minify, then having them explicitly lower seems reasonable as well. (We could guide them to it by the minify pass erroring clearly.)

If we aren't certain how this will end up (and we do still need to measure the benefit of non-CD exactness), then doing it explicitly for now might make sense?

tlively · 2025-04-30T00:05:48Z

@kripken, is this ready for a second look, given our offline discussion?

kripken

For the record here, offline @tlively convinced me that it makes more sense to optimize modulo exactness (ignore exactness when the feature is not enabled), then strip away exactness in binary writing, than the option of a proper lowering pass. While we do need to consider exactness in the optimizer, otoh this approach lets the IR look the same whether or not the feature for exactness is present (avoiding a situation where say struct.new would be typed differently in the IR, based on features, which could be noticed in many places).

So this approach keeps the IR simpler even if it does add another invariant to keep in mind. Overall it is simpler I think.

@tlively please document this new invariant on the IR in the readme section on the IR, if we haven't already.

tlively added 4 commits April 24, 2025 18:52

tlively requested a review from kripken April 26, 2025 01:30

tlively added 9 commits April 29, 2025 12:05

Merge branch 'main' into with-basic-inexact

fb34a94

Merge branch 'with-basic-inexact' into validate-public-exact

3f54bcc

Merge branch 'validate-public-exact' into minimize-groups-exact

0e7f196

Merge branch 'main' into validate-public-exact

140d72d

Merge branch 'validate-public-exact' into minimize-groups-exact

d7c2407

Merge branch 'main' into validate-public-exact

d9e353c

Merge branch 'validate-public-exact' into minimize-groups-exact

2bcf829

vertical space

64fc4d1

Merge branch 'validate-public-exact' into minimize-groups-exact

b25e9e4

kripken approved these changes Apr 30, 2025

View reviewed changes

Base automatically changed from validate-public-exact to main April 30, 2025 04:20

Merge branch 'main' into minimize-groups-exact

1d27cae

tlively enabled auto-merge (squash) April 30, 2025 04:36

tlively merged commit 762dd9c into main Apr 30, 2025
14 checks passed

tlively deleted the minimize-groups-exact branch April 30, 2025 05:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle exactness in MinimizeRecGroups #7555

Handle exactness in MinimizeRecGroups #7555

tlively commented Apr 26, 2025

kripken commented Apr 28, 2025

tlively commented Apr 28, 2025

tlively commented Apr 28, 2025

kripken commented Apr 28, 2025

tlively commented Apr 28, 2025

kripken commented Apr 28, 2025

tlively commented Apr 30, 2025

kripken left a comment

Handle exactness in MinimizeRecGroups #7555

Handle exactness in MinimizeRecGroups #7555

Conversation

tlively commented Apr 26, 2025

kripken commented Apr 28, 2025

tlively commented Apr 28, 2025

tlively commented Apr 28, 2025

kripken commented Apr 28, 2025

tlively commented Apr 28, 2025

kripken commented Apr 28, 2025

tlively commented Apr 30, 2025

kripken left a comment

Choose a reason for hiding this comment