Add --fast-math mode #3155

kripken · 2020-09-21T16:52:14Z

Similar to clang and gcc, --fast-math makes us ignore corner cases of floating-point
math, such as changes to NaN bits and (not done yet) the lack of associativity.

This undoes some changes (#2958 and #3096) where we assumed it was
ok to not change NaN bits, but @binji corrected us. We can only do such things in fast
math mode. This puts those optimizations behind that flag, adds tests for it, and
restores the interpreter to the simpler code from before with no special cases.
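
The associativity corner case mentioned above can be checked directly. This is a minimal sketch, assuming IEEE 754 doubles evaluated at double precision and a compiler that is not itself run with a fast-math flag; `sumLeft` and `sumRight` are illustrative names, not part of Binaryen.

```cpp
#include <cstdio>

// Sum three doubles grouped as (a + b) + c.
double sumLeft(double a, double b, double c) { return (a + b) + c; }

// Sum the same three doubles grouped as a + (b + c).
double sumRight(double a, double b, double c) { return a + (b + c); }

// Print both groupings with enough digits to expose a rounding difference.
void printBothGroupings(double a, double b, double c) {
  std::printf("(a + b) + c = %.17g\n", sumLeft(a, b, c));
  std::printf("a + (b + c) = %.17g\n", sumRight(a, b, c));
}
```

Calling `printBothGroupings(0.1, 0.2, 0.3)` shows two results that differ in the last bit, which is why reassociation is only safe once the user has opted out of strict IEEE behavior.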

MaxGraey · 2020-09-21T17:13:21Z

Could --fast-math be an enum mode? For example:
--fast-math 0 - strictest mode, preserves NaN bits.
--fast-math 1 - ignore NaN bits. Enables x - 0 -> x, x * 1 -> x, -1 * x -> -x
--fast-math 2 - ignore NaN bits and -0 / +0 consistency. Enables the reduction x + 0 -> x
--fast-math 3 - ignore NaNs entirely (for example: x == x -> 1, x - x -> 0.0). Also ignore precision loss. Enables ~~x + C - C -> x,~~ x / C -> x * (1 / C) even if C is not a power of two, and so on.
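
The corner cases that separate these levels can be observed with a few small checks. This is only a sketch, assuming IEEE 754 doubles, the default round-to-nearest mode, and a compiler not run with fast-math flags; the helper names are made up for illustration.

```cpp
#include <cmath>
#include <limits>

// x + 0.0 is not an identity: it turns -0.0 into +0.0 (the level 2 concern).
bool addZeroKeepsSign(double x) {
  return std::signbit(x + 0.0) == std::signbit(x);
}

// x - 0.0 keeps the sign of zero, so only NaN bits are at stake (level 1).
bool subZeroKeepsSign(double x) {
  return std::signbit(x - 0.0) == std::signbit(x);
}

// x == x is false exactly when x is a NaN (the level 3 concern).
bool equalsSelf(double x) { return x == x; }

// x - x is zero for finite x, but NaN for infinities and NaNs (level 3).
bool subSelfIsZero(double x) { return (x - x) == 0.0; }
```

For example, `addZeroKeepsSign(-0.0)` is false while `subZeroKeepsSign(-0.0)` is true, which matches the split between levels 1 and 2 in the list.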

kripken · 2020-09-21T19:07:49Z

@MaxGraey yes, it may make sense to have something more fine-grained eventually. @sunfishcode mentioned that this is what clang and gcc have, with separate flags for NaNs and such.

I think that's a reasonable thing to aim for. For now, I think this PR is a good first step. It fixes fuzzer failures, gets us back to emitting correct code, and avoids just ripping out the relevant optimizations, instead putting them behind a flag.

binji

nice, lgtm (without knowing much about binaryen stuff)

tlively · 2020-09-22T05:22:00Z

src/passes/OptimizeInstructions.cpp

@@ -1369,7 +1372,8 @@ struct OptimizeInstructions
    }
    {
      double value;
-      if (matches(curr, binary(Abstract::Sub, any(), fval(&value))) &&
+      if (fastMath &&
+          matches(curr, binary(Abstract::Sub, any(), fval(&value))) &&
          value == 0.0) {
        // x - (-0.0)   ==>   x + 0.0


What's wrong about this optimization without fastMath? Is it that it doesn't change non-canonical NaN bits but it should?

Hmm, actually, re-reading it now I'm not sure. It doesn't remove a math operation (unlike the others), so it would still change NaNs as expected, I think? I guess we'd need to read the spec (wasm? IEEE?) carefully. If no one knows offhand, the safe thing may be to land this with a TODO for later.

Interesting article:
https://754r.ucbtest.org/background/nan-propagation.pdf
See "What should happen when two payloads are combined?"

And yes, it seems that IEEE 754 doesn't specify how two NaNs with different payloads should be combined. I guess this should be reflected in the wasm spec tests.

By the way, the WebAssembly spec specifies NaN-propagation rules for fneg, fabs and fcopysign:
https://webassembly.github.io/spec/core/bikeshed/index.html#nan-propagation%E2%91%A0
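
To make "NaN bits" concrete, here is a bit-level sketch of a 64-bit float and of the three sign operations mentioned above, modeled as pure bit manipulation. It assumes the IEEE 754 binary64 layout (1 sign bit, 11 exponent bits, 52 fraction bits); the function and constant names are illustrative and not taken from Binaryen or the spec.

```cpp
#include <cstdint>
#include <cstring>

// Reinterpret a double as its 64 raw bits.
uint64_t toBits(double d) {
  uint64_t bits;
  std::memcpy(&bits, &d, sizeof bits);
  return bits;
}

constexpr uint64_t kSignMask = 0x8000000000000000ULL;
constexpr uint64_t kExponentMask = 0x7FF0000000000000ULL;
// For a NaN, the fraction field holds the quiet bit and the payload.
constexpr uint64_t kFractionMask = 0x000FFFFFFFFFFFFFULL;

// A NaN has an all-ones exponent and a nonzero fraction.
bool isNaNBits(uint64_t bits) {
  return (bits & kExponentMask) == kExponentMask &&
         (bits & kFractionMask) != 0;
}

uint64_t payload(uint64_t bits) { return bits & kFractionMask; }

// Sign-only operations: they touch the sign bit and leave the payload alone.
uint64_t fnegBits(uint64_t bits) { return bits ^ kSignMask; }
uint64_t fabsBits(uint64_t bits) { return bits & ~kSignMask; }
uint64_t fcopysignBits(uint64_t value, uint64_t signSource) {
  return (value & ~kSignMask) | (signSource & kSignMask);
}
```

With this model, `payload(fnegBits(n)) == payload(n)` holds for every NaN `n`. Arithmetic operations such as add and sub give no such guarantee, which is the open question in this thread.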

src/passes/OptimizeInstructions.cpp

sunfishcode · 2020-09-22T16:14:31Z

src/pass.h

+  // and assuming math follows the algebraic rules for associativity and so
+  // forth (which IEEE floats do not, strictly speaking). This is inspired by
+  // gcc/clang's -ffast-math flag.
+  bool fastMath = false;


"Fast math" has many meanings, including ignoring negative zero, ignoring infinities, ignoring NaNs, ignoring signaling NaN, allowing for greater precision, and allowing for reduced precision. GCC and clang have moved to have several different flags for these things, as no single definition of "fast math" works for everyone. I encourage Binaryen to follow GCC and clang here.

Agreed, @sunfishcode , that is the plan. This is just the first step.

What is the status of this plan?

@sunfishcode So far the use cases for the fast math flag have only been ignoring NaNs, AFAIK. So we've not added more specific flags. I imagine we will when we start to optimize them.

Do you have more use cases or ideas perhaps?

With more levels we could do more floating-point optimizations:
#3155 (comment)

A separate ignoreNaNs flag, which could also be enabled by fastMath, would allow users that want to ignore NaNs to do so without having to know this implementation detail about Binaryen, and without opting into unknown optimizations in future versions of Binaryen.
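
One way such a split could look is sketched below. This is a hypothetical illustration only: `FloatOptions`, `canIgnoreNaNs` and `mayFoldSubZero` are invented names and do not reflect Binaryen's actual option structure or the follow-up PR.

```cpp
// Hypothetical option set; not Binaryen's real API.
struct FloatOptions {
  // Umbrella flag: may enable further relaxations in later versions.
  bool fastMath = false;
  // Narrow flag: only permits rewrites that can change NaN bits.
  bool ignoreNaNs = false;

  // fastMath implies ignoreNaNs, but not the other way around.
  bool canIgnoreNaNs() const { return fastMath || ignoreNaNs; }
};

// Toy decision for a rewrite such as x - 0.0 ==> x, which removes an
// operation and can therefore leave NaN bits different from a real subtract.
bool mayFoldSubZero(const FloatOptions& options) {
  return options.canIgnoreNaNs();
}
```

Passes would then query the narrow predicate instead of `fastMath` directly, so a user who sets only `ignoreNaNs` is not opted into whatever `fastMath` grows to cover later.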

Makes sense.

Is this something you'd use soon @sunfishcode ? If so I can open a PR later today probably. (Or maybe @MaxGraey you'd want to?) If it's not urgent we could open an issue so we don't forget.

@kripken If this PR would only add a separate ignoreNaNs flag, then I think it would be better for you to open it. I might do a more substantial PR adding a few levels for fastMath a bit later.

Ok, I found some time, PR up: #4262

Co-authored-by: Thomas Lively <7121787+tlively@users.noreply.github.com>

kripken · 2020-09-30T19:04:47Z

Added the review suggestion change, and the fuzzer meanwhile found another case I missed, c4cfcba

kripken added 13 commits September 18, 2020 14:59

fix

c7dd870

fix comment

71e9ecb

fix comment

95a27c0

fix

87aec7d

fix

5f85c59

fix spec test

b6a9f3c

update

9ec23ee

Merge remote-tracking branch 'origin/master' into addsub

32646b7

rework

80270a1

format

26c1e06

fix

e7f5a9a

format

2936b8a

more

301c14e

kripken requested review from binji, tlively and aheejin September 21, 2020 16:52

changelog

e661403

kripken mentioned this pull request Sep 21, 2020

Don't change NaN bits on trivial add/sub in the interpreter #3143

Closed

binji approved these changes Sep 21, 2020


tlively approved these changes Sep 22, 2020


sunfishcode reviewed Sep 22, 2020


kripken and others added 4 commits September 30, 2020 11:52

Update src/passes/OptimizeInstructions.cpp

10a814a

Co-authored-by: Thomas Lively <7121787+tlively@users.noreply.github.com>

Merge remote-tracking branch 'origin/master' into ff

4c10bc5

another case

c4cfcba

test

dfef85a

kripken merged commit 0704710 into master Sep 30, 2020

kripken deleted the ff branch September 30, 2020 19:39

This was referenced Oct 1, 2020

Add javascript api for fast-math option #3188

Merged

fast-math: Fold float multiplication by minus one to unary negative #3189

Merged

arunetm mentioned this pull request Jan 12, 2021

Proposal: Fp IEEE compliance level flags for Wasm (FP-Fast-Math for Wasm Scalar & SIMD) WebAssembly/design#1393

Closed
