Inefficient x64 codegen for fmin/fmax #186

While attempting to implement [`fmin` and `fmax`](https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#floating-point-min-and-max), I observed that the semantics of these instructions prevent a single-instruction lowering on x64. V8 has a 9-instruction lowering for [F32x4Min](https://github.com/v8/v8/blob/19be4913881bb02c5d9b4f1c7547ee2d1273120b/src/compiler/backend/x64/code-generator-x64.cc#L2531-L2548), for example, and the other min/max implementations for F32x4/F64x2 are no better.
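To make the mismatch concrete, here is a minimal scalar sketch of one lane, assuming the usual description of x64 `MINPS` (the second operand is returned whenever the comparison is false, which covers NaN inputs and the case where both inputs are zero) and the proposal's `fmin` rule (a NaN input yields NaN, and -0 is treated as less than +0). The function names are hypothetical and this is not V8's lowering; returning the `NAN` macro is just one NaN result I believe is permitted.

```c
#include <assert.h>
#include <math.h>

/* Hypothetical model of one lane of x64 MINPS: the second operand is
   returned whenever (a < b) is false, including NaN and +0/-0 cases. */
static float x64_minps_lane(float a, float b) {
    return (a < b) ? a : b;
}

/* Hypothetical model of one lane of Wasm fmin: NaN propagates and
   -0 is treated as smaller than +0. */
static float wasm_fmin_lane(float a, float b) {
    if (isnan(a) || isnan(b)) {
        return NAN; /* assumption: a quiet NaN with an empty payload */
    }
    if (a == 0.0f && b == 0.0f) {
        return signbit(a) ? a : b; /* prefer the negative zero */
    }
    return (a < b) ? a : b;
}

/* Inputs on which the two models disagree. */
static void check_lane_models(void) {
    /* NaN in the first operand: MINPS returns the second operand. */
    assert(x64_minps_lane(NAN, 1.0f) == 1.0f);
    assert(isnan(wasm_fmin_lane(NAN, 1.0f)));

    /* Signed zeros: MINPS returns the second operand, here +0. */
    assert(!signbit(x64_minps_lane(-0.0f, 0.0f)));
    assert(signbit(wasm_fmin_lane(-0.0f, 0.0f)));
}
```

Because a single `MINPS` is asymmetric in its operands, a lowering has to combine results from both operand orders and then patch up the NaN lanes, which is presumably where the extra instructions come from.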

I also noticed that the V8 implementation [quiets and clears the NaN payload](https://github.com/v8/v8/blob/19be4913881bb02c5d9b4f1c7547ee2d1273120b/src/compiler/backend/x64/code-generator-x64.cc#L2542). This behavior does not seem to be specified in the spec, but I suspect that it is necessary for passing the spec tests. Is this correct?
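For readers unfamiliar with the terminology, the following sketch shows what "quieting and clearing the payload" means at the bit level for a single f32 lane. It is my own illustration, not V8's instruction sequence: the helper name is hypothetical, and keeping the sign bit is an assumption.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: if `bits` encodes an f32 NaN, set the quiet bit
   (bit 22) and clear the remaining 22 payload bits; otherwise return
   the input unchanged. Assumption: the sign bit is preserved. */
static uint32_t quiet_and_clear_payload(uint32_t bits) {
    const uint32_t sign_mask = 0x80000000u;
    const uint32_t exp_mask  = 0x7F800000u;
    const uint32_t frac_mask = 0x007FFFFFu;
    const uint32_t quiet_bit = 0x00400000u;

    /* NaN: all exponent bits set and a non-zero fraction. */
    int is_nan = (bits & exp_mask) == exp_mask && (bits & frac_mask) != 0;
    if (!is_nan) {
        return bits;
    }
    return (bits & sign_mask) | exp_mask | quiet_bit;
}

static void check_quieting(void) {
    /* A signaling NaN with payload 1 becomes the positive quiet NaN. */
    assert(quiet_and_clear_payload(0x7F800001u) == 0x7FC00000u);
    /* A negative quiet NaN loses its payload but keeps its sign. */
    assert(quiet_and_clear_payload(0xFFC12345u) == 0xFFC00000u);
    /* Non-NaN values, including infinity, pass through untouched. */
    assert(quiet_and_clear_payload(0x7F800000u) == 0x7F800000u);
}
```

The helper works on raw bit patterns rather than `float` values so that passing a signaling NaN through a function call cannot alter it on some platforms.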
