Inefficient x64 codegen for conversion instructions

Certain [SIMD conversions](https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#conversions) seem to have inefficient lowerings in x64. `f32x4.convert_i32x4_u` is lowered to 8 instruction by [v8](https://github.com/v8/v8/blob/19be4913881bb02c5d9b4f1c7547ee2d1273120b/src/compiler/backend/x64/code-generator-x64.cc#L2448-L2464). The signed version, `f32x4.convert_i32x4_s`, on the other hand, is lowered to a [single instruction](https://github.com/v8/v8/blob/19be4913881bb02c5d9b4f1c7547ee2d1273120b/src/compiler/backend/x64/code-generator-x64.cc#L2444-L2447). 

~~I can't find the v8 implementation for `i32x4.trunc_sat_f32x4_s` and `i32x4.trunc_sat_f32x4_u` but I think the situation is the same: the signed version should have a single instruction lowering to `CVTTPS2DQ` and the unsigned version will require some longer sequence.~~ [edit: this is incorrect, see #173 for a more correct discussion of this inefficiency]

The 64x2 versions of these instructions were dropped in #178. For similar reasons (@ngzhian: "because it is uncommon for such instructions to be used, and hardware support is not widespread"), should we remove the unsigned versions?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Inefficient x64 codegen for conversion instructions #190

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Inefficient x64 codegen for conversion instructions #190

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions