x86-32: floating-point return values undergo implicit format conversion

With SSE2 disabled, the floating-point semantics are pretty hopeless, with implicit conversions happening all over the place (e.g. spilling registers to stack). Most of those issues go away with SSE2 enabled, because we use SSE2 instructions and registers for all float/double operations.

The remaining issue with SSE2 enabled, is that the default C ABI requires that float and double values are returned in x87 registers. Returning a float or double value thus converts to x86_fp80 (and then back, in the caller). This conversion means that a signaling NaN cannot be returned, because the behind-the-scenes conversion to x87_fp80 will raise an FP invalid exception, and quiet the NaN.

LLVM does support other ABIs which don't have this problem: you can either use an alternative calling convention on the function (such as "fastcc"), or by annotating the return type with "inreg" (as seen here):
https://github.com/llvm/llvm-project/blob/575a6483062b8a77b35f48589b2acc1020195ac7/llvm/lib/Target/X86/X86CallingConv.td#L300-L304

While this is a fundamental problem with the x86-32 ABI, I believe we _could_ potentially fix it on the LLVM side, without breaking the ABI, because loading/storing an 80-bit value from x87 FPU register does not trigger a conversion operation. Thus, we could potentially write custom conversion routines to go from 32/64-bit float to 80-bit float (and back), and use that at the call boundary.

Such a routine would have runtime overhead vs using the X87 FPU's native conversion support, and it's also unclear whether anyone cares enough about precise x86-32 FP semantics in order to actually bother implementing it. But, it seemed worth at least recording the issue, and a possible resolution.

	// The X86-32 calling convention returns FP values in FP0, unless marked
	// with "inreg" (used here to distinguish one kind of reg from another,
	// weirdly; this is really the sse-regparm calling convention) in which
	// case they use XMM0, otherwise it is the same as the common X86 calling
	// conv.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

x86-32: floating-point return values undergo implicit format conversion #66803

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

x86-32: floating-point return values undergo implicit format conversion #66803

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions