Fix fptrunc Float64 -> Float16 rounding through Float32 #57809

xal-0 · 2025-03-17T23:27:15Z

Widening from Float32 to Float64 and then rounding to Float16 will not introduce any error, but going from Float64 -> Float32 -> Float16 will round incorrectly if the intermediate Float32 is halfway between two Float16s.

Fixes #57805.

Thanks to @vtjnash for suggesting the fix.

Widening from Float32 to Float64 and then rounding (by any method) to Float16 will not introduce any error, but going from Float64 -> Float32 -> Float16 will round incorrectly if the intermediate Float32 is halfway between two Float16s. Fixes JuliaLang#57805. Co-authored-by: Jameson Nash <jameson@juliacomputing.com>

Keno · 2025-03-18T04:19:07Z

src/runtime_intrinsics.c

-#define fintrinsic_write_float16(p, x)  *(uint16_t *)p = float_to_half(x)
-#define fintrinsic_write_bfloat16(p, x) *(uint16_t *)p = float_to_bfloat(x)
+#define fintrinsic_write_float16(p, x)  *(uint16_t *)p = double_to_half(x)
+#define fintrinsic_write_bfloat16(p, x) *(uint16_t *)p = double_to_bfloat(x)


I don't think the same logic applies to bfloat16. Since bfloat16 is just a truncated float32, shouldn't it always be legal to go through Float32? If not, it'd be good to have a test.

According to the double_to_bfloat code, the same correction to the rounding direction is required whenever the float32 (after truncation from float64) is exactly halfway between 2 bfloat16 values, but wasn't required for the subnormal case (as was demonstrated in the original issue for float16)

@vtjnash

Widening from Float32 to Float64 and then rounding to Float16 will not introduce any error, but going from Float64 -> Float32 -> Float16 will round incorrectly if the intermediate Float32 is halfway between two Float16s. Fixes #57805. Thanks to @vtjnash for suggesting the fix. Co-authored-by: Jameson Nash <jameson@juliacomputing.com> (cherry picked from commit a676b12)

xal-0 and others added 2 commits March 17, 2025 16:24

Merge branch 'master' into fptrunc-float16-bugfix

6c5b321

giordano added bugfix This change fixes an existing bug float16 backport 1.10 Change should be backported to the 1.10 release backport 1.11 Change should be backported to release-1.11 backport 1.12 Change should be backported to release-1.12 labels Mar 18, 2025

oscardssmith approved these changes Mar 18, 2025

View reviewed changes

oscardssmith merged commit a676b12 into JuliaLang:master Mar 18, 2025
9 of 12 checks passed

Keno reviewed Mar 18, 2025

View reviewed changes

KristofferC mentioned this pull request Mar 20, 2025

Backports release 1.12 #57536

Merged

KristofferC removed the backport 1.12 Change should be backported to release-1.12 label Mar 24, 2025

KristofferC mentioned this pull request Mar 31, 2025

Backports release 1.11 #57714

Merged

71 tasks

KristofferC mentioned this pull request Apr 25, 2025

Backports for julia 1.11.6 #58224

Open

71 tasks

KristofferC mentioned this pull request Jun 4, 2025

Backports for 1.10.10 #57715

Merged

75 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix fptrunc Float64 -> Float16 rounding through Float32 #57809

Fix fptrunc Float64 -> Float16 rounding through Float32 #57809

Uh oh!

xal-0 commented Mar 17, 2025

Uh oh!

Uh oh!

Keno Mar 18, 2025

Uh oh!

vtjnash Mar 18, 2025

Uh oh!

Uh oh!

Uh oh!

Fix fptrunc Float64 -> Float16 rounding through Float32 #57809

Fix fptrunc Float64 -> Float16 rounding through Float32 #57809

Uh oh!

Conversation

xal-0 commented Mar 17, 2025

Uh oh!

Uh oh!

Keno Mar 18, 2025

Choose a reason for hiding this comment

Uh oh!

vtjnash Mar 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!