[libclc] Enable -ffp-contract=fast-honor-pragmas except for exp/trig/hyperbolic funcs #153137

wenju-he · 2025-08-12T04:51:51Z

Enabling -ffp-contract=fast-honor-pragmas globally improves performance.
Contraction is disabled in functions that may have problems with the flag.


arsenm · 2025-08-12T04:57:10Z

I think fp contract should be globally enabled in the build, and selectively disabled in the handful of places that it is problematic (namely specific blocks in expF, sinbF, and trig reductions)

arsenm · 2025-08-12T04:57:51Z

libclc/CMakeLists.txt

@@ -304,7 +304,7 @@ set_source_files_properties(
  ${CMAKE_CURRENT_SOURCE_DIR}/opencl/lib/generic/math/native_sin.cl
  ${CMAKE_CURRENT_SOURCE_DIR}/opencl/lib/generic/math/native_sqrt.cl
  ${CMAKE_CURRENT_SOURCE_DIR}/opencl/lib/generic/math/native_tan.cl
-  PROPERTIES COMPILE_OPTIONS -fapprox-func
+  PROPERTIES COMPILE_OPTIONS "-fapprox-func;-ffp-contract=fast"


Also maybe should use -ffp-contract=fast-honor-pragmas, not sure if the stupid interpretation ever got fixed for fast
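As I understand the Clang documentation, the modes differ in scope: `on` fuses only within a single statement and honors pragmas, `fast` also fuses across statements and disregards pragmas, and `fast-honor-pragmas` fuses across statements unless a pragma says otherwise. The sketch below writes the same arithmetic two ways to show which form each mode can reach. The function names are hypothetical, and whether fusion actually happens depends on the target and optimization level.

```c
/* One expression: a candidate for fusion under on, fast,
   and fast-honor-pragmas. */
float same_statement(float a, float b, float c) {
    return a * b + c;
}

/* Product and sum in separate statements: left alone by on, but a
   candidate for fusion under fast and fast-honor-pragmas. */
float split_statements(float a, float b, float c) {
    float p = a * b;
    return p + c;
}
```

Both functions agree whenever the product is exactly representable. They can differ in the last bit only when the product needs rounding and the compiler fuses one form but not the other.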


arsenm · 2025-08-12T07:08:42Z

libclc/clc/lib/generic/math/clc_acos.cl

@@ -6,6 +6,8 @@
 //
 //===----------------------------------------------------------------------===//

+#pragma clang fp contract(off)


This can be much more targeted. The problematic areas can be specific block scopes inside of individual functions. I'd suggest running the conformance test with it enabled globally, and then finding the specific places that require this

e.g. in exp f32

-    float e = BUILTIN_RINT_F32(ph);
-    float a = ph - e + pl;
+    float a, e;
+    {
+        #pragma OPENCL FP_CONTRACT OFF
+        e = BUILTIN_RINT_F32(ph);
+        a = ph - e + pl;
+    }
+
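The block-scoped pattern in the exp f32 suggestion has a standard C analogue, sketched here with `#pragma STDC FP_CONTRACT OFF`. This is a generic illustration rather than the libclc change: the function name and the arithmetic are hypothetical. The pragma has to come first in the compound statement, and it applies only until the closing brace. Some compilers ignore it, possibly with a warning.

```c
/* Contraction is disabled only inside the inner braces. */
float scaled_diff_of_squares(float x, float y) {
    float r;
    {
        #pragma STDC FP_CONTRACT OFF
        /* Both products are rounded separately, so x == y gives 0. */
        r = x * x - y * y;
    }
    /* Outside the block the compiler is free to fuse again. */
    return r * 0.5f + 1.0f;
}
```

Keeping the pragma in the smallest enclosing block means the rest of the function still benefits from the global flag, which is the point of the review suggestion.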


Thanks, I'll run the OpenCL CTS on an Intel GPU to find those places.

[libclc] Enable -ffp-contract=fast compile option for math native_* f…

719a691

…unctions According to the OpenCL spec, native_* functions have implementation-defined accuracy and typically have better performance. We can enable floating-point contraction optimizations for them.

wenju-he requested a review from frasercrmck August 12, 2025 04:51

llvmbot added the libclc (libclc OpenCL library) label Aug 12, 2025

wenju-he requested a review from arsenm August 12, 2025 04:52

arsenm added the floating-point (Floating-point math) label Aug 12, 2025

arsenm reviewed Aug 12, 2025


enable -ffp-contract=fast-honor-pragmas globally, add contract(off) t…

5bff222

…o exponential/trigonometric/hyperbolic funcs

wenju-he changed the title ~~[libclc] Enable -ffp-contract=fast compile option for math native_* functions~~ [libclc] Enable -ffp-contract=fast-honor-pragmas except for exp/trig/hyperbolic funcs Aug 12, 2025

arsenm reviewed Aug 12, 2025


wenju-he marked this pull request as draft August 12, 2025 07:46
