Integrate cpp_double_fp_backend #648

Closed
ckormanyos wants to merge 660 commits

Conversation

ckormanyos
Member

No description provided.

sinandredemption and others added 30 commits August 23, 2021 00:02
@cosurgi
Contributor

cosurgi commented Jan 17, 2025

I posted the latest YADE benchmark results in BoostGSoC21#190; suddenly it starts to look good with clang.
(initially I posted this here, but then I moved this post over there)

@ckormanyos
Member Author

Note to self (TODO): hit the edge cases of the new eval_pow method.
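
For reference, a minimal sketch (my own, not code from this PR) of the kind of edge cases meant here, exercised through the front-end pow() on cpp_double_double rather than by calling eval_pow() directly. The include path is an assumption for the integration branch.

```cpp
// Hypothetical sketch only: edge-case checks for pow() on cpp_double_double.
// Assumes the cpp_double_fp_backend_integration branch is on the include path;
// the exact header name is an assumption.
#include <boost/multiprecision/cpp_double_fp.hpp>

#include <iostream>
#include <limits>

int main()
{
   using boost::multiprecision::cpp_double_double;

   const cpp_double_double zero { 0 };
   const cpp_double_double inf  { std::numeric_limits<cpp_double_double>::infinity() };

   // pow(x, 0) should be exactly 1, even for x == 0 and x == inf.
   std::cout << pow(zero, zero) << '\n';
   std::cout << pow(inf, zero)  << '\n';

   // pow(0, y) with y < 0 should go to +inf.
   std::cout << pow(zero, cpp_double_double { -1 }) << '\n';

   // A negative base with a non-integral exponent should yield a NaN.
   std::cout << pow(cpp_double_double { -2 }, cpp_double_double { 0.5 }) << '\n';
}
```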

@ckormanyos
Member Author

Performance of algebraic functions re-affirmed in BoostGSoC21#190

@cosurgi
Contributor

cosurgi commented Jan 18, 2025

OK, so the bad-performance mystery has been solved. I ran the YADE benchmark `yade -n --quickperformance -j 4` on a fairly recent CPU, an Intel i7-14700KF, and the results are good. Some are interesting. We can definitely mark the performance problem of the cpp_double_fp_backend as solved. Now only the compiler developers will have something to talk about :)

Here are the results:

cpp_double_double

| type | compiler | calculation speed | factor |
| --- | --- | --- | --- |
| cpp_double_double | g++ 12.2 | 449.15 iter/sec | 1 |
| float128 | g++ 12.2 | 263.15 iter/sec | 1.70 |
| cpp_bin_float<32> | g++ 12.2 | 211.81 iter/sec | 2.12 |
| cpp_dec_float<31> | g++ 12.2 | 78.15 iter/sec | 5.74 |
| mpfr_float_backend<31> | g++ 12.2 | 51.01 iter/sec | 8.80 |

Here we can see that cpp_double_double beats float128 by a factor of 1.7 and everything else by more than a factor of two.

cpp_double_long_double

| type | compiler | calculation speed | factor |
| --- | --- | --- | --- |
| cpp_bin_float<39> | g++ 12.2 | 122.55 iter/sec | 1 |
| cpp_double_long_double | clang++ 19.1.4 | 108.79 iter/sec | 1.12 |
| cpp_bin_float<39> | clang++ 19.1.4 | 102.19 iter/sec | 1.20 |
| cpp_dec_float<39> | g++ 12.2 | 71.42 iter/sec | 1.71 |
| mpfr_float_backend<39> | g++ 12.2 | 45.75 iter/sec | 2.67 |
| cpp_double_long_double | g++ 12.2 | 14.97 iter/sec | 8.18 |

Here we can see that cpp_double_long_double performs very well. But the compiler developers will have a mystery to solve: cpp_bin_float<39> with g++ 12.2 is faster than cpp_double_long_double with clang++ 19.1.4 by just a little, which in turn is faster than cpp_double_long_double with g++ 12.2 by roughly a factor of 7.

cpp_double_float128

| type | compiler | calculation speed | factor |
| --- | --- | --- | --- |
| cpp_bin_float<67> | g++ 12.2 | 118.43 iter/sec | 1 |
| mpfr_float_backend<67> | g++ 12.2 | 43.34 iter/sec | 2.73 |
| cpp_dec_float<67> | g++ 12.2 | 40.09 iter/sec | 2.95 |
| cpp_double_float128 | g++ 12.2 | 14.99 iter/sec | 7.90 |

Here we can see that cpp_double_float128 has a lot of potential to beat cpp_bin_float<67> once the g++ developers sort out the problems seen with cpp_double_long_double on g++ 12.2. The increase in performance should be by about a factor of 8 :)

So all is good. I think we can merge this branch once documentation and other small TODOs are complete.

@ckormanyos
Member Author

ckormanyos commented Jan 19, 2025

> We can definitely mark the performance problem of the cpp_double_fp_backend as solved.

Thank you, Janek (@cosurgi), that was a big effort, and it really provided a lot of information and clarity.

Some of the results on cpp_double_long_double, where long double is the 80-bit, 10-byte type, are interesting. That 10-byte floating-point representation runs on the legendary (modernized) descendants of the i387 FPU, the hardware that really put 10-byte floating-point on the map.

The newer i7 processors have extremely powerful 64-bit floating-point hardware, and it seems like this is very well supported nowadays in both hardware and software.

Down the road I will be doing some non-x86_64 measurements on M1 and/or M2 and on a few embedded bare-metal controllers, such as an ARM(R) Cortex(R)-M7 with double-precision FPU support.

All in all, I'm somewhat surprised at how fast cpp_double_double ended up being in certain hardware/software configurations. As mentioned in previous posts, this backend (and of course that type specifically) has lots of room for optimization.

I'm happy enough with it to make a first release out of this state.

Cc: @sinandredemption and @jzmaddock

@jzmaddock
Collaborator

There might be one more thing to check: that each of the backend/compiler configurations is doing (roughly) the same amount of work. Something that can happen when a tolerance is set for termination is that you hit "unfortunate" parameters which cause the code to thrash through many needless iterations that don't actually get you any closer to the end result. I have no idea if this is the case here, but because types like double-double don't behave quite like exactly rounded IEEE types, they can easily break assumptions present in the code.

@ckormanyos
Member Author

ckormanyos commented Jan 19, 2025

> There might be one more thing to check: that each of the backend/compiler configurations is doing (roughly) the same amount of work. Something that can happen when a tolerance is set for termination is that you hit "unfortunate" parameters which cause the code to thrash through many needless iterations that don't actually get you any closer to the end result.

Indeed. There are several potential dangers.

Let's say we use cpp_double_double and a particular tolerance on the iteration step $dx$ is set to

$$ |dx| < 10^{-300} $$

At the same time, we know that min_exponent10 for the type is something like $-291$. So the tolerance is either never reached, or only reached after useless iterations.
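
As an illustration only (this is not code from the PR or from Boost.Math), consider a plain Newton iteration with a tolerance guard and an iteration cap; if the tolerance sits below what the type can represent, the convergence test can never fire and every call burns the full iteration budget:

```cpp
#include <cmath>
#include <iostream>
#include <limits>

// Hypothetical sketch: a Newton iteration for sqrt(a) over a generic
// floating-point-like type T (for example cpp_double_double). A tolerance
// below what T can represent means the convergence test never succeeds,
// so the loop always falls through to the iteration cap.
template <typename T>
T newton_sqrt(const T& a, const T& tol, unsigned max_iter = 64U)
{
   using std::fabs;

   T x = a / 2;

   for (unsigned i = 0U; i < max_iter; ++i)
   {
      const T dx = (x * x - a) / (2 * x);

      x -= dx;

      if (fabs(dx) < tol)
      {
         break; // Converged. This never fires if tol underflows the type.
      }
   }

   return x;
}

int main()
{
   // A sensible tolerance, a few ulps above epsilon: converges in a handful of steps.
   std::cout << newton_sqrt(2.0, 8 * std::numeric_limits<double>::epsilon()) << '\n';

   // A tolerance the type cannot represent (for cpp_double_double, think 1e-300
   // against a min_exponent10 of about -291): the loop always runs to max_iter.
   std::cout << newton_sqrt(2.0, 0.0) << '\n';
}
```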

Even worse, this backend is new, so there might be undiscovered problems in the subnormal/zero range. In that case you might iterate until the maximum iteration count is hit.

We actually had several cases like this when John helped me see through the last tricky spots in the specfun tests. Who knows if we really got all the edge cases?

Cc: @jzmaddock and @cosurgi

@ckormanyos
Member Author

ckormanyos commented Apr 19, 2025

So I have updated this PR to the post-1.88 develop branch of Multiprecision. And it's going green once again (with the updated CI). There were a few trip-ups along the way, but nothing out of the ordinary.

It's time to continue working the known final points in BoostGSoC21/multiprecision/issues/160.

I'm not sure if all this will be ready for 1.89, but there is a chance.

Cc: @cosurgi and @jzmaddock and @sinandredemption

Contributor

@LegalizeAdulthood left a comment

Just some mild observations while we're waiting for this to land.

@ckormanyos
Member Author

ckormanyos commented Jun 18, 2025

Hi @LegalizeAdulthood, thank you for your review points. I will get to these. Your points seem sensible.

It's been a while in the making, but I already use this backend locally, and it seriously accelerates perturbative Mandelbrot calculations, by about a factor of 3. No other backend that I am aware of kicks it like this one: double-double at about 32 digits.

I do not know if/when I'll get this Boost-ready, but I'm still on it.

Cc: @jzmaddock

@ckormanyos
Member Author

Hi @LegalizeAdulthood I'll leave all the conversations open for now. I need to get back to these a bit later. Thanks again for contributing.

@LegalizeAdulthood
Contributor

LegalizeAdulthood commented Jun 18, 2025

> I already use this backend locally, and it seriously accelerates perturbative Mandelbrot calculations, by about a factor of 3. No other backend that I am aware of kicks it like this one

Nice! My friend integrated the QD library into his ManPWin and I think he reported a significant speedup as well.

I think that currently, at least for open-source fractal renderers, kalles fraktaler 3 is one of the fastest out there, if not the fastest. He has SIMD and GPU (OpenCL) paths; I haven't studied the code extensively enough to know the full details, though.

If I can be of assistance in helping this pull request be accepted, let me know. It will ultimately help me too :)

@ckormanyos
Member Author

> If I can be of assistance in helping this pull request be accepted, let me know.

Hi Richard (@LegalizeAdulthood), if you get a chance, you could consider using the cpp_double_fp_backend. Boost.Multiprecision is header-only, so if you check out the cpp_double_fp_backend_integration branch, you can immediately use classes such as boost::multiprecision::cpp_double_double.
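
For the record, a minimal usage sketch (my own, not from the PR text); the header name below is an assumption for the integration branch:

```cpp
// Minimal sketch: using the double-double type from the integration branch.
// Boost.Multiprecision is header-only, so having the branch on the include
// path is enough; the exact header name here is an assumption.
#include <boost/multiprecision/cpp_double_fp.hpp>

#include <iomanip>
#include <iostream>
#include <limits>

int main()
{
   using boost::multiprecision::cpp_double_double;

   const cpp_double_double x { 2 };

   // Roughly 32 decimal digits of working precision built from two doubles.
   std::cout << std::setprecision(std::numeric_limits<cpp_double_double>::digits10)
             << sqrt(x) << '\n';
}
```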

Other than that, I think there are still some edge cases in rounding and round-tripping; those are the only open points left. I deactivated many of the rounding and round-tripping tests, and I think these should pass. I'll need to talk with John about these sometime down the road.

@ckormanyos
Member Author

> Just some mild observations while we're waiting for this to land.

Handled in 748b751

@ckormanyos closed this Jun 19, 2025
@ckormanyos
Member Author

Redundant with #515
