[APFloat] Fix literals with long significands. #102051

keinflue · 2024-08-05T20:04:40Z

This is my first contribution attempt to LLVM. Please let me know of any issues.

The function powerOf5 is called (only) by convertFromDecimalString to calculate 5^m where m is sum of the length of the significand and the magnitude of the (normalized) exponent.

powerOf5 previously used fixed-sized arrays to store its (intermediate) results. Their size was chosen large enough to hold the sum of a maximum exponent and a maximum precision. However, this didn't account for literals with long overspecifying significands.

This PR replaces the arrays with SmallVectors to also support these cases.

The new definitive limit for the length of the sum of significand and exponent is UINT_MAX/815 (~5M) due to an overflow in the length calculation for the SmallVectors.

However, processing time will likely be the practical limitation.

Open questions:

SmallVector's inline size is chosen as 0, because the vectors will usually not be that small. Alternatives: Use the previous fixed-size array length or some value corresponding to short common literals.
The new limit mentioned above is asserted. Is error reporting necessary instead?
The exponent bounds of calcSemantics are replaced with the largest possible range. I am not sure whether this is allowed.
The test case is still missing, because I am not sure whether the long literal from the linked issue should be embedded in a unit test or whether another kind of test would be preferred.

I compiled and verified ninja check-llvm, ninja check-llvm-unit and ninja check-clang on Linux x86-64.

Fixes #28406

Replace fixed-size arrays in powerOf5 with SmallVector to support significands longer than about 10000 digits. The new limit should be around UINT_MAX/815, practically limited by processing time. Fixes llvm#28406

github-actions · 2024-08-05T20:05:01Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be
notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write
permissions for the repository. In which case you can instead tag reviewers by
name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review
by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate
is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2024-08-05T20:05:39Z

@llvm/pr-subscribers-llvm-adt

@llvm/pr-subscribers-llvm-support

Author: None (keinflue)

Changes

This is my first contribution attempt to LLVM. Please let me know of any issues.

The function powerOf5 is called (only) by convertFromDecimalString to calculate 5^m where m is sum of the length of the significand and the magnitude of the (normalized) exponent.

powerOf5 previously used fixed-sized arrays to store its (intermediate) results. Their size was chosen large enough to hold the sum of a maximum exponent and a maximum precision. However, this didn't account for literals with long overspecifying significands.

This PR replaces the arrays with SmallVectors to also support these cases.

The new definitive limit for the length of the sum of significand and exponent is UINT_MAX/815 (~5M) due to an overflow in the length calculation for the SmallVectors.

However, processing time will likely be the practical limitation.

Open questions:

SmallVector's inline size is chosen as 0, because the vectors will usually not be that small. Alternatives: Use the previous fixed-size array length or some value corresponding to short common literals.
The new limit mentioned above is asserted. Is error reporting necessary instead?
The exponent bounds of calcSemantics are replaced with the largest possible range. I am not sure whether this is allowed.
The test case is still missing, because I am not sure whether the long literal from the linked issue should be embedded in a unit test or whether another kind of test would be preferred.

I compiled and verified ninja check-llvm, ninja check-llvm-unit and ninja check-clang on Linux x86-64.

Fixes #28406

Full diff: https://github.com/llvm/llvm-project/pull/102051.diff

1 Files Affected:

(modified) llvm/lib/Support/APFloat.cpp (+51-41)

diff --git a/llvm/lib/Support/APFloat.cpp b/llvm/lib/Support/APFloat.cpp
index 7f68c5ab9b7cf..0f91953cca301 100644
--- a/llvm/lib/Support/APFloat.cpp
+++ b/llvm/lib/Support/APFloat.cpp
@@ -18,6 +18,7 @@
 #include "llvm/ADT/FoldingSet.h"
 #include "llvm/ADT/Hashing.h"
 #include "llvm/ADT/STLExtras.h"
+#include "llvm/ADT/SmallVector.h"
 #include "llvm/ADT/StringExtras.h"
 #include "llvm/ADT/StringRef.h"
 #include "llvm/Config/llvm-config.h"
@@ -27,6 +28,7 @@
 #include "llvm/Support/raw_ostream.h"
 #include <cstring>
 #include <limits.h>
+#include <limits>
 
 #define APFLOAT_DISPATCH_ON_SEMANTICS(METHOD_CALL)                             \
   do {                                                                         \
@@ -308,24 +310,6 @@ constexpr RoundingMode APFloatBase::rmTowardNegative;
 constexpr RoundingMode APFloatBase::rmTowardZero;
 constexpr RoundingMode APFloatBase::rmNearestTiesToAway;
 
-/* A tight upper bound on number of parts required to hold the value
-   pow(5, power) is
-
-     power * 815 / (351 * integerPartWidth) + 1
-
-   However, whilst the result may require only this many parts,
-   because we are multiplying two values to get it, the
-   multiplication may require an extra part with the excess part
-   being zero (consider the trivial case of 1 * 1, tcFullMultiply
-   requires two parts to hold the single-part result).  So we add an
-   extra one to guarantee enough space whilst multiplying.  */
-const unsigned int maxExponent = 16383;
-const unsigned int maxPrecision = 113;
-const unsigned int maxPowerOfFiveExponent = maxExponent + maxPrecision - 1;
-const unsigned int maxPowerOfFiveParts =
-    2 +
-    ((maxPowerOfFiveExponent * 815) / (351 * APFloatBase::integerPartWidth));
-
 unsigned int APFloatBase::semanticsPrecision(const fltSemantics &semantics) {
   return semantics.precision;
 }
@@ -763,25 +747,44 @@ ulpsFromBoundary(const APFloatBase::integerPart *parts, unsigned int bits,
 
 /* Place pow(5, power) in DST, and return the number of parts used.
    DST must be at least one part larger than size of the answer.  */
-static unsigned int
-powerOf5(APFloatBase::integerPart *dst, unsigned int power) {
+static SmallVector<APFloatBase::integerPart, 0> powerOf5(unsigned int power) {
   static const APFloatBase::integerPart firstEightPowers[] = { 1, 5, 25, 125, 625, 3125, 15625, 78125 };
-  APFloatBase::integerPart pow5s[maxPowerOfFiveParts * 2 + 5];
+
+  // Make sure that the following expression does not overflow.
+  assert(power <= UINT_MAX / 815);
+
+  // A tight upper bound on number of parts required to hold the value
+  // pow(5, power) is
+  //
+  //   power * 815 / (351 * integerPartWidth) + 1
+  //
+  // However, whilst the result may require only this many parts,
+  // because we are multiplying two values to get it, the
+  // multiplication may require an extra part with the excess part
+  // being zero (consider the trivial case of 1 * 1, tcFullMultiply
+  // requires two parts to hold the single-part result).  So we add an
+  // extra one to guarantee enough space whilst multiplying.
+  const unsigned int maxPowerOfFiveParts =
+      2 + ((power * 815) / (351 * APFloatBase::integerPartWidth));
+
+  SmallVector<APFloatBase::integerPart, 0> pow5s(maxPowerOfFiveParts * 2 + 5);
   pow5s[0] = 78125 * 5;
 
   unsigned int partsCount = 1;
-  APFloatBase::integerPart scratch[maxPowerOfFiveParts], *p1, *p2, *pow5;
-  unsigned int result;
-  assert(power <= maxExponent);
+  SmallVector<APFloatBase::integerPart, 0> scratch(maxPowerOfFiveParts);
+  APFloatBase::integerPart *p1, *p2, *pow5;
+  unsigned int resultSize;
+
+  SmallVector<APFloatBase::integerPart, 0> dst(maxPowerOfFiveParts);
 
-  p1 = dst;
-  p2 = scratch;
+  p1 = dst.data();
+  p2 = scratch.data();
 
   *p1 = firstEightPowers[power & 7];
   power >>= 3;
 
-  result = 1;
-  pow5 = pow5s;
+  resultSize = 1;
+  pow5 = pow5s.data();
 
   for (unsigned int n = 0; power; power >>= 1, n++) {
     /* Calculate pow(5,pow(2,n+3)) if we haven't yet.  */
@@ -796,10 +799,10 @@ powerOf5(APFloatBase::integerPart *dst, unsigned int power) {
     if (power & 1) {
       APFloatBase::integerPart *tmp;
 
-      APInt::tcFullMultiply(p2, p1, pow5, result, partsCount);
-      result += partsCount;
-      if (p2[result - 1] == 0)
-        result--;
+      APInt::tcFullMultiply(p2, p1, pow5, resultSize, partsCount);
+      resultSize += partsCount;
+      if (p2[resultSize - 1] == 0)
+        resultSize--;
 
       /* Now result is in p1 with partsCount parts and p2 is scratch
          space.  */
@@ -811,10 +814,14 @@ powerOf5(APFloatBase::integerPart *dst, unsigned int power) {
     pow5 += partsCount;
   }
 
-  if (p1 != dst)
-    APInt::tcAssign(dst, p1, result);
+  if (p1 != dst.data())
+    APInt::tcAssign(dst.data(), p1, resultSize);
 
-  return result;
+  // Truncate to the actually used elements from the initial worst-case sizing
+  // during initialization.
+  dst.truncate(resultSize);
+
+  return dst;
 }
 
 /* Zero at the end to avoid modular arithmetic when adding one; used
@@ -2950,9 +2957,9 @@ IEEEFloat::opStatus
 IEEEFloat::roundSignificandWithExponent(const integerPart *decSigParts,
                                         unsigned sigPartCount, int exp,
                                         roundingMode rounding_mode) {
-  unsigned int parts, pow5PartCount;
-  fltSemantics calcSemantics = { 32767, -32767, 0, 0 };
-  integerPart pow5Parts[maxPowerOfFiveParts];
+  unsigned int parts;
+  fltSemantics calcSemantics = {std::numeric_limits<ExponentType>::max(),
+                                std::numeric_limits<ExponentType>::min(), 0, 0};
   bool isNearest;
 
   isNearest = (rounding_mode == rmNearestTiesToEven ||
@@ -2960,8 +2967,11 @@ IEEEFloat::roundSignificandWithExponent(const integerPart *decSigParts,
 
   parts = partCountForBits(semantics->precision + 11);
 
+  // Make sure that abs(exp) is representable.
+  assert(exp > INT_MIN);
+
   /* Calculate pow(5, abs(exp)).  */
-  pow5PartCount = powerOf5(pow5Parts, exp >= 0 ? exp: -exp);
+  SmallVector<integerPart, 0> pow5Parts = powerOf5(exp >= 0 ? exp : -exp);
 
   for (;; parts *= 2) {
     opStatus sigStatus, powStatus;
@@ -2977,8 +2987,8 @@ IEEEFloat::roundSignificandWithExponent(const integerPart *decSigParts,
 
     sigStatus = decSig.convertFromUnsignedParts(decSigParts, sigPartCount,
                                                 rmNearestTiesToEven);
-    powStatus = pow5.convertFromUnsignedParts(pow5Parts, pow5PartCount,
-                                              rmNearestTiesToEven);
+    powStatus = pow5.convertFromUnsignedParts(
+        pow5Parts.data(), pow5Parts.size(), rmNearestTiesToEven);
     /* Add exp, as 10^n = 5^n * 2^n.  */
     decSig.exponent += exp;

keinflue · 2024-08-16T21:11:18Z

Ping

topperc · 2024-08-17T00:38:03Z

Can you add a test to unittests/ADT/APFloatTest.cpp for a case that would have failed previously?

llvm/lib/Support/APFloat.cpp

keinflue · 2024-09-01T17:17:23Z

ping

krzysz00 · 2024-09-03T19:05:59Z

llvm/lib/Support/APFloat.cpp

+  const unsigned int maxPowerOfFiveParts =
+      2 + ((power * 815) / (351 * APFloatBase::integerPartWidth));
+
+  SmallVector<APFloatBase::integerPart, 0> pow5s(maxPowerOfFiveParts * 2 + 5);


, 0> is a heap allocation, I don't think that's what you want?

Either , 1> or just not specifying it

The actual minimum size required (see lines above) is 9 and I think SmallVector's default is smaller. It increases with both the length of the significand of the floating point literal as well as its exponent. I used 0 because I have no idea what a good estimation for typical floating point literals is. The old code used a fixed size array of roughly length 1200. I could of course use that, but felt that it was a bit much waste of stack space for the typical use case. All in all the arrays took roughly 14kB stack.

krzysz00 · 2024-09-03T19:07:18Z

llvm/lib/Support/APFloat.cpp

  bool isNearest;

  isNearest = (rounding_mode == rmNearestTiesToEven ||
               rounding_mode == rmNearestTiesToAway);

  parts = partCountForBits(semantics->precision + 11);

+  // Make sure that abs(exp) is representable.
+  assert(exp > INT_MIN);


Why's this an assert()? Shouldn't it fail in some interesting way?

Also prefer std::numeric_limits

Why's this an assert()? Shouldn't it fail in some interesting way?

With the limited range of supported exponents this could only happen at this point (if at all) if the significand is almost INT_MAX digits long. That's an edge case that I haven't looked at in more detail. I expect that this will already fail earlier. In any case, it is not really relevant for the functionality in this PR. I simply added it when I saw the std::abs-type construct following it as a default precaution to catch the usual UB edge case. I can remove it again as well.

In principle the function should report this as failure and on the Clang side a diagnostic about implementation limits should be produced - for the new assert in power5 as well - but I wasn't sure whether that would be in scope for this bug fix, since there was no error reporting for the implementation limit beforehand either. I simply increase the implied limit without fixing the lack of error reporting to keep the scope of this first contribution small. Let me know if you'd like to see that fixed as par of this PR as well.

Also prefer std::numeric_limits

Will do. I kept using the macros in style with the file. If it wasn't for that, I would have always preferred std::numeric_limits.

keinflue

Thank you for the review.

keinflue · 2024-09-03T20:47:21Z

llvm/lib/Support/APFloat.cpp

+  const unsigned int maxPowerOfFiveParts =
+      2 + ((power * 815) / (351 * APFloatBase::integerPartWidth));
+
+  SmallVector<APFloatBase::integerPart, 0> pow5s(maxPowerOfFiveParts * 2 + 5);


The actual minimum size required (see lines above) is 9 and I think SmallVector's default is smaller. It increases with both the length of the significand of the floating point literal as well as its exponent. I used 0 because I have no idea what a good estimation for typical floating point literals is. The old code used a fixed size array of roughly length 1200. I could of course use that, but felt that it was a bit much waste of stack space for the typical use case. All in all the arrays took roughly 14kB stack.

keinflue · 2024-09-03T21:11:30Z

llvm/lib/Support/APFloat.cpp

  bool isNearest;

  isNearest = (rounding_mode == rmNearestTiesToEven ||
               rounding_mode == rmNearestTiesToAway);

  parts = partCountForBits(semantics->precision + 11);

+  // Make sure that abs(exp) is representable.
+  assert(exp > INT_MIN);


Why's this an assert()? Shouldn't it fail in some interesting way?

With the limited range of supported exponents this could only happen at this point (if at all) if the significand is almost INT_MAX digits long. That's an edge case that I haven't looked at in more detail. I expect that this will already fail earlier. In any case, it is not really relevant for the functionality in this PR. I simply added it when I saw the std::abs-type construct following it as a default precaution to catch the usual UB edge case. I can remove it again as well.

In principle the function should report this as failure and on the Clang side a diagnostic about implementation limits should be produced - for the new assert in power5 as well - but I wasn't sure whether that would be in scope for this bug fix, since there was no error reporting for the implementation limit beforehand either. I simply increase the implied limit without fixing the lack of error reporting to keep the scope of this first contribution small. Let me know if you'd like to see that fixed as par of this PR as well.

Also prefer std::numeric_limits

Will do. I kept using the macros in style with the file. If it wasn't for that, I would have always preferred std::numeric_limits.

[APFloat] Fix literals with long significands.

4074828

Replace fixed-size arrays in powerOf5 with SmallVector to support significands longer than about 10000 digits. The new limit should be around UINT_MAX/815, practically limited by processing time. Fixes llvm#28406

llvmbot added the llvm:support label Aug 5, 2024

topperc reviewed Aug 17, 2024

View reviewed changes

llvm/lib/Support/APFloat.cpp Outdated Show resolved Hide resolved

fixup! [APFloat] Fix literals with long significands.

52d1b9a

llvmbot added the llvm:adt label Aug 17, 2024

[APFloat] Add unit test case for llvm#28406.

e5cb5e7

keinflue force-pushed the fix-28406 branch from ffd69ee to e5cb5e7 Compare August 17, 2024 19:36

keinflue requested a review from topperc August 18, 2024 19:19

kuhar requested a review from krzysz00 September 2, 2024 13:56

krzysz00 reviewed Sep 3, 2024

View reviewed changes

keinflue commented Sep 3, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[APFloat] Fix literals with long significands. #102051

[APFloat] Fix literals with long significands. #102051

keinflue commented Aug 5, 2024

github-actions bot commented Aug 5, 2024

llvmbot commented Aug 5, 2024 •

edited

Loading

keinflue commented Aug 16, 2024

topperc commented Aug 17, 2024

keinflue commented Sep 1, 2024

krzysz00 Sep 3, 2024

keinflue Sep 3, 2024

krzysz00 Sep 3, 2024

kuhar Sep 3, 2024

keinflue Sep 3, 2024

keinflue left a comment

keinflue Sep 3, 2024

keinflue Sep 3, 2024

[APFloat] Fix literals with long significands. #102051

Are you sure you want to change the base?

[APFloat] Fix literals with long significands. #102051

Conversation

keinflue commented Aug 5, 2024

github-actions bot commented Aug 5, 2024

llvmbot commented Aug 5, 2024 • edited Loading

keinflue commented Aug 16, 2024

topperc commented Aug 17, 2024

keinflue commented Sep 1, 2024

krzysz00 Sep 3, 2024

Choose a reason for hiding this comment

keinflue Sep 3, 2024

Choose a reason for hiding this comment

krzysz00 Sep 3, 2024

Choose a reason for hiding this comment

kuhar Sep 3, 2024

Choose a reason for hiding this comment

keinflue Sep 3, 2024

Choose a reason for hiding this comment

keinflue left a comment

Choose a reason for hiding this comment

keinflue Sep 3, 2024

Choose a reason for hiding this comment

keinflue Sep 3, 2024

Choose a reason for hiding this comment

llvmbot commented Aug 5, 2024 •

edited

Loading