-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
LAPACK test fails with Error code from DDRGES3 = 9
on AMD Genoa
#4032
Comments
Can you please re-test with the |
Using the newly released GCC 13 could also be an option - at least according to my first tests, it appears to have fixed the tree-optimizer bugs that had me put the pragmas in the known affected source files. |
Thanks for the suggestions: tried the |
Some progress: building with GCC 13.1.0 reduces the number of numerical errors to 26, all in the COMPLEX (4) or COMPLEX16 (21) tests. However, the error with code 9 in DDRGES3 remains. Will try development branch on Monday then. |
Also related : Reference-LAPACK/lapack#744 and Reference-LAPACK/lapack#475 (the latter was supposed to be fixed by Reference-LAPACK/lapack#477 but this appears to be rather fragile code with a long history of odd and fleeting convergence problems) Note also how the reported result is always a dramatic 4.5E+15 |
Test summary from building with GCC 13.1 and
There is no difference between I also changed
but this also had no notable effect. Surprisingly to me, changing the overall optimization from |
Yes, the |
Thanks, I will ignore the errors for now and will keep an eye on the future releases of GCC and OpenBLAS. |
We're seeing this same problem (1 failing test due to a non-numerical issue, "
Is there an easy way to selectively disable this particular test, to avoid blindly ignoring other failing tests which do signal a problem worth looking into? |
probably by editing lapack-netlib/TESTING/dgg.in to either disable all these tests or removing the parameter(s) of the offending one - have not confirmed this though. Interesting that it would happen with the generic build as well, where there is no FMA optimization beyond what the compiler does, and only some loop unrolling (assuming easybuild's "generic" corresponds to TARGET=GENERIC in OpenBLAS) |
sorry, dgd.in not dgg - and specifically remove the "6" from the first list of matrix dimensions in line 3 of that file (6 eigenvalues + error code 3 => INFO=9) |
@martin-frbg Seems like that worked like a charm, see easybuilders/easybuild-easyconfigs#18887 + EESSI/software-layer#334 (comment) in which we're retrying the build of OpenBLAS 0.3.23 with the patch included. |
System:
Build and Test Commands:
Building and testing OpenBLAS-0.3.23 (using Easybuild) with the following commands:
Test results:
make tests
completes without error, LAPACK tests return summary:I think the other error is coming from
All details are in testing_results.txt
Questions:
How can I get this other error resolved?
And should I worry about the 269 tests with numerical errors?
The text was updated successfully, but these errors were encountered: