[enh] Add intel compiler support #131

icfaust · 2024-02-14T09:32:31Z

Macro definition for X86_SIMD_SORT_UNROLL_LOOP will break x86-simd-sort when compiling with icc as they define __GNUC__. This introduces a check for an Intel compiler by looking if __INTEL_COMPILER has been set, and sets the appropriate pragma (not done for __INTEL_LLVM_COMPILER due to compiling performance issues).

There is also an issue with avx2 where icc will complain that there isn't a return type for avx2_vector for 32 and 64 bit. The compile time evaluation and static assert are placed ahead of a generalized return, which shouldn't change the logic or runtime. I definitely recommend the maintainers to evaluate those changes if they make sense (should the static assert require a previous constexpr check?)

Intel LLVM C++ compilers do not define __builtin_cpu_supports("avx512fp16") which requires certain code not to be run when using icpx on sapphirerapids ISA-capable machines.

Testing uses the intel apt repository for getting the icpx compiler. The GPG key can be added, but that will add something that must be updated in the future/ requires future maintenance. Suggestions are welcome.

Meson from apt is out of date enough that it doesn't know of the icpx compiler. The pip version is up to date and is instead used in the new additional CI job.

Tasks

Implement icc-specific changes
Implement icpx-specific changes
Add tests for Intel LLVM C++ compiler
Pass tests

icfaust · 2024-02-14T20:58:07Z

I will reach out to compiler development about __builtin_cpu_supports("avx512fp16") issues with icpx

icfaust · 2024-02-15T10:29:07Z

So I had erroneously set the intel compiler check within an if statement for __GNUC__ which wouldn't actually be reached for icc and icpx. When made available to those compilers, the compile times for icpx skyrocketed both locally and in CI (to timeout). It will compile but takes significant time. I have removed loop unrolling for icpx, and have returned the new CI to the previous state. This will allow for CI to properly run, and will be investigated if it is a compiler issue on intel's side.

icfaust · 2024-02-15T14:52:42Z

Local testing shows that icpx/icc do not support __builtin_cpu_supports("avx512fp16"), even if icpx supports __FLT16_MAX__/ _Float16 types. ICC just doesn't support __FLT16_MAX__ at all, so isn't a problem. I will recondition the requisite check. This is a compiler problem (not even available in most recent builds).

r-devulap

LGTM. Thank you for adding CI as well.

icfaust added 10 commits February 14, 2024 10:27

Update xss-common-includes.h

99782e9

Update xss-common-includes.h

10c66ad

Update avx2-32bit-qsort.hpp

31e7dc9

Update avx2-64bit-qsort.hpp

120fd38

add docker CI test using intel basekit

b179b60

generalize python

bf3d055

forgot a then

e2fa52b

pip issues

440231b

unable to find pip

5bedf66

set ffmath optimizations for icx and isNaN support

6ee04eb

icfaust changed the title ~~[enh] Add icc compiler support~~ [enh] Add intel compiler support Feb 14, 2024

icfaust added 19 commits February 14, 2024 05:04

test to see if cxxflags is getting set

3f77158

bash spacing issue correction

98cc321

small comment out to test overall status

0f3da6a

may have incorrectly called the compiler

1d9a374

see if linking to /usr/bin causes the problem

5b1ff41

bad clang format

1470014

EOF

b4313fe

weirdness with file movement in docker

f7dc230

reverted changes to docker shell script

910b123

move away from docker

e49fe20

rename back to 32bit

11136c8

missing the intel apt-get repository

9902a7f

dev to deb

7107413

deal with key problem with override

771ff88

add sources

a17d54b

switch to basekit

8ff93f6

allow unauthenticated

a94511b

add setvars.sh

eeee62a

move sourcing

35af511

icfaust added 8 commits February 14, 2024 09:30

add cxxflags to examples

0e4097c

wrong parantheses

6c87b15

remove source

c33402a

add check

614f2a2

add source again

bc23e06

out of date meson check

bacbc83

Update c-cpp.yml

15194a2

Update c-cpp.yml

210890b

icfaust added 4 commits February 14, 2024 23:42

__GNUC__ defined to be 4 for icc/icpx

9f74bf1

remove examples due to timeout

0c672f8

readd examples, remove icpx loop unrolling

7838cf7

forgot to remove comment out

3c8911f

Update x86simdsort.cpp

75b1731

r-devulap approved these changes Feb 26, 2024

View reviewed changes

r-devulap merged commit 87955f6 into numpy:main Feb 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

[enh] Add intel compiler support #131

[enh] Add intel compiler support #131

Uh oh!

icfaust commented Feb 14, 2024 •

edited

Loading

Uh oh!

icfaust commented Feb 14, 2024

Uh oh!

icfaust commented Feb 15, 2024 •

edited

Loading

Uh oh!

icfaust commented Feb 15, 2024 •

edited

Loading

Uh oh!

r-devulap left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Uh oh!

[enh] Add intel compiler support #131

[enh] Add intel compiler support #131

Uh oh!

Conversation

icfaust commented Feb 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

icfaust commented Feb 14, 2024

Uh oh!

icfaust commented Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

icfaust commented Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

r-devulap left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

icfaust commented Feb 14, 2024 •

edited

Loading

icfaust commented Feb 15, 2024 •

edited

Loading

icfaust commented Feb 15, 2024 •

edited

Loading