Expose various floating-point intrinsics for Avx512F and Avx512DQ #85716

tannergooding · 2023-05-03T16:25:33Z

This exposes some instructions unique to the AVX512 family of instructions making progress towards completing:

There will be a separate PR to utilize some of these in our scalar math APIs. For example, vrange can be used to implement a faster/correct Max/MaxMagnitude/MaxNumber/MaxNumberMagnitude and Min/MinMagnitude/MinNumber/MinNumberMagnitude (where-as currently we can only accelerate on x86/x64 if one input is constant).

Likewise vfixup can be used to handle many complex branching conditions where various edge cases are being handled.

ghost · 2023-05-03T16:25:40Z

Note regarding the new-api-needs-documentation label:

This serves as a reminder for when your PR is modifying a ref *.cs file and adding/modifying public APIs, please make sure the API implementation in the src *.cs file is documented with triple slash comments, so the PR reviewers can sign off that change.

ghost · 2023-05-03T16:25:48Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

This exposes some instructions unique to the AVX512 family of instructions making progress towards completing:

There will be a separate PR to utilize some of these in our scalar math APIs. For example, vrange can be used to implement a faster/correct Max/MaxMagnitude/MaxNumber/MaxNumberMagnitude and Min/MinMagnitude/MinNumber/MinNumberMagnitude (where-as currently we can only accelerate on x86/x64 if one input is constant).

Likewise vfixup can be used to handle many complex branching conditions where various edge cases are being handled.

Author:	tannergooding
Assignees:	tannergooding
Labels:	`area-CodeGen-coreclr`, `new-api-needs-documentation`
Milestone:	-

…ation

EgorBo · 2023-05-04T00:24:53Z

src/coreclr/jit/hwintrinsic.h

            case NI_SSE41_CeilingScalar:
+            case NI_AVX_Ceiling:
+            {
+                FALLTHROUGH;


nit: remove { FALLTHROUGH; }

I had it split "explicitly" here to help visualize the groupings between Ceiling vs RoundToPositiveInfinity (and likewise Floor vs RoundToNegativeInfinity), particularly since Ceiling/Floor don't have AVX512 equivalents.

EgorBo

LGTM

sebastienros · 2023-05-08T19:09:51Z

This PR is part of a regression I just filed: #85930
Based on the recent fix for AVX512 I assume it could be related, you know better.

tannergooding · 2023-05-08T19:16:39Z

I wouldn't expect it, this was a zero spmi diffs change since it just added new APIs, it didn't update any existing paths to use them.

tannergooding added 4 commits May 2, 2023 08:55

Expose GetExponent and GetMantissa for Avx512F

ad57433

Expose Reciprocal14 and ReciprocalSqrt14 for Avx512F

8836eb2

Expose RoundScale and Scale for Avx512F

150c9dc

Expose Fixup for Avx512F + Range and Reduce for Avx512DQ

8066f54

ghost added area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI new-api-needs-documentation labels May 3, 2023

ghost assigned tannergooding May 3, 2023

tannergooding added 2 commits May 3, 2023 09:40

Ensure the RMW handling for Fixup avoids allocating a register

5b2cf7d

Ensure the NI_AVX512F_Fixup handling in isRMWHWIntrinsic compiles

a72e94d

tannergooding added the avx512 Related to the AVX-512 architecture label May 3, 2023

tannergooding added 2 commits May 3, 2023 10:05

Ensure vrange is marked as INS_Flags_IsDstDstSrcAVXInstruction

f9204c8

Apply formatting patch

ef7d87c

build-analysis bot mentioned this pull request May 3, 2023

IOException running NuGet-Migrations during tests in dotnet CLI first run #80619

Closed

tannergooding added 2 commits May 3, 2023 14:30

Ensure vfixupimm is correctly handled in the JIT

a496a48

Ensure FixupScalar only checks the first element when doing RMW valid…

1fd9f2a

…ation

EgorBo reviewed May 4, 2023

View reviewed changes

EgorBo approved these changes May 4, 2023

View reviewed changes

tannergooding merged commit cb5fe56 into dotnet:main May 4, 2023

tannergooding deleted the avx512-4 branch May 4, 2023 02:44

ghost locked as resolved and limited conversation to collaborators Jun 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Expose various floating-point intrinsics for Avx512F and Avx512DQ #85716

Expose various floating-point intrinsics for Avx512F and Avx512DQ #85716

Uh oh!

tannergooding commented May 3, 2023

Uh oh!

ghost commented May 3, 2023

Uh oh!

ghost commented May 3, 2023

Uh oh!

EgorBo May 4, 2023

Uh oh!

tannergooding May 4, 2023

Uh oh!

EgorBo left a comment

Uh oh!

sebastienros commented May 8, 2023

Uh oh!

tannergooding commented May 8, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Expose various floating-point intrinsics for Avx512F and Avx512DQ #85716

Expose various floating-point intrinsics for Avx512F and Avx512DQ #85716

Uh oh!

Conversation

tannergooding commented May 3, 2023

Uh oh!

ghost commented May 3, 2023

Uh oh!

ghost commented May 3, 2023

Uh oh!

EgorBo May 4, 2023

Choose a reason for hiding this comment

Uh oh!

tannergooding May 4, 2023

Choose a reason for hiding this comment

Uh oh!

EgorBo left a comment

Choose a reason for hiding this comment

Uh oh!

sebastienros commented May 8, 2023

Uh oh!

tannergooding commented May 8, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants