Skip to content

Conversation

@EgorBo
Copy link
Member

@EgorBo EgorBo commented Aug 4, 2023

Fixes #90001

It seems that we can swap op1 and op3 operandands in PermuteVar32x16x2(upper, mask, lower) if op1 is containable. For that, we need to "reverse" mask (toggle bits) so there was a small bug in the toggle bit mask - I simplified the code to switches to remove calculations

@ghost ghost assigned EgorBo Aug 4, 2023
@ghost ghost added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Aug 4, 2023
@ghost
Copy link

ghost commented Aug 4, 2023

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

Fixes #90001

It seems that we can swap op1 and op3 operandands in PermuteVar32x16x2(upper, mask, lower) if op1 is containable. For that, we need to "reverse" mask (toggle bits) so there was a small bug in the toggle bit mask - I simplified the code to switches to remove calculations

Author: EgorBo
Assignees: EgorBo
Labels:

area-CodeGen-coreclr

Milestone: -

@EgorBo EgorBo requested a review from tannergooding August 4, 2023 10:43
@EgorBo EgorBo merged commit e07d1ee into dotnet:main Aug 4, 2023
@EgorBo EgorBo deleted the fix-avx512 branch August 4, 2023 16:09
@ghost ghost locked as resolved and limited conversation to collaborators Sep 3, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Incorrect codegen for PermuteVar32x16x2

2 participants