Finish Avx512 specific lightup for Vector128/256/512<T>

With https://github.com/dotnet/runtime/issues/80814, we achieved functional parity of `Vector512<T>` with `Vector128<T>` and `Vector256<T>`. However, Avx512-capable hardware exposes new instructions that enable additional hardware acceleration for all three types.

This includes:
- [x] ConvertToDouble() - `vcvtqq2pd` & `vcvtuqq2pd`
- [x] ConvertToInt64() - `vcvtpd2qq`
- [x] ConvertToUInt32() - `vcvtps2udq`
- [x] ConvertToUInt64() - `vcvtpd2uqq`
- [x] ConditionalSelect() - `vpternlog`
- [x] Shuffle() - `vpermi2*`, `vpermt2*`, etc.
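
For reference, the APIs in the list above can be exercised through the cross-platform `Vector128` surface as in the minimal sketch below. The class name `LightupSketch` and its method names are hypothetical, chosen only for illustration. Whether the JIT actually emits the listed Avx512 instructions depends on the runtime version and the hardware; the results are the same either way.

```csharp
using System;
using System.Runtime.Intrinsics;

// Hypothetical helper class, for illustration only.
public static class LightupSketch
{
    // long -> double (candidate for vcvtqq2pd on Avx512 capable hardware)
    public static Vector128<double> ToDouble() =>
        Vector128.ConvertToDouble(Vector128.Create(1L, -2L));

    // double -> long (candidate for vcvtpd2qq)
    public static Vector128<long> ToInt64() =>
        Vector128.ConvertToInt64(Vector128.Create(3.0, -4.0));

    // float -> uint (candidate for vcvtps2udq)
    public static Vector128<uint> ToUInt32() =>
        Vector128.ConvertToUInt32(Vector128.Create(1f, 2f, 3f, 4f));

    // Bitwise select: (mask & left) | (~mask & right) (candidate for vpternlog)
    public static Vector128<int> Select() =>
        Vector128.ConditionalSelect(
            Vector128.Create(-1, 0, -1, 0),     // all-ones lanes pick from left
            Vector128.Create(1, 2, 3, 4),       // left
            Vector128.Create(10, 20, 30, 40));  // right

    // Reverse the lanes using an index vector (candidate for the vperm* family)
    public static Vector128<int> Reverse() =>
        Vector128.Shuffle(
            Vector128.Create(1, 2, 3, 4),
            Vector128.Create(3, 2, 1, 0));
}
```

Inspecting the JIT disassembly for these methods (for example with the `DOTNET_JitDisasm` environment variable) is one way to check which instructions were selected on a given machine.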

We should also ensure that all APIs are accelerated as intrinsics, where applicable. In particular, the following are still managed fallbacks (although the fallbacks themselves are accelerated):
- [x] Vector512.Dot()
- [x] Vector512.Sum()
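
As a small usage sketch of the two APIs listed above (the class name `ReductionSketch` is hypothetical), the calls behave the same whether they are handled as JIT intrinsics or through the managed fallback; only the generated code differs. `Vector512.IsHardwareAccelerated` reports whether the current machine accelerates the type at all.

```csharp
using System;
using System.Runtime.Intrinsics;

// Hypothetical helper class, for illustration only.
public static class ReductionSketch
{
    // Sum of sixteen float lanes, each holding 1.0f.
    public static float SumOfOnes() =>
        Vector512.Sum(Vector512.Create(1f));

    // Dot product of sixteen lanes of 2.0f with sixteen lanes of 3.0f.
    public static float DotOfConstants() =>
        Vector512.Dot(Vector512.Create(2f), Vector512.Create(3f));

    // True when Vector512<T> operations are hardware accelerated here.
    public static bool Accelerated() => Vector512.IsHardwareAccelerated;
}
```

With `Vector512<float>.Count` equal to 16, `SumOfOnes()` returns 16 and `DotOfConstants()` returns 96.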

There may be others as well, so a general audit to validate would be good.

Finish Avx512 specific lightup for Vector128/256/512<T> #85207

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Finish Avx512 specific lightup for Vector128/256/512<T> #85207

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions