Skip to content

Conversation

@copybara-service
Copy link
Contributor

@copybara-service copybara-service bot commented Dec 8, 2025

Added convert<bf16x8, f32xXx2> to arm_neon and x86 to decrease intrinsics usage in reduce kernels.
Also added convert<x8x16, f32x4x4> to x86_sse2 and sse41.

@copybara-service copybara-service bot changed the title Added convert<bf16x8, f32x4x2> to arm_neon to reduce intrinsics usage in reduce kernels. Added convert<bf16x8, f32xXx2> to arm_neon and x86 to decrease intrinsics usage in reduce kernels. Dec 8, 2025
…sics usage in reduce kernels.

Also added convert<x8x16, f32x4x4> to x86_sse2 and sse41.

PiperOrigin-RevId: 841900379
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant