Skip to content

hexagon: apply repl optimization in flash attn softmax as #22993#23455

Open
njsyw1997 wants to merge 1 commit into
ggml-org:masterfrom
aizip:yiwei/hex-repl-fa
Open

hexagon: apply repl optimization in flash attn softmax as #22993#23455
njsyw1997 wants to merge 1 commit into
ggml-org:masterfrom
aizip:yiwei/hex-repl-fa

Conversation

@njsyw1997
Copy link
Copy Markdown
Contributor

Overview

Substitute hvx_vec_splat_f16 with hvx_vec_repl_f16 in hmx softmax. Everything looks good. No obvious performance improvement since it's not on the hot path. Just for alignment.

Additional information

Requirements

@njsyw1997 njsyw1997 requested a review from a team as a code owner May 21, 2026 01:52
@github-actions github-actions Bot added ggml changes relating to the ggml tensor library for machine learning Hexagon labels May 21, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning Hexagon

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants