Skip to content

Conversation

@jeffbolznv
Copy link
Collaborator

@jeffbolznv jeffbolznv commented Aug 15, 2025

The shaders with f32 accumulators are still not used, but we should hook them up when ggml adds a way to request f32. There are a couple other fixes in here, too.

See #15274.

@jeffbolznv jeffbolznv requested a review from 0cc4m as a code owner August 15, 2025 03:56
@jeffbolznv jeffbolznv marked this pull request as draft August 15, 2025 03:56
@github-actions github-actions bot added testing Everything test related Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Aug 15, 2025
…oked up

- There's no explicit way to request f32 precision for mul_mat_id, but there
probably should be, and this gets the code in place for that.
- A couple fixes to check_results.
- Remove casts to fp16 in coopmat1 FA shader (found by inspection).
@jeffbolznv jeffbolznv marked this pull request as ready for review August 15, 2025 17:59
@0cc4m 0cc4m merged commit de21927 into ggml-org:master Aug 16, 2025
47 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants