Skip to content

Commit a65fa3a

Browse files
committed
vulkan : disable FA for mask->ne[2] != 1
1 parent 89ee2f1 commit a65fa3a

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

ggml/src/ggml-vulkan/ggml-vulkan.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10268,7 +10268,7 @@ static bool ggml_backend_vk_device_supports_op(ggml_backend_dev_t dev, const ggm
1026810268
// TODO: support broadcast
1026910269
// note: this was initially implemented in https://github.com/ggml-org/llama.cpp/pull/14449, but
1027010270
// the interface of ggml_flash_attn_ext() changed in https://github.com/ggml-org/llama.cpp/pull/14505
10271-
if (op->src[0]->ne[3] != 1) {
10271+
if (op->src[0]->ne[3] != 1 || op->src[3]->ne[2] != 1) {
1027210272
return false;
1027310273
}
1027410274
// It's straightforward to support different K/V dequant, but would

0 commit comments

Comments
 (0)