
ggml : update ggml_backend_cpu_device_supports_op #10867

Merged: 3 commits into master on Dec 17, 2024
Conversation

ggerganov (Owner)

This fixes the following failures:

./build/bin/test-backend-ops -b CPU -o CPY

  CPY(type_src=f16,type_dst=q6_K,ne=[256,4,4,4],permute=[0,0,0,0]): OK
  CPY(type_src=f16,type_dst=q6_K,ne=[256,2,3,4],permute=[0,2,1,3]): OK
  CPY(type_src=f16,type_dst=iq2_xxs,ne=[256,4,4,4],permute=[0,0,0,0]): not supported [CPU] not supported [CPU] 
  CPY(type_src=f16,type_dst=iq2_xxs,ne=[256,2,3,4],permute=[0,2,1,3]): not supported [CPU] not supported [CPU] 
  CPY(type_src=f16,type_dst=iq2_xs,ne=[256,4,4,4],permute=[0,0,0,0]): not supported [CPU] not supported [CPU] 
  CPY(type_src=f16,type_dst=iq2_xs,ne=[256,2,3,4],permute=[0,2,1,3]): not supported [CPU] not supported [CPU] 
  CPY(type_src=f16,type_dst=iq2_s,ne=[256,4,4,4],permute=[0,0,0,0]): /llama.cpp/ggml/src/ggml-cpu/ggml-cpu.c:2996: fatal error
  [the same "ggml-cpu.c:2996: fatal error" message repeats, interleaved across worker threads]
Abort trap: 6
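
For context on the doubled "not supported [CPU]" markers above (an assumption about the test harness, not something stated in this PR): test-backend-ops compares the backend under test against a second backend and prints one marker per backend before skipping the case. A simplified C++ sketch of that skip check, with hypothetical variable names:

    // Simplified sketch of the test-backend-ops skip check (assumed shape,
    // not the verbatim harness code). `out` is the op's output tensor;
    // `backend1` and `backend2` are the two backends being compared.
    bool supported = true;
    for (ggml_backend_t backend : {backend1, backend2}) {
        if (!ggml_backend_supports_op(backend, out)) {
            // one marker per backend -> the doubled "not supported [CPU]"
            printf("not supported [%s] ", ggml_backend_name(backend));
            supported = false;
        }
    }
    if (!supported) {
        printf("\n");
        // the case is skipped rather than counted as a failure
    }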

@github-actions bot added the labels testing (Everything test related) and ggml (changes relating to the ggml tensor library for machine learning) on Dec 17, 2024
@ggerganov (Owner, Author)

@slaren Is this a valid fix, or is it better to skip these types in the tests?

slaren (Collaborator) commented Dec 17, 2024

My understanding is that these types are effectively unusable without an imatrix, so they should be entirely disabled for quantizing in ggml_cpy. I think what needs to be fixed is ggml_backend_cpu_device_supports_op.
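
A minimal sketch of that direction (an assumed shape based on the comment above, not the verbatim patch): have the CPU backend's supports_op callback report GGML_OP_CPY as unsupported whenever the destination is one of the i-matrix quant types. The helper name below is hypothetical.

    #include "ggml.h"

    // Hypothetical helper for ggml_backend_cpu_device_supports_op: quantizing
    // into these IQ types without an importance matrix is effectively
    // meaningless, so ggml_cpy into them is reported as unsupported on CPU.
    static bool cpy_dst_type_supported(enum ggml_type type) {
        switch (type) {
            case GGML_TYPE_IQ2_XXS:
            case GGML_TYPE_IQ2_XS:
            case GGML_TYPE_IQ2_S:
            case GGML_TYPE_IQ1_S:
            case GGML_TYPE_IQ1_M:
                return false;
            default:
                return true;
        }
    }

    // In supports_op, the CPY case would then read:
    //     case GGML_OP_CPY: return cpy_dst_type_supported(op->type);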

@ggerganov changed the title from "ggml : fix cpy op for IQ-quants to use reference impl" to "ggml : fix cpy tests requiring i-matrix quantization" on Dec 17, 2024
@ggerganov changed the title from "ggml : fix cpy tests requiring i-matrix quantization" to "ggml : disable cpy tests requiring i-matrix quantization" on Dec 17, 2024
@ggerganov changed the title from "ggml : disable cpy tests requiring i-matrix quantization" to "ggml : update ggml_backend_cpu_device_supports_op" on Dec 17, 2024
@ggerganov merged commit 0006f5a into master on Dec 17, 2024
51 of 55 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request on Dec 20, 2024:
* ggml : fix cpy op for IQ-quants to use reference impl

ggml-ci

* ggml : disable tests involving i-matrix quantization

* ggml : update ggml_backend_cpu_device_supports_op

ggml-ci