Description
Name and Version
version: 5215(5f5e39e)
built with MSVC 19343.34808.0
Operating systems
Windows
GGML backends
SYCL
Hardware
Intel Core Ultra 5 125U, 32 GB RAM (ThinkPad X1 Carbon Gen 12)
Driver Version: 32.0.101.6739
Models
Qwen3-4B-gguf Q4_0 (https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main)
Problem description & steps to reproduce
When running inference with the Q4_0 model, the screen briefly goes black and llama-cli fails to work properly. The Q4_K_M model, however, runs normally.
In addition, the CUDA build (cu11.7, b5215) works properly with Q4_0.
How to reproduce:
- llama-cli.exe -ngl 99 -m Qwen3-4B-Q4_0.gguf
- Enter a question at the prompt
- The screen blacks out
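To make the Q4_0 versus Q4_K_M comparison easier to repeat, the steps above can be sketched as a small Python script. This is only a sketch under assumptions: the helper name `build_command` is hypothetical, the Q4_K_M file name is assumed to follow the same naming pattern as the Q4_0 file, and only the `-ngl` and `-m` flags from the report are used. Each model is launched interactively, so the question still has to be typed by hand.

```python
import shutil
import subprocess

# Executable name taken from the report; it is assumed to be on PATH
# or in the current directory.
LLAMA_CLI = "llama-cli.exe"


def build_command(model_path: str, gpu_layers: int = 99) -> list[str]:
    """Return the llama-cli invocation from the report for one model file."""
    return [LLAMA_CLI, "-ngl", str(gpu_layers), "-m", model_path]


def main() -> None:
    # The second file name is an assumption (same naming scheme as the Q4_0 file).
    models = ["Qwen3-4B-Q4_0.gguf", "Qwen3-4B-Q4_K_M.gguf"]
    for model in models:
        cmd = build_command(model)
        print(" ".join(cmd))
        # Skip the launch when the executable cannot be found.
        if shutil.which(LLAMA_CLI) is None:
            continue
        # Runs interactively; the exit code is not checked.
        subprocess.run(cmd, check=False)


if __name__ == "__main__":
    main()
```

Running the two models back to back with an identical command line keeps the quantization type as the only variable between the failing and the working case.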
In the event log, llama-cli.exe shows the following application error
First Bad Commit
No response
Relevant log output
1. llama-cli.exe -ngl 99 -m Qwen3-4B-Q4_0.gguf
2. Enter a question at the prompt
3. The screen blacks out