Description
Describe the bug
A770 can't run deepseek R1 Q4 with flashmoe
How to reproduce
Steps to reproduce the error:
- install the gpu driver following the instruction (https://dgpu-docs.intel.com/driver/client/overview.html)
- download the gguf [DeepSeek-R1-Q4_K_M.gguf] which includs 9 files.
- ./flash-moe -m /PATH/TO/DeepSeek-R1-Q4_K_M-00001-of-00009.gguf --prompt "What's AI?" -no-cnv
- then there is the error message:
Screenshots
./flash-moe -m /home/deepseek/文档/deepseek/DeepSeek-R1-Q4_K_M-00001-of-00009.gguf --prompt "What's AI?" -no-cnv
terminate called after throwing an instance of 'std::filesystem::__cxx11::filesystem_error'
what(): filesystem error: Cannot convert character sequence: Invalid or incomplete multibyte or wide character
./flash-moe: 第 25 行: 8026 已中止 (核心已转储) LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$(cd "$(dirname "$0")";pwd)
Environment information
ubuntu 22.04.05
384G ddr5
2 A770 gpus
Additional context
Add any other context about the problem here.