Skip to content

A770 can't run deepseek R1 Q4 with flashmoe #13183

Open
@luningxie

Description

@luningxie

Describe the bug
A770 can't run deepseek R1 Q4 with flashmoe

How to reproduce
Steps to reproduce the error:

  1. install the gpu driver following the instruction (https://dgpu-docs.intel.com/driver/client/overview.html)
  2. download the gguf [DeepSeek-R1-Q4_K_M.gguf] which includs 9 files.
  3. ./flash-moe -m /PATH/TO/DeepSeek-R1-Q4_K_M-00001-of-00009.gguf --prompt "What's AI?" -no-cnv
  4. then there is the error message:

Screenshots

./flash-moe -m /home/deepseek/文档/deepseek/DeepSeek-R1-Q4_K_M-00001-of-00009.gguf --prompt "What's AI?" -no-cnv
terminate called after throwing an instance of 'std::filesystem::__cxx11::filesystem_error'
what(): filesystem error: Cannot convert character sequence: Invalid or incomplete multibyte or wide character
./flash-moe: 第 25 行: 8026 已中止 (核心已转储) LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$(cd "$(dirname "$0")";pwd) $(cd "$(dirname "$0")";pwd)/llama-cli-bin -t $CORES -e -ngl 999 --color --no-context-shift -ot exps=CPU "$@"

Environment information
ubuntu 22.04.05
384G ddr5
2 A770 gpus

Additional context
Add any other context about the problem here.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions