Closed
Description
Hi, I installed the latest llama.cpp on macOS and tried gemma-7b-it GGUF Q4, but the model still generates nonsense in both chat mode and completion mode. I have already added the `<bos>` token to the prompt.
```
./main -m ./mlabonne/gemma-7b-it-GGUF/gemma-7b-it.Q4_0.gguf -n 256 --color -i
```
Results:

Please help check this, or advise how to fix the problem.
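
For reference, my understanding is that the instruction-tuned Gemma models expect their chat-turn markers (`<start_of_turn>` / `<end_of_turn>`) in the prompt, and that `./main` prepends `<bos>` itself. This is a sketch of the kind of invocation I would expect to work, using my model path and a placeholder question (`-e` makes `main` interpret the `\n` escapes):

```shell
# Sketch only: assumes the gemma-it chat format and that ./main adds <bos>
# itself, so it is not repeated in the prompt text.
./main -m ./mlabonne/gemma-7b-it-GGUF/gemma-7b-it.Q4_0.gguf -n 256 --color -e \
  -p "<start_of_turn>user\nWhy is the sky blue?<end_of_turn>\n<start_of_turn>model\n"
```

If the output is still gibberish with this format, that would suggest the problem is in the GGUF conversion or quantization rather than the prompt.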