Closed
Description
The Phi-3 4k model includes the end token "<|end|>" in every response.
I'm using https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf with the latest version of the llama.cpp CUDA server Docker image.
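For illustration, here is a rough client-side stopgap, not a fix for the underlying problem. It assumes the response text is available as a plain string and that the server's completion request accepts a "stop" list of strings. The helper names `strip_end_token` and `build_payload` are hypothetical and not part of llama.cpp.

```python
# Hypothetical client-side workaround; neither helper is part of llama.cpp.
END_TOKEN = "<|end|>"


def strip_end_token(text: str, token: str = END_TOKEN) -> str:
    """Cut the response at the first occurrence of the end token."""
    index = text.find(token)
    if index == -1:
        # Token not present: return the text unchanged.
        return text
    # Drop the token and anything after it, plus trailing whitespace.
    return text[:index].rstrip()


def build_payload(prompt: str, n_predict: int = 128) -> dict:
    """Build a completion request body that asks the server to stop at the token.

    Assumption: the server accepts "prompt", "n_predict" and "stop" fields.
    """
    return {"prompt": prompt, "n_predict": n_predict, "stop": [END_TOKEN]}


if __name__ == "__main__":
    print(strip_end_token("Hello, how can I help?<|end|>"))
    print(build_payload("Hi"))
```

Passing the token as a stop string and stripping it again on the client overlap on purpose, so the output stays clean even if one of the two does not take effect.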
Thanks in advance.