
Issue: Is Falcon 40B in GGML format from TheBloke usable? #1404

Closed
dlippold opened this issue Sep 10, 2023 · 1 comment

Comments

@dlippold

Issue you'd like to raise.

Is the instruct or chat version of the Falcon 40B model in GGML format from TheBloke usable, i.e. the one from https://huggingface.co/TheBloke/falcon-40b-instruct-GGML, https://huggingface.co/TheBloke/falcon-40b-sft-mix-1226-GGML, https://huggingface.co/TheBloke/falcon-40b-sft-top1-560-GGML, or https://huggingface.co/TheBloke/h2ogpt-gm-oasst1-en-2048-falcon-40b-v2-GGML? If several are usable, which is the preferred one?

In principle Falcon 40B should be usable, as stated in #775 and #849. But those issues link to https://huggingface.co/tiiuae/falcon-40b-instruct, which is not in GGML format from TheBloke.

Suggestion:

No response

@cebtenzzre
Member

Despite the titles of the HF repos, these are GGCC files, which are only really supported by ggllm.cpp. But this project is based on llama.cpp. You need GGML files, like these: https://huggingface.co/TheBloke/falcon-40b-instruct-GGML/tree/ef68241787499747cb21a6c8bd48384d0864003a

GGUF, a newer format that should be easier to find, will be supported for Falcon in the next release.
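If you are unsure which container format a downloaded file actually uses (the repo title alone is not reliable, as this issue shows), you can check its leading magic bytes. A minimal sketch follows; the "GGUF" magic is documented in the GGUF spec, while the byte patterns listed for the legacy GGML variants ("ggjt", "ggmf", "ggml") and ggllm.cpp's Falcon container ("ggcc") are assumptions based on those projects' magic constants written little-endian, and may not cover every variant:

```python
# Hedged sketch: identify a model file's container by its first four bytes.
# The magic names below are assumptions except for "GGUF", which is specified
# in the GGUF format documentation.
KNOWN_MAGICS = {
    b"GGUF": "GGUF (current llama.cpp format)",
    b"tjgg": "GGML (ggjt variant, legacy llama.cpp)",
    b"fmgg": "GGML (ggmf variant, legacy llama.cpp)",
    b"lmgg": "GGML (unversioned, legacy llama.cpp)",
    b"ccgg": "GGCC (ggllm.cpp, not usable with llama.cpp-based projects)",
}

def detect_model_format(path: str) -> str:
    """Read the first four bytes of a model file and name its container."""
    with open(path, "rb") as f:
        magic = f.read(4)
    return KNOWN_MAGICS.get(magic, f"unknown (magic={magic!r})")
```

For example, a GGCC file mislabeled as "GGML" in its repo name would report itself as GGCC here, which is exactly the mismatch described above.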
