Open
Description
Hi, Thanks for the great work. Hope this gets merged into llama.cpp, but till then, I'm able to get things to work in the command line. However, when running the python example, I get this error:
FileNotFoundError: Shared library with base name "bert" not found
I think I'm missing a package? I did the pip install requirements bit, so not sure what I'm getting wrong.
EDIT 1: Just noticed this has been merged into llama.cpp. For some reason I get an error when loading it into llama.cpp
llama_model_load: error loading model: error loading model hyperparameters: key not found in model: bert.context_length
This gguf was converted using bert.cpp. Does the original model have to be converted through llama.cpp?
EDIT 2: I see there's an issue with the embeddings implementation in llama.cpp
Also tried converting the model using llama.cpp convert.py but get this error:
Loading model file /home/sravanth/vecsearch/UAE-Large-V1/model.safetensors
Traceback (most recent call last):
File "/home/sravanth/llama.cpp/convert.py", line 1483, in <module>
main()
File "/home/sravanth/llama.cpp/convert.py", line 1430, in main
params = Params.load(model_plus)
File "/home/sravanth/llama.cpp/convert.py", line 317, in load
params = Params.loadHFTransformerJson(model_plus.model, hf_config_path)
File "/home/sravanth/llama.cpp/convert.py", line 256, in loadHFTransformerJson
f_norm_eps = config["rms_norm_eps"],
KeyError: 'rms_norm_eps'
Metadata
Metadata
Assignees
Labels
No labels