Description
So I am looking at https://github.com/antimatter15/alpaca.cpp and I see they are already running 30B Alpaca models, while we are struggling to run 7B due to the recent tokenizer updates.
I also see that the models are now even floating around on Hugging Face - I guess license issues are no longer a problem?
We should add detailed instructions for obtaining the Alpaca models, plus a temporary explanation of how to use the following script to make the models compatible with the latest master:
The bigger issue is that people keep producing the old version of the ggml models instead of migrating to the latest llama.cpp changes, so we now need this extra conversion step. It's best to figure out the steps for generating the Alpaca models and generate them in the correct format.
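To tell whether a given model file still needs the conversion step, something like the following rough sketch might help. It assumes that old-format files start with the unversioned magic `0x67676d6c` ("ggml") and that files written after the tokenizer update start with `0x67676d66` ("ggmf"), both stored as a little-endian uint32; please double-check these values against the current llama.cpp source. The helper names `classify_header` and `classify_file` are made up for this example and are not part of llama.cpp.

```python
import struct

# Assumed magic values (little-endian uint32 at the start of the file).
# Verify against the llama.cpp source before relying on them.
GGML_MAGIC_UNVERSIONED = 0x67676D6C  # 'ggml': old format, needs conversion
GGML_MAGIC_VERSIONED = 0x67676D66    # 'ggmf': format after the tokenizer update


def classify_header(header: bytes) -> str:
    """Return 'old', 'new', or 'unknown' based on the first 4 bytes."""
    if len(header) < 4:
        return "unknown"
    (magic,) = struct.unpack("<I", header[:4])
    if magic == GGML_MAGIC_UNVERSIONED:
        return "old"
    if magic == GGML_MAGIC_VERSIONED:
        return "new"
    return "unknown"


def classify_file(path: str) -> str:
    """Hypothetical convenience wrapper: classify a model file on disk."""
    with open(path, "rb") as f:
        return classify_header(f.read(4))
```

Calling `classify_file` on a downloaded model would then indicate whether the extra conversion step is likely needed ("old") or not ("new"); "unknown" means the file matches neither assumed magic.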
Edit: just don't post direct links to the models!