sync master #8

tc-mb · 2024-05-28T19:02:55Z

Add optional MLP bias for Granite models

Add optional MLP bias for ARCH_LLAMA to support Granite models. Partially addresses ggerganov/issues/7116 Still needs some more changes to properly support Granite.

llama: honor add_space_prefix from the model configuration

propagate the add_space_prefix configuration from the HF model configuration to the gguf file and honor it with the gpt2 tokenizer.

llama: add support for small granite models

it works only for the small models 3b and 8b.

The convert-hf-to-gguf.py script uses the vocabulary size of the granite models to detect granite and set the correct configuration.

* Add optional MLP bias for Granite models Add optional MLP bias for ARCH_LLAMA to support Granite models. Partially addresses ggerganov/issues/7116 Still needs some more changes to properly support Granite. * llama: honor add_space_prefix from the model configuration propagate the add_space_prefix configuration from the HF model configuration to the gguf file and honor it with the gpt2 tokenizer. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> * llama: add support for small granite models it works only for the small models 3b and 8b. The convert-hf-to-gguf.py script uses the vocabulary size of the granite models to detect granite and set the correct configuration. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> --------- Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Co-authored-by: Steffen Roecker <sroecker@redhat.com>

tc-mb merged commit 28d4a7f into prepare-PR-of-minicpm-v2.5 May 28, 2024
76 of 132 checks passed

github-actions bot added the python label May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sync master #8

sync master #8

tc-mb commented May 28, 2024

sync master #8

sync master #8

Conversation

tc-mb commented May 28, 2024