add support for Orion-14B #5118
Conversation
Can confirm that it works with https://huggingface.co/OrionStarAI/Orion-14B-Chat/blob/main/Orion-14B-Chat.gguf (converted to Q5_K_M), although it is not clear what the correct prompt format is.
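For what it's worth, the upstream model card suggests the chat template is roughly `Human: {prompt}\n\nAssistant: </s>`; that is an assumption taken from the OrionStarAI repo, not something verified in this PR. A minimal sketch using `main` (the `-e` flag makes it expand the `\n` escapes in the prompt; the model path is hypothetical):

```bash
# Hedged example: the template "Human: {prompt}\n\nAssistant: </s>" is assumed
# from the upstream Orion-14B-Chat model card, not confirmed by this PR.
# -e tells main to process escape sequences (\n) in the prompt string.
./main -m models/Orion-14B-Chat-Q5_K_M.gguf \
       -e -p "Human: Hello, who are you?\n\nAssistant: </s>" \
       -n 256
```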
Can confirm working on ROCm.
Tangweirui2021
left a comment
These changes fix the conversion problem and enable the model to run correctly.
sharpHL
left a comment
Orion-14B-support
llm_load_print_meta: BOS token = 1 '
* add support for Orion-14B (https://huggingface.co/OrionStarAI/Orion-14B-Chat)
* flake8 support
* Update llama.cpp
  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* Update llama.cpp
  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* Update llama.cpp
  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* Update llama.cpp
  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* Update llama.cpp
  Co-authored-by: slaren <slarengh@gmail.com>
* Update llama.cpp
* Update llama.cpp

---------

Co-authored-by: lixiaopu <lixiaopu@cmcm.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: slaren <slarengh@gmail.com>
Support for the Orion-14B family of models (a conversion sketch follows the list):
https://huggingface.co/OrionStarAI/Orion-14B-Chat
https://huggingface.co/OrionStarAI/Orion-14B-Chat-Plugin
https://huggingface.co/OrionStarAI/Orion-14B-Chat-RAG
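A minimal sketch of the conversion and quantization workflow the comments above refer to, assuming the convert-hf-to-gguf.py script touched by this PR and the quantize tool from the repo root; paths and flags are illustrative and may differ in later versions of the tree:

```bash
# Illustrative workflow only; script and tool names follow the llama.cpp
# tree at the time of this PR and may have been renamed since.

# 1. Convert the Hugging Face checkout to an f16 GGUF file.
python convert-hf-to-gguf.py /path/to/Orion-14B-Chat \
    --outfile Orion-14B-Chat-f16.gguf --outtype f16

# 2. Quantize to Q5_K_M, as in the report above.
./quantize Orion-14B-Chat-f16.gguf Orion-14B-Chat-Q5_K_M.gguf Q5_K_M
```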