GPT-NeoX has only minimal inference support #3293

Closed

Description

@cebtenzzre

Steps to reproduce:

  1. Download https://huggingface.co/EleutherAI/gpt-neox-20b
  2. Convert the model and attempt to use it:
$ TMPDIR=/var/tmp ./convert-gptneox-hf-to-gguf.py gpt-neox-20b 1 --outfile gpt-neox-20b.f16.gguf
$ ./main -m gpt-neox-20b.f16.gguf
<snip>
llama_model_loader: - type  f32:  354 tensors
llama_model_loader: - type  f16:  178 tensors
error loading model: cannot find tokenizer scores in model file

llama_load_model_from_file: failed to load model
llama_init_from_gpt_params: error: failed to load model 'gpt-neox-20b.f16.gguf'
main: error: unable to load model
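
The error suggests the loader expects per-token scores (tokenizer.ggml.scores) even for BPE vocabularies, which GPT-NeoX's GPT-2 style tokenizer does not provide. Below is a minimal sketch of what a converter could write to get past that check; it assumes the gguf Python package from the llama.cpp repo (gguf-py) and its GGUFWriter add_tokenizer_model / add_token_list / add_token_scores methods, and the 0.0 placeholder scores and function name are illustrative only:

# workaround sketch, not a proper fix
import gguf
from transformers import AutoTokenizer

def write_vocab_with_placeholder_scores(writer: gguf.GGUFWriter, model_dir: str) -> None:
    # e.g. writer = gguf.GGUFWriter("gpt-neox-20b.f16.gguf", "gptneox")
    tok = AutoTokenizer.from_pretrained(model_dir)
    vocab_size = tok.vocab_size

    # GPT-NeoX ships a GPT-2 style BPE tokenizer with no per-token scores,
    # so emit a neutral 0.0 placeholder for every token to satisfy the loader.
    tokens = [tok.convert_ids_to_tokens(i).encode("utf-8") for i in range(vocab_size)]
    scores = [0.0] * vocab_size

    writer.add_tokenizer_model("gpt2")
    writer.add_token_list(tokens)
    writer.add_token_scores(scores)

Placeholder scores only paper over the missing-field check; first-class GPT-NeoX / BPE tokenizer handling in llama.cpp is what this issue is asking for.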

    Labels

    enhancement (New feature or request)
