
feature: support for exllama and AutoGPTQ #796

Closed
@mudler

Description


Discussed in #763

Originally posted by yarray July 17, 2023
Although llama.cpp now supports GPU acceleration via cuBLAS, exllama appears to run several times faster given a capable enough GPU (a 3090, for example). Is there any plan to support exllama or, more generally, other loaders for LLMs?
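For readers unfamiliar with these loaders, below is a minimal sketch of what loading a GPTQ-quantized checkpoint through AutoGPTQ's Python API looks like; the checkpoint name and generation settings are illustrative placeholders, not something specified in this issue.

```python
# Minimal sketch: running a GPTQ-quantized model with AutoGPTQ.
# The checkpoint name is a placeholder; substitute any GPTQ model.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

model_id = "TheBloke/Llama-2-7B-GPTQ"  # illustrative checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoGPTQForCausalLM.from_quantized(model_id, device="cuda:0")

prompt = "What is exllama?"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

Supporting a loader like this would mean exposing an equivalent code path as a backend, alongside the existing llama.cpp one.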
