Closed as not planned
Description
ExLlama (https://github.com/turboderp/exllama)
It is currently the fastest and most memory-efficient model executor that I'm aware of.
Is there interest from the maintainers in adding support for it?