This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Support GGJT v3 #252

Closed
@philpax

Description

llama.cpp has added a new quantization level, which means models using it will be published in the near future. We will need to support it.

ggerganov/llama.cpp#1508
