Skip to content

Openrouter Provider preferences #286

@Munsio

Description

@Munsio

It seems that Openrouter now has a few provider which provide a lower quantization size than others, we need to ensure with our calls to openrouter that we are not going to mix those with multiple requests.

Providers with different quantization:
https://openrouter.ai/models/meta-llama/llama-3.1-8b-instruct/status

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingenhancementNew feature or requestpostponedThis issue/PR is postponed until there is a very good reason (e.g. $$$) to implement it.

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions