-
Notifications
You must be signed in to change notification settings - Fork 10
Closed
Labels
bugSomething isn't workingSomething isn't workingenhancementNew feature or requestNew feature or requestpostponedThis issue/PR is postponed until there is a very good reason (e.g. $$$) to implement it.This issue/PR is postponed until there is a very good reason (e.g. $$$) to implement it.
Milestone
Description
It seems that Openrouter now has a few provider which provide a lower quantization size than others, we need to ensure with our calls to openrouter that we are not going to mix those with multiple requests.
Providers with different quantization:
https://openrouter.ai/models/meta-llama/llama-3.1-8b-instruct/status
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workingenhancementNew feature or requestNew feature or requestpostponedThis issue/PR is postponed until there is a very good reason (e.g. $$$) to implement it.This issue/PR is postponed until there is a very good reason (e.g. $$$) to implement it.