Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] The specified tag is not a valid quantization scheme. #1476

Open
Mushoz opened this issue Oct 28, 2024 · 2 comments
Open

[Bug] The specified tag is not a valid quantization scheme. #1476

Mushoz opened this issue Oct 28, 2024 · 2 comments

Comments

@Mushoz
Copy link

Mushoz commented Oct 28, 2024

Bug description.
When trying to pull a specific quantization tag for a model through Ollama I was getting the following error: The specified tag is not a valid quantization scheme.

At first I thought it was an Ollama bug, so I reported it to their issue tracker. However, it seems that it's really a Huggingface bug instead. I can replicate the issue with curl:

curl -vL -H Accept:application/vnd.docker/distribution.manifest.v2+json hf.co/v2/bartowski/Replete-LLM-V2.5-Qwen-32b-GGUF/manifests/IQ4_NL

Gives me the exact same error, while:

curl -vL -H Accept:application/vnd.docker/distribution.manifest.v2+json hf.co/v2/bartowski/Replete-LLM-V2.5-Qwen-32b-GGUF/manifests/Q4_K_M

Does not. It seems like Huggingface does not recognize IQ4_NL as a valid quantization scheme.

Upon further testing, IQ2_XXS suffers from the exact same problem, while IQ2_XS does work fine. This test was done with this repo: https://huggingface.co/bartowski/Meta-Llama-3.1-70B-Instruct-GGUF

Describe the expected behaviour
If the Quant is valid and exists within the specified repo, it should pull successfully.

@Vaibhavs10
Copy link
Member

Hi @Mushoz - Thanks for the issue; we're looking into this with @ngxson 🤗

@Vaibhavs10
Copy link
Member

Hi @Mushoz - can you retry now, this should be fixed.

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants