
Attributed in huggingface/transformers #1

Open

LysandreJik opened this issue Apr 19, 2024 · 5 comments

@LysandreJik commented Apr 19, 2024

Hello!

FYI, we've been using your code to add support for GGUF files in the Python ecosystem by making them loadable within transformers.
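
For context, loading a GGUF checkpoint would then look roughly like this (a sketch of the intended usage; the `gguf_file` argument follows the transformers GGUF integration, and the repository and file names are purely illustrative):

```python
# Sketch of loading a GGUF checkpoint via transformers; the model id and
# file name below are illustrative placeholders, not tested values.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF"  # hypothetical repo
gguf_file = "tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf"   # hypothetical file

# The GGUF tensors are dequantized to float when the model is loaded.
tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=gguf_file)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=gguf_file)
```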

We're doing so here; we've credited you in the documentation, and I've added you as a co-author: https://github.com/LysandreJik/transformers/pull/2/files

We'll open a PR on the main fork in the coming days, so I wanted to give you a chance to take a look beforehand.

Thanks a lot for your work 🤗

cc @younesbelkada

@99991 (Owner) commented Apr 19, 2024

Very cool! I am glad that you found my code useful!

But I am also a bit worried about potential bugs. I've only tested with TinyLlama so far, so it might break completely for other models. For example, I am not sure about the transposed shapes.

In addition, I am not sure this is the best way forward for the transformers library. Not having to add extra dependencies is certainly nice, but NumPy is significantly slower than bit-wrangling code written in C, because of all the copying from one NumPy array to another.
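
To illustrate the copying overhead, here is a minimal sketch of Q8_0 dequantization in that NumPy style, assuming ggml's standard block layout (one float16 scale followed by 32 int8 weights); this is not pygguf's exact code:

```python
import numpy as np

def dequantize_q8_0(data: bytes, num_blocks: int) -> np.ndarray:
    # Q8_0 block: float16 scale (2 bytes) + 32 int8 weights = 34 bytes.
    blocks = np.frombuffer(data, dtype=np.uint8).reshape(num_blocks, 34)
    # Each copy()/view()/astype() below allocates a fresh array -- this is
    # the array-to-array copying that makes NumPy slower than C here.
    scales = blocks[:, :2].copy().view(np.float16).astype(np.float32)
    qs = blocks[:, 2:].copy().view(np.int8).astype(np.float32)
    return (scales * qs).reshape(-1)
```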

@99991 (Owner) commented Apr 21, 2024

Anyway, it might be nice to have a NumPy implementation to fall back on. For completeness, I have implemented the missing quantization formats Q2_K, Q3_K and Q5_K. I have not implemented the other formats, since they are expected to perform worse than the existing ones.

a417edb
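
For reference, these K-quant formats pack 256 weights per super-block; the summary below follows ggml's block structs and is worth double-checking against the current headers:

```python
# K-quant super-block layouts per ggml's block_q{2,3,5}_K structs;
# each block covers QK_K = 256 weights. Sizes taken from the ggml
# reference headers -- verify against the current source.
QK_K = 256
BLOCK_LAYOUT = {
    #  name:  (bytes, fields in struct order)
    "Q2_K": (84,  "scales[16] (4-bit pairs), qs[64] (2-bit), d f16, dmin f16"),
    "Q3_K": (110, "hmask[32], qs[64] (low 2 bits), scales[12], d f16"),
    "Q5_K": (176, "d f16, dmin f16, scales[12], qh[32] (high bits), qs[128] (4-bit)"),
}
```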

@99991 (Owner) commented Aug 15, 2024

I've just heard that llama.cpp now implements dequantization, so you might want to consider switching to it from pygguf, since it supports more quantization formats: ggerganov/llama.cpp#8939
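
For anyone making the switch, the call would look roughly like this (a sketch assuming the gguf-py API added in that PR; verify the names against the released package):

```python
# Hedged sketch: dequantizing one raw tensor with gguf-py, the Python
# package that ships with llama.cpp (API per ggerganov/llama.cpp#8939).
import numpy as np
from gguf import GGMLQuantizationType, quants

raw = np.fromfile("tensor_q4_k.bin", dtype=np.uint8)  # hypothetical raw tensor dump
weights = quants.dequantize(raw, GGMLQuantizationType.Q4_K)
print(weights.dtype, weights.shape)
```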

@LysandreJik (Author)

Thanks for the heads-up @99991!

cc @SunMarc for your information

@SunMarc commented Aug 26, 2024

Thanks for the heads-up @99991, really appreciate it! We already have a PR open to make the switch: huggingface/transformers#32625
