
Clamp out of range values in K quantizer #6888

Draft · wants to merge 1 commit into master
Conversation

jart (Contributor) commented Apr 25, 2024

This assertion fails when quantizing Mixtral 8x7b as Q5_K_M, because I used `convert.py --outtype f32` and the Mixtral weights use bf16, which has a much larger exponent range than the K quantizer expects. If `--outtype f16` is used, the assert doesn't fail.

See #2982

@mofosyne added the labels bugfix (fixes an issue or bug), Review Complexity: Medium (generally requires more time to grok but manageable by beginner to medium expertise level), and model (model specific) on May 9, 2024
@mofosyne marked this pull request as draft on May 18, 2024 05:21