Skip to content

GPTQModel v0.9.5

Compare
Choose a tag to compare
@Qubitium Qubitium released this 05 Jul 13:48
· 281 commits to main since this release
f0a1ee8

What's Changed

Another large update with added support for Intel/Qbits quantization/inference on CPU. Cuda kernels have been fully deprecated in favor of better performing Exllama (v1/v2), Marlin, and Triton kernels.

Full Changelog: v0.9.4...v0.9.5