GPTQModel v1.0.7

@Qubitium Qubitium released this 08 Oct 14:19
· 74 commits to main since this release
e208d38

What's Changed

Fixed two bugs: the (faster) Marlin kernel was not auto-selected for some models, and saving AutoRound-quantized models threw JSON errors.

Full Changelog: v1.0.6...v1.0.7