v1.0.0
What's Changed
40% faster multi-threaded packing
, new lm_eval
api, fixed python 3.9 compat.
- Add
lm_eval
api by @PZS-ModelCloud in #338 - Multi-threaded
packing
in quantization by PZS-ModelCloud in #354 - [CI] Add TGI unit test by @PZS-ModelCloud in #348
- [CI] Updates by @CSY-ModelCloud in #347, #352, #353, #355, @CSY-ModelCloud in #357
- Fix python 3.9 compat by @PZS-ModelCloud in #358
Full Changelog: v0.9.11...v1.0.0