LMDeploy Release v0.6.4

@lvhan028 released this 09 Dec 12:08
14b64c7

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

  • disable prefix-caching for vl model by @grimoire in #2825
  • Fix gemma2 accuracy through the correct softcapping logic by @AllentDan in #2842
  • fix accessing before initialization by @lvhan028 in #2845
  • fix the logic to verify whether AutoAWQ has been successfully installed by @grimoire in #2844
  • check whether backend_config is None or not before accessing its attr by @lvhan028 in #2848
  • [ascend] convert kv cache to nd format in ascend graph mode by @tangzhiyi11 in #2853
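
For readers unfamiliar with the softcapping mentioned in #2842: logit soft-capping is commonly written as `cap * tanh(x / cap)`, which keeps small scores nearly unchanged and smoothly bounds large ones. The snippet below is a minimal sketch of that formula only, not LMDeploy's actual implementation; the `soft_cap` helper and the cap value of 50.0 are hypothetical and chosen for illustration.

```python
import math


def soft_cap(scores, cap):
    """Apply cap * tanh(score / cap) to each score.

    The result is bounded by `cap` in magnitude, and scores that are
    small relative to `cap` are left almost unchanged.
    """
    if cap <= 0:
        raise ValueError("cap must be positive")
    return [cap * math.tanh(s / cap) for s in scores]


# Illustrative inputs only; the cap is a hypothetical setting.
capped = soft_cap([-1000.0, 0.0, 5.0, 1000.0], cap=50.0)
print(capped)
```

The transform is monotonic, so the ordering of scores is preserved. A hard clamp would also bound the values, but it flattens every score above the cap to the same number, whereas the tanh form stays smooth.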

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.6.3...v0.6.4