Skip to content

v0.1.5

Compare
Choose a tag to compare
@github-actions github-actions released this 24 Jul 11:25
· 4 commits to main since this release
b5b8745

What's Changed #27

  • add vllm>=0.4.1 by @liyucheng09 in #19, #44
  • Feature(MInference): update HF demo information, thanks @ak's sponsoring by @iofu728 in #22
  • Feature(MInference): add unittest by @iofu728 in #31, #32
  • Feature(MInference): add triton-based decoding in case flash_attn is not available by @liyucheng09 in #35
  • Feature(MInference): add e2e benchmark using vllm by @iofu728 in #49
  • Feature(MInference): support llama 3.1 by @iofu728 in #54
  • Hotfix(MInference): fix the import warnings, fix the apply_rotary_pos… by @iofu728 in #30

New Contributors

Full Changelog: v0.1.4...v0.1.5