Skip to content

V0.1.5.post1: Support LLaMA-3-70B, Multi-gpu, fix kernel / sqrt(dk)

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 13 Aug 09:17
· 1 commit to main since this release
5e37e0d

What's Changed

  • Feature(MInference): support LLaMA-3-70B-1M and multi-gpu PP by @iofu728 in #59
  • Fix(MInference): fix e2e benchmark guideline & fix A-shape multi gpu by @iofu728 in #66
  • Fix(MInference): fix the vs pattern loss / sqrt(dk) by @PiotrNawrot in #70

Full Changelog: v0.1.5...v0.1.5.post1