Skip to content

CTranslate2 3.20.0

Compare
Choose a tag to compare
@guillaumekln guillaumekln released this 18 Sep 16:13
· 75 commits to master since this release

New features

  • Update the Transformers converter to support more model architectures:
    • MixFormerSequential (used by microsoft/phi-1_5)
  • Accept batch inputs in methods generate_tokens
  • Add method Generator.async_generate_tokens to return an asynchronous generator compatible with asyncio

Fixes and improvements

  • Remove the epsilon value in the softmax CPU kernel for consistency with other implementations
  • Optimize implementation of the Dynamic Time Wrapping (DTW) function (used for Whisper alignment)
  • Avoid an unnecessary copy of the input arguments in method Whisper::align