Speech recognition
Speech synthesis
Speaker verification
Speaker identification
✔️
✔️
✔️
✔️
Spoken Language identification
Audio tagging
Voice activity detection
Keyword spotting
✔️
✔️
✔️
✔️
Architecture
Android
iOS
Windows
macOS
linux
x64
✔️
✔️
✔️
✔️
x86
✔️
✔️
arm64
✔️
✔️
✔️
✔️
✔️
arm32
✔️
✔️
riscv64
✔️
Supported programming languages
C++
C
Python
C#
Java
JavaScript
Kotlin
Swift
Go
Dart
✔️
✔️
✔️
✔️
✔️
✔️
✔️
✔️
✔️
✔️
It also supports WebAssembly.
This repository supports running the following functions locally
Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad )
Keyword spotting
on the following platforms and operating systems:
with the following APIs
C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart
Links for pre-built Android APKs
Links for pre-built Flutter APPs
Description
URL
中国用户
Streaming speech recognition
Address
点此
Links for pre-trained models
Please see
https://k2-fsa.github.io/sherpa/social-groups.html
for 新一代 Kaldi 微信交流群 and QQ 交流群 .