GitHub - k2-fsa/sherpa-onnx at refs/tags/v1.10.20

Folders and files (729 commits)
.github
android
c-api-examples
cmake
dart-api-examples
dotnet-examples
ffmpeg-examples
flutter-examples
flutter
go-api-examples
ios-swift
ios-swiftui
java-api-examples
kotlin-api-examples
mfc-examples
nodejs-addon-examples
nodejs-examples
python-api-examples
scripts
sherpa-onnx
swift-api-examples
toolchains
wasm
.clang-format
.clang-tidy
.flake8
.gitignore
CHANGELOG.md
CMakeLists.txt
CPPLINT.cfg
LICENSE
MANIFEST.in
README.md
build-aarch64-linux-gnu.sh
build-android-arm64-v8a.sh
build-android-armv7-eabi.sh
build-android-x86-64.sh
build-android-x86.sh
build-arm-linux-gnueabihf.sh
build-ios-no-tts.sh
build-ios-shared.sh
build-ios.sh
build-riscv64-linux-gnu.sh
build-swift-macos.sh
build-wasm-simd-asr.sh
build-wasm-simd-kws.sh
build-wasm-simd-nodejs.sh
build-wasm-simd-tts.sh
release.sh
setup.py

Supported functions

Speech recognition	Speech synthesis	Speaker verification	Speaker identification
✔️	✔️	✔️	✔️

Spoken Language identification	Audio tagging	Voice activity detection
✔️	✔️	✔️

Keyword spotting	Add punctuation
✔️	✔️

Supported platforms

Architecture	Android	iOS	Windows	macOS	Linux
x64	✔️		✔️	✔️	✔️
x86	✔️		✔️
arm64	✔️	✔️	✔️	✔️	✔️
arm32	✔️				✔️
riscv64					✔️
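
For scripting against this matrix (for example, to decide whether a CI target is worth attempting), the table can be transcribed into a small lookup. This is only a sketch: the names `SUPPORT`, `is_supported`, and `architectures_for` are illustrative and are not part of sherpa-onnx, and the data simply mirrors the check marks shown above for this tag.

```python
# Support matrix transcribed from the table above.
# Hypothetical helper names; nothing here is a sherpa-onnx API.
SUPPORT = {
    "x64": {"Android", "Windows", "macOS", "Linux"},
    "x86": {"Android", "Windows"},
    "arm64": {"Android", "iOS", "Windows", "macOS", "Linux"},
    "arm32": {"Android", "Linux"},
    "riscv64": {"Linux"},
}


def is_supported(arch, os_name):
    """Return True if the table has a check mark for this architecture/OS pair."""
    return os_name in SUPPORT.get(arch, set())


def architectures_for(os_name):
    """List the architectures the table marks as supported on the given OS."""
    return [arch for arch, systems in SUPPORT.items() if os_name in systems]
```

For instance, `architectures_for("iOS")` yields only `arm64`, which matches the single check mark in the iOS column.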

Supported programming languages

C++	C	Python	C#	Java	JavaScript	Kotlin	Swift	Go	Dart
✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️

It also supports WebAssembly.

Introduction

This repository supports running the following functions locally:

Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting

on the following platforms and operating systems:

x86, x86_64, 32-bit ARM, 64-bit ARM (arm64, aarch64), RISC-V (riscv64)
Linux, macOS, Windows, openKylin
Android, WearOS
iOS
NodeJS
WebAssembly
Raspberry Pi
RV1126
LicheePi4A
VisionFive 2
Sunrise X3 Pi (旭日X3派)
etc.

with the following APIs:

C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart

Links for pre-built Android APKs

Description	URL	For users in China
Streaming speech recognition	Address	Here
Text-to-speech	Address	Here
Voice activity detection (VAD)	Address	Here
VAD + non-streaming speech recognition	Address	Here
Two-pass speech recognition	Address	Here
Audio tagging	Address	Here
Audio tagging (WearOS)	Address	Here
Speaker identification	Address	Here
Spoken language identification	Address	Here
Keyword spotting	Address	Here

Links for pre-built Flutter apps

Real-time speech recognition

Description	URL	For users in China
Streaming speech recognition	Address	Here

Text-to-speech

Description	URL	For users in China
Android (arm64-v8a, armeabi-v7a, x86_64)	Address	Here
Linux (x64)	Address	Here
macOS (x64)	Address	Here
macOS (arm64)	Address	Here
Windows (x64)	Address	Here

Note: You need to build from source for iOS.

Links for pre-trained models

Description	URL
Speech recognition (speech to text, ASR)	Address
Text-to-speech (TTS)	Address
VAD	Address
Keyword spotting	Address
Audio tagging	Address
Speaker identification (Speaker ID)	Address
Spoken language identification (Language ID)	See multi-lingual Whisper ASR models from Speech recognition
Punctuation	Address

Useful links

Documentation: https://k2-fsa.github.io/sherpa/onnx/
Bilibili demo videos: https://search.bilibili.com/all?keyword=%E6%96%B0%E4%B8%80%E4%BB%A3Kaldi

How to reach us

Please see https://k2-fsa.github.io/sherpa/social-groups.html for the Next-gen Kaldi (新一代 Kaldi) WeChat and QQ discussion groups.

Releases 112

Contributors 104

Languages

License

k2-fsa/sherpa-onnx

Folders and files

Latest commit

History

Repository files navigation

Supported functions

Supported platforms

Supported programming languages

Introduction

Links for pre-built Android APKs

Links for pre-built Flutter APPs

Real-time speech recognition

Text-to-speech

Links for pre-trained models

Useful links

How to reach us

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 112

Contributors 104

Languages