Port of OpenAI's Whisper model in C/C++
-
Updated
Apr 8, 2025 - C++
Port of OpenAI's Whisper model in C/C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 11 programming languages
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
The official repository of the Eesen project
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
OBS plugin for local speech recognition and captioning using AI
Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
This plugin integrates Azure Speech Cognitive Services in Unreal Engine.
CleanStream is an OBS plugin that uses AI to clean live audio streams from unwanted words and utterances
Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command based Voice Assistant to your own Azure subscription
Server framework for Kaldi ASR Toolkit
Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2
A speech recognition plugin for Unreal Engine 5. This is essentially a port of Pocketsphinx, to be used within an Unreal Engine project.
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
Whisper Is Library for transcribe sound wav AKA Speech To Text Or Extract Text From Audio
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
Continuous Dictation Speech Recognition and Speech Synthesis in Win32
VTuber application which only requires your voice and microphone, no need for a webcam or other tracking nonsense.
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."