speech-to-text

Star

Here are 63 public repositories matching this topic...

ggml-org / whisper.cpp

Star

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Apr 8, 2025
C++

mozilla / DeepSpeech

Star

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

Updated Sep 3, 2024
C++

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 11 programming languages

Updated Apr 8, 2025
C++

coqui-ai / STT

Star

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

deep-learning tensorflow voice-recognition speech-recognition automatic-speech-recognition speech-to-text stt asr speech-recognizer speech-recognition-api

Updated Mar 11, 2024
C++

srvk / eesen

Star

The official repository of the Eesen project

tensorflow speech-recognition speech-to-text kaldi asr ctc ctc-loss

Updated May 23, 2019
C++

mkiol / dsnote

Star

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

text-to-speech translator translation offline machine-translation sailfishos tts speech-synthesis speech-recognition speech-to-text nmt linux-desktop stt asr flatpak-applications

Updated Apr 7, 2025
C++

locaal-ai / obs-localvocal

Star

OBS plugin for local speech recognition and captioning using AI

plugin translation ai livestream live-streaming speech-recognition speech-to-text obs transcription obs-studio whisper realtime-translator obs-studio-plugin realtime-transcribe openai-whisper whisper-cpp real-time-transcription

Updated Feb 6, 2025
C++

azkadev / whisper

Sponsor

Star

Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models

Updated Feb 10, 2025
C++

gtreshchev / RuntimeSpeechRecognizer

Star

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

voice-recognition speech-recognition openai unreal-engine ue4 speech-to-text whisper speech-processing audio-processing unreal-engine-4 ue4-plugin speech-detection whis ue5 unreal-engine-5 ue5-plugin whisper-cpp whisper-ai

Updated Feb 23, 2025
C++

lucoiso / UEAzSpeech

Sponsor

Star

This plugin integrates Azure Speech Cognitive Services in Unreal Engine.

text-to-speech azure speech unreal tts speech-synthesis unrealengine speech-recognition unreal-engine ue4 speech-to-text azure-cognitive-services unreal-engine-4 unreal-engine-plugin azure-cognitive-service ue5 unreal-engine-5

Updated May 27, 2024
C++

locaal-ai / obs-cleanstream

Star

CleanStream is an OBS plugin that uses AI to clean live audio streams from unwanted words and utterances

plugin ai speech-to-text obs transcription profanity-detection obs-studio whisper profanity-filter real-time-filter obs-studio-plugin realtime-detection obs-plugin profanity-filtering profanity-blocking realtime-transcribe real-time-transcription

Updated Dec 19, 2024
C++

Azure-Samples / Cognitive-Services-Voice-Assistant

Star

Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command based Voice Assistant to your own Azure subscription

microsoft bot sdk bots wpf bot-framework voice-commands microsoft-bot-framework speech-recognition dotnet-core microsoft-cognitive-services speech-to-text voice-control voice-assistant botframework voice-synthesis

Updated Oct 4, 2023
C++

skit-ai / kaldi-serve

Star

Server framework for Kaldi ASR Toolkit

grpc speech-recognition speech-to-text kaldi asr grpc-server kaldi-asr kaldi-server

Updated Sep 17, 2023
C++

mgonzs13 / whisper_ros

Star

Speech-to-Text based on SileroVAD + whisper.cpp (GGML Whisper) for ROS 2

speech-recognition vad speech-to-text ros2 voice-activity-detection whisper-cpp ggml

Updated Apr 5, 2025
C++

ukustra / sphinx-ue4

Star

A speech recognition plugin for Unreal Engine 5. This is essentially a port of Pocketsphinx, to be used within an Unreal Engine project.

plugin sphinx speech-recognition unreal-engine ue4 speech-to-text pocketsphinx unreal-engine-4 unrealengine4 ue5 unreal-engine-5 unrealengine5

Updated Nov 16, 2024
C++

charlesliucn / LanMIT

Star

📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.

language-modeling speech-recognition speech-to-text keyword-spotting kaldi-asr low-resource-languages

Updated Jul 12, 2019
C++

General-Developer / whisper_library

Star

Whisper Is Library for transcribe sound wav AKA Speech To Text Or Extract Text From Audio

dart machine-learning ai ml artificial-intelligence openai translate speech-to-text flutter whisper transcribe ggml

Updated Apr 8, 2025
C++

ErcinDedeoglu / WhisperDock

Star

Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.

api docker machine-learning speech-to-text audio-transcription whisper-cpp

Updated Apr 6, 2025
C++

WindowsNT / SpeechRec

Star

Continuous Dictation Speech Recognition and Speech Synthesis in Win32

text-to-speech cplusplus uwp speech-synthesis speech-recognition speech-to-text win32

Updated May 28, 2020
C++

team-pp-studio / VoiceTuber

Star

VTuber application which only requires your voice and microphone, no need for a webcam or other tracking nonsense.

open-source opensource real-time opengl cplusplus realtime speech-to-text 2d visemes phoneme vtuber speach-recognition pngtuber

Updated Sep 29, 2024
C++

Improve this page

Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-to-text

Here are 63 public repositories matching this topic...

ggml-org / whisper.cpp

mozilla / DeepSpeech

k2-fsa / sherpa-onnx

coqui-ai / STT

srvk / eesen

mkiol / dsnote

locaal-ai / obs-localvocal

azkadev / whisper

gtreshchev / RuntimeSpeechRecognizer

lucoiso / UEAzSpeech

locaal-ai / obs-cleanstream

Azure-Samples / Cognitive-Services-Voice-Assistant

skit-ai / kaldi-serve

mgonzs13 / whisper_ros

ukustra / sphinx-ue4

charlesliucn / LanMIT

General-Developer / whisper_library

ErcinDedeoglu / WhisperDock

WindowsNT / SpeechRec

team-pp-studio / VoiceTuber

Improve this page

Add this topic to your repo