Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
-
Updated
Nov 5, 2024 - Python
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Whisper.cpp Speech-to-text with Voice Acticity Detection
PKGBUILD generation for whisper.cpp models
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
Record your global audio and transcribe with whisper.cpp and llama.cpp
An experiment on getting sentiment analysis using whisper
youtube-transcriber using whisper and yt-dlp
Offline srt producer gui with whisper.cpp
SummaryTube is a project designed to download YouTube videos, extract text using `whisper.cpp` (which requires less VRAM than importing Whisper in Python and supports Apple Metal), and then utilize the OpenAI API to summarize the entire video and generate bulleted points.
Python command line utility wrappers for Whispercpp and other speech-to-text utilities
Voice-to-text widget that allows to input transcribed speech into any text field on the screen. Script controlled by chosen function button. Runs on Whisper model rewritten in C++.
whisper.cpp bindings for python
Transcribes videos and describes them with OpenAI APIs or local models.
A Maubot to transcribe audio messages using local open-source libraries
God-GPT: a PoC of a godlike autonomous agent that leverages the Dalee-2 and whisper.cpp
Benchmarks + matplotlib visualizations for OpenAI Whisper Experiments
a web application to covert movie file to transcript text by whisper.cpp and gradio
Add a description, image, and links to the whisper-cpp topic page so that developers can more easily learn about it.
To associate your repository with the whisper-cpp topic, visit your repo's landing page and select "manage topics."