VOXD is a speech-to-text, voice-typing, dictation software for linux distributions. It is an open-source, free of charge, USER-FRIENDLY software, for as many linux distros as possible.
-
Updated
Oct 22, 2025 - Python
VOXD is a speech-to-text, voice-typing, dictation software for linux distributions. It is an open-source, free of charge, USER-FRIENDLY software, for as many linux distros as possible.
🎙️ AI-powered Telegram bot for voice-to-text transcription using OpenAI Whisper. CPU-only, no GPU required, privacy-focused with local processing.
Automatically generate accurate, per-word video captions with timestamps using Whisper ASR and FFmpeg, perfect for YouTube, social media, and accessibility.
a python script that can auto generate subtitle in YouTube Videos
One-command audio transcription from any video platform Transform video URLs into text transcripts instantly with automatic audio download, AI transcription, and clipboard integration. Perfect for content creators, researchers, students, and anyone who needs quick video-to-text conversion.
🎬 AI-powered subtitle generator using OpenAI Whisper. Multi-language support, batch processing, GPU acceleration. Generate SRT/WebVTT subtitles instantly!
"An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps — saving hours of time and boosting productivity."
AI-powered assistant that automatically joins Google Meet sessions, transcribes conversations in real-time, and cleans transcripts using OpenAI, Google Gemini, and Forefront APIs.
🤖 Trascrizione automatica di documenti storici italiani con AI. Utilizza Google Gemini per digitalizzare certificati di decesso del 1800, estraendo dati strutturati ed esportandoli in Excel per ricerche genealogiche e storiche.
A Python application that converts MP4 videos to MP3 audio and transcribes the audio to text using OpenAI's Whisper API. Features a modern, user-friendly interface built with ttkbootstrap.
🎤 Enable real-time audio transcription and translation using advanced AI tools for seamless communication across languages.
Add a description, image, and links to the ai-transcription topic page so that developers can more easily learn about it.
To associate your repository with the ai-transcription topic, visit your repo's landing page and select "manage topics."