Transcribe and translate audio into LRC subtitle files using Whisper and LLMs (GPT, Claude, et al.).
Updated Jul 28, 2025 - Python
On-device speech-to-text engine powered by deep learning
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
📱 🏃 🍎 Fitness application that’s used to keep track of your physical fitness data, daily calorie count, invite friends to work out together and ultimately get healthy.
VOXD is speech-to-text, voice-typing, and dictation software for Linux distributions. It is open-source, free of charge, and user-friendly, and aims to support as many Linux distros as possible.
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Chrome Web Speech API
Privacy-First Voice-to-Text with AI Enhancement for macOS
🎬 KaKa Subtitle Assistant | VideoCaptioner - English Branch - An intelligent subtitle assistant based on LLMs and Faster Whisper, with one-click, high-speed video and subtitle muxing. No discrete GPU needed. Subtitle generation, sentence breaking, and proofreading, all in one. Make subtitles with ease.
ChatGPT Voice Chatbot Telegram is a Python and Flask-based GitHub repository that enables users to communicate with an AI chatbot using voice-to-text and text-to-voice technologies powered by OpenAI. The repository provides a flexible and customizable solution for building advanced voice-enabled chatbots using natural language processing.
This package can be used to connect Telegram bot to AI engines such as OpenAI ChatGPT, Dall-E, Midjourney, Stable Diffusion, etc.
Codo-File is a code editor that primarily supports JavaScript and Python, with partial Dart support. It also features a real-time website editor for building your own website in the browser using HTML, CSS, and JavaScript. The project includes image-to-text and voice-to-text features as well.
A Chrome extension that provides seamless voice-to-text on any web page, using an advanced ASR API.
Kotlin Multiplatform Mobile Translator App
Free ChatGPT voice interaction and integration into Python workflows.
A simple iOS app that converts speech into text. Only English is supported for now. Built with Swift 5, AVKit, and Speech.
Telegram bot with ASR
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files
Use ChatGPT in your own voice to place a phone call on your behalf, just by prompting it.
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES