RoboML is an aggregator package written for prototyping and deploying open source ML models for robotics use cases
Updated Nov 24, 2025 - Python
This is a speech_to_text script by Pranay Joshi
A web-based platform that lets you control your DJI Tello drone with speech input and gestures over UDP. Uses computer vision, networking, MediaPipe, and ANNs.
A speech-to-text and text-to-speech client for the Program-Y chatbot framework
Description: 'CarSoul-AI' is a Python program blending vehicle diagnostics with integrated ChatGPT. Using voice commands, access real-time car data, clear error codes, or talk to a GPT-4-powered AI for solutions.
Uses IBM Watson services to convert real-time speech input to text
Source code for "ChatGPT at your service" from the "ImagineAndMake" series
Take a look at this simple program; you can do all kinds of things with it.
Demo for automated speech to text transcription, using Google services
This repo provides speech-to-text with a customizable dictionary saved offline.
Reception Automation automates the processes and tasks handled at the front desk of any organisation (here, a college campus). Tasks include directing visitors by showing them routes, notifying staff about schedules and visitor appointments, and answering FAQs.
In this project, I built a personalized voice assistant that automates daily tasks I usually do on my laptop, such as opening Chrome, YouTube, and other apps.
A few lines of code which convert speech to text.
Lucifer is a powerful, offline voice assistant built specifically for Windows 10. It brings deep system-level automation, real-time voice command processing, and a fully customizable HTML-based clock utility—solving limitations of the native Windows clock.
Audio transcription with Azure Speech and insight extraction with OpenAI
An app written in Python. A project by a team of 5.
Telegram bot for converting voice messages and audio files to text using the OpenAI Whisper model.
Self-supervised multimodal ASR system with noise context.