speechtotext

kaloprojects / KALO-ESP32-Voice-Assistant

Code snippets showing how to record I2S audio and store it as a .wav file on an ESP32 with an SD card, how to transcribe pre-recorded audio via the Deepgram SpeechToText API, and how to generate audio from text via the TextToSpeech API from OpenAI and/or Google TTS. Also covers triggering ESP32 actions via voice.

audio esp32 tts recording sd-card texttospeech stt speechtotext google-tts i2s i2s-audio i2s-microphone is2-audio inmp441 deepgram openai-tts max98357 deepgram-stt

Updated Aug 25, 2024
C++
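
The ESP32 entry above mentions storing recorded I2S audio as a .wav file. The repository's own code is not reproduced on this page, so the following is only a minimal sketch of that one step, written in Python rather than the project's C++. It builds the standard 44-byte header for uncompressed PCM audio. The function name `wav_header` and the defaults (16 kHz, 16-bit, mono) are assumptions chosen for illustration, not values taken from the project.

```python
import struct


def wav_header(num_samples: int, sample_rate: int = 16000,
               bits_per_sample: int = 16, channels: int = 1) -> bytes:
    """Build the 44-byte RIFF/WAVE header for uncompressed PCM audio.

    num_samples is the number of sample frames (one frame = one sample
    per channel). All defaults are illustrative assumptions.
    """
    block_align = channels * bits_per_sample // 8   # bytes per sample frame
    byte_rate = sample_rate * block_align           # bytes per second
    data_size = num_samples * block_align           # bytes of raw PCM that follow
    return struct.pack(
        "<4sI4s4sIHHIIHH4sI",
        b"RIFF", 36 + data_size, b"WAVE",           # RIFF chunk descriptor
        b"fmt ", 16, 1, channels, sample_rate,      # fmt chunk, format 1 = PCM
        byte_rate, block_align, bits_per_sample,
        b"data", data_size,                         # data chunk header
    )


# Usage: write the header, then the raw little-endian PCM bytes.
# with open("recording.wav", "wb") as f:
#     f.write(wav_header(len(pcm_bytes) // 2))
#     f.write(pcm_bytes)
```

A recorder that does not know the final length in advance would typically write a placeholder header first and patch the two size fields once recording stops.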

KrunalGupta02 / Dictionary-App

Practice TypeScript project

typescript speechtotext tailwindcss next-themes reactquery next-fonts nextjs14

Updated Jul 28, 2024
TypeScript

mrzaizai2k / stock_price_4_fun

Mrzaizai2k Stock Assistant Bot: Your all-in-one stock analysis companion. Calculate payback time, find support/resistance, and receive market warnings.

finance ci-cd investing stock-market openai whisper stock-analysis pytelegrambotapi speechtotext selenium-python todolist-app newssummary llms microsofttodo

Updated Jul 22, 2024
Jupyter Notebook

VibhinnS / Doraemon

A web-based platform that lets you control your DJI Tello drone via speech inputs and gestures. Utilises computer vision, networking, MediaPipe, and ANNs.

opencv machine-learning tensorflow machine artificial-neural-networks speechtotext djitello mediapipe vosk-models

Updated Jun 22, 2024
Python

YashPimpalkar / ModernDictionary

This is Modern Dictionary, built with Django and web scraping. It gives the meaning, synonyms, antonyms, and an example of a searched word. It also has a chatbot, JavaScript games, and blogs with the Google translator API.

javascript django python3 materializecss texttospeech speechtotext vercel-deployment aibot

Updated Jun 14, 2024
HTML

ArkS0001 / IIT-Bombay-Whisper-Hindi-ASR-Model-Machine-Learning-Intern

Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language. Moreover, it enables transcription in multiple languages, as

multilingual voice-recognition openai whisper nlp-machine-learning speechtotext multilingual-nlp llm openai-whisper

Updated Apr 29, 2024
Jupyter Notebook

olololoe110399 / mikasa_gpt

🚀 MiksaGPT, part of the 'Miksa' project, is a groundbreaking voice assistant utilizing Claude 3 and APIs from 'anthropic' and 'elevenlabs'. It enables real-time Opus two-way voice chat with seamless interruptibility, built with Flutter and available for free on GitHub.

opensource openai opus whisper claude speechtotext artificialintelligence opensourceai aivoice elevenlabs claudeai flutterprojects flutterprogramming flutterai

Updated Apr 3, 2024
Dart

semanticsci / whispering

Speech to Text Using OpenAI Whisper. Keep whispering

openai whisper speechtotext whisper-ai

Updated Feb 29, 2024
Python

aswinap13 / MeetShort

Automatic Meeting Summarizer - Provides speaker-wise abstractive and extractive summaries of meetings, given the audio of a meeting recording.

machine-learning-algorithms python3 summarization extractive-summarization speechtotext abstractive-summarization

Updated Feb 22, 2024
JavaScript

microsoft / glue

GLUE is a lightweight, Python-based collection of scripts that helps you succeed with speech and text use cases based on Microsoft Azure Cognitive Services.

nlp azure luis cognitive-services texttospeech speechtotext dstoolkit

Updated Jun 18, 2024
Jupyter Notebook

tauseeqq / Carsoulai

'CarSoul-AI' is a Python program blending vehicle diagnostics with integrated ChatGPT. Using voice commands, access real-time car data, clear error codes, or talk to a GPT-4-powered AI for solutions.

api ai python3 dynamic-programming dtc speechtotext apiint gttts

Updated Oct 6, 2023
Python

development-community / Integrate-GPT-on-App

Source code of "ChatGPT at your service" in the "ImagineAndMake" series

python speech-recognition question-answering speechtotext chatgpt imagineandmake

Updated Dec 15, 2023
Python

Ashot72 / Speech-to-Text-to-Image

Generates text from your voice, then images from the text

speech-to-text text-to-image whisper speechtotext replicate texttoimage large-language-models llm chatgpt stability-ai whisper-ai

Updated May 28, 2023
JavaScript

ThisIsNSH / Android-CatchMusic

This project displays a song's full metadata and lyrics from just a few words of the song. Users also see metadata for other songs in the album and other songs by the same singer, and can listen to the song directly on YouTube. Not all of the data shown is freely available, but it is fetched for free by parsing HTML pages in Android.

android google-cloud google-api musixmatch-api speechtotext

Updated May 14, 2023
Java

thisisyashgarg / ease-it

EaseIt is an AI-powered tool that can write personalised messages, debug code in any language, make songs in any artist's style, make your workout plan, and do anything else you can think of.

nodejs handlebars expressjs openai javscript speechtotext tailwindcss

Updated Apr 25, 2023
Handlebars

Siddharth-Latthe-07 / Personal-Voice-Assistant

In this project, I have made a personalized voice assistant that automates the daily tasks I usually do on my laptop, such as opening Chrome, YouTube, and other apps.

keyboard machine-learning speech-recognition speechtotext pywhatkit

Updated Jan 14, 2023
Python

Here are 69 public repositories matching this topic...

devjatinrawat / React-SpeechToText

0xBitBuster / talking-chatgpt-extension

javaidiqbal11 / DocAi_STT

s-r-e-e-r-a-j / SPEECH-TO-TEXT-CONVERTER-APP

kaloprojects / KALO-ESP32-Voice-Assistant

KrunalGupta02 / Dictionary-App

mrzaizai2k / stock_price_4_fun

VibhinnS / Doraemon

YashPimpalkar / ModernDictionary

ArkS0001 / IIT-Bombay-Whisper-Hindi-ASR-Model-Machine-Learning-Intern

olololoe110399 / mikasa_gpt

semanticsci / whispering

aswinap13 / MeetShort

microsoft / glue

tauseeqq / Carsoulai

development-community / Integrate-GPT-on-App

Ashot72 / Speech-to-Text-to-Image

ThisIsNSH / Android-CatchMusic

thisisyashgarg / ease-it

Siddharth-Latthe-07 / Personal-Voice-Assistant
