A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)
-
Updated
Jul 11, 2024 - Python
A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)
GPT powered rubber duck debugger as CS50 2023 final project.
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Turn any LLM into Jarvis
An interactive voice-based chatbot with a visual avatar that runs locally (no internet needed)
Conversational speech chatbot utilizing OpenAI's GPTs and Microsoft Azure's Speech Services
CtrlSpeak is a voice assistant activated with [Control]+Q, listening and responding only when you want.
An advanced speech-to-speech (S2S) voice assistant utilizing OpenAI’s Realtime API for ultra-low-latency, two-way audio streaming, real-time natural language understanding, and responsive, interactive dialogue through direct WebSocket communication.
A lite tool to quickly customize LLM chatbot workflow pipelines, like Text-to-Text, Text-to-Speech or Speech-to-Speech
Translation from one language to another without speech intermediate
Audio-to-Audio using microsoft/speecht5_vc from HuggingFace
Small Assistant IA like Amazon Echo or Siri (not usable)
A speech-to-speech real-time translation bot for Discord
3-month project on artificial intelligence in teams of 3 with Manon Duboscq and Léa Mariot
Systems submitted to IWSLT 2022 by the MT-UPC group.
Speech to Speech Translation Python
A flask web-page hosting a speech to speech translation demo
Tool to generate English AI Dubbing for a YouTube video
Add a description, image, and links to the speech-to-speech topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-speech topic, visit your repo's landing page and select "manage topics."