#

speech-to-speech

Here are 25 public repositories matching this topic...

opendilab / CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体！

python machine-learning streaming ai speech-synthesis speech-recognition speech-to-speech gpt-4o

Updated Nov 5, 2024
Python

MooreThreads / MooER

MooER: Moore-threads Open Omni model for spech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.

speech-recognition speech-to-text speech-translation speech-to-speech large-language-models chatgpt gpt-4o speech-interaction

Updated Nov 5, 2024
Python

Applio

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Nov 4, 2024
Python

langchain-tech / openai-realtime-api-demo

An advanced speech-to-speech (S2S) voice assistant utilizing OpenAI’s Realtime API for ultra-low-latency, two-way audio streaming, real-time natural language understanding, and responsive, interactive dialogue through direct WebSocket communication.

python pyaudio websockets poetry openai wave realtime-api speech-to-speech

Updated Nov 4, 2024
Python

SabaSyed / SpeechAvatarBot

An interactive voice-based chatbot with a visual avatar that runs locally (no internet needed)

text-to-speech chatbot speech-recognition speech-to-text offline-app speech-to-speech llm ollama llama3

Updated Oct 30, 2024
Python

Limerio / speech_recognition

Small Assistant IA like Amazon Echo or Siri (not usable)

python speech-recognition ia speech-to-speech text-to-text

Updated Oct 18, 2024
Python

Asa-Dong / speech-to-speech

Speech To Speech

tts vad asr speech-to-speech ollama

Updated Oct 10, 2024
Python

johnsutor / llama-jarvis

Turn any LLM into Jarvis

transformers transformer llama speech-to-speech llm seamlessm4t

Updated Oct 6, 2024
Python

MinhxThanh / CtrlSpeak

CtrlSpeak is a voice assistant activated with [Control]+Q, listening and responding only when you want.

api chat ai voice-assistant speech-to-speech groq-api

Updated Sep 28, 2024
Python

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

speech-to-text speech-to-speech large-language-models multimodal-large-language-models speech-language-model speech-interaction

Updated Sep 24, 2024
Python

sxinyuhoo / speech-to-speech-pipeline

A lite tool to quickly customize LLM chatbot workflow pipelines, like Text-to-Text, Text-to-Speech or Speech-to-Speech

text-to-speech speech-to-speech llm-framework lite-pipeline

Updated Sep 10, 2024
Python

jofizcd / Soul-of-Waifu

If you've ever had the wish to talk to your AI Waifu using quality characters and voices for character voicing, then I suggest Soul of Waifu. Don't miss the opportunity to touch your dream!

text-to-speech ai chatbot artificial-intelligence tts speech-to-text waifu stt aichatbot aigirl speech-to-speech characterai aigirlfriend aiwaifu

Updated Aug 3, 2024
Python

ictnlp / DASpeech

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

machine-translation speech-translation speech-to-speech speech-to-speech-translation

Updated Jul 22, 2024
Python

Aadit3003 / s2st-cascading-e2e

A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)

nlp text-to-speech translation meteor comet speech-to-text speech-processing asr bleu-score speech-to-speech cascaded-speech-translation end-to-end-speech-translation speech-to-speech-translation

Updated Jul 11, 2024
Python

bykemalh / S2ST

Speech to Speech Translation Python

text-to-speech speech speech-to-speech speech-to-speech-translation

Updated Jun 26, 2024
Python

HugoSvy / AssistantIA-Speech-to-Speech

3-month project on artificial intelligence in teams of 3 with Manon Duboscq and Léa Mariot

text-to-speech ai speech-to-text ia speech-to-speech

Updated Apr 25, 2024
Python

oscaem / debug50

GPT powered rubber duck debugger as CS50 2023 final project.

speech-to-speech llm local-llm

Updated Oct 22, 2023
Python

gutbash / gpt-s2s

Conversational speech chatbot utilizing OpenAI's GPTs and Microsoft Azure's Speech Services

azure chatbot speech-recognition openai microsoft-azure azure-speech-service speech-to-speech

Updated Sep 21, 2023
Python

DawoodTouseef / Speech-to-speech

Audio-to-Audio using microsoft/speecht5_vc from HuggingFace

machine-learning speech speech-to-speech

Updated Sep 2, 2023
Python

liamdugan / speech-to-speech

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-processing speech-translation speech-to-speech simultaneous-translation

Updated Sep 1, 2023
Python

Improve this page

Add a description, image, and links to the speech-to-speech topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-to-speech topic, visit your repo's landing page and select "manage topics."