speech-to-speech

Here are 25 public repositories matching this topic...

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

speech-to-text speech-to-speech large-language-models multimodal-large-language-models speech-language-model speech-interaction

Updated Sep 24, 2024
Python

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance

text-to-speech ai voice speech pytorch tts rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Nov 4, 2024
Python

opendilab / CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. A streaming, full-duplex voice interaction prototype agent implemented in just one file!

python machine-learning streaming ai speech-synthesis speech-recognition speech-to-speech gpt-4o

Updated Nov 5, 2024
Python

MooreThreads / MooER

MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction models along with training and inference code, covering but not limited to end-to-end speech interaction, end-to-end speech translation and speech recognition.

speech-recognition speech-to-text speech-translation speech-to-speech large-language-models chatgpt gpt-4o speech-interaction

Updated Nov 5, 2024
Python

ictnlp / DASpeech

Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

machine-translation speech-translation speech-to-speech speech-to-speech-translation

Updated Jul 22, 2024
Python

jofizcd / Soul-of-Waifu

If you've ever had the wish to talk to your AI Waifu using quality characters and voices for character voicing, then I suggest Soul of Waifu. Don't miss the opportunity to touch your dream!

text-to-speech ai chatbot artificial-intelligence tts speech-to-text waifu stt aichatbot aigirl speech-to-speech characterai aigirlfriend aiwaifu

Updated Aug 3, 2024
Python

liamdugan / speech-to-speech

Code for the INTERSPEECH 2023 paper "Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models"

speech speech-processing speech-translation speech-to-speech simultaneous-translation

Updated Sep 1, 2023
Python

lugia19 / Echo-XI

Speech to text to speech using Elevenlabs

python voice speech tts speech-synthesis speech-recognition speech-to-text speech-to-speech elevenlabs

Updated Jul 2, 2023
Python

winedarkmoon / ElevenGUI

A user-friendly interface for ElevenLabs' API with added audio transcription capability.

python gui tts openai speech-to-text transcription speech-to-speech whisper-ai elevenlabs

Updated Jun 20, 2023
Python

mt-upc / iwslt-2022

Systems submitted to IWSLT 2022 by the MT-UPC group.

translation adapters pretrained-models fine-tuning speech-translation speech-to-speech

Updated May 18, 2022
Python

PedroDKE / LibriS2S

Speech-to-Speech translation dataset for German and English (text and speech quadruplets).

translation dataset librivox speech-to-speech librivoxdeen libris2s

Updated Jun 27, 2023
Python

MinhxThanh / CtrlSpeak

CtrlSpeak is a voice assistant activated with [Control]+Q, listening and responding only when you want.

api chat ai voice-assistant speech-to-speech groq-api

Updated Sep 28, 2024
Python

sxinyuhoo / speech-to-speech-pipeline

A lite tool to quickly customize LLM chatbot workflow pipelines, like Text-to-Text, Text-to-Speech or Speech-to-Speech

text-to-speech speech-to-speech llm-framework lite-pipeline

Updated Sep 10, 2024
Python

Rutts07 / Speech2Speech-DigitsRecognition

This repository contains the code for a speech to speech translation system created from scratch for digits translation from English to Tamil

cnn pytorch gru rnn mfcc ctc seq2seq-attn speech-to-speech

Updated Jun 10, 2023
Python

HugoSvy / AssistantIA-Speech-to-Speech

3-month project on artificial intelligence in teams of 3 with Manon Duboscq and Léa Mariot

text-to-speech ai speech-to-text ia speech-to-speech

Updated Apr 25, 2024
Python

johnsutor / llama-jarvis

Turn any LLM into Jarvis

transformers transformer llama speech-to-speech llm seamlessm4t

Updated Oct 6, 2024
Python

Aadit3003 / s2st-cascading-e2e

A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)

nlp text-to-speech translation meteor comet speech-to-text speech-processing asr bleu-score speech-to-speech cascaded-speech-translation end-to-end-speech-translation speech-to-speech-translation

Updated Jul 11, 2024
Python

gutbash / gpt-s2s

Conversational speech chatbot utilizing OpenAI's GPTs and Microsoft Azure's Speech Services

azure chatbot speech-recognition openai microsoft-azure azure-speech-service speech-to-speech

Updated Sep 21, 2023
Python

langchain-tech / openai-realtime-api-demo

An advanced speech-to-speech (S2S) voice assistant utilizing OpenAI’s Realtime API for ultra-low-latency, two-way audio streaming, real-time natural language understanding, and responsive, interactive dialogue through direct WebSocket communication.

python pyaudio websockets poetry openai wave realtime-api speech-to-speech

Updated Nov 4, 2024
Python

bykemalh / S2ST

Speech to Speech Translation Python

text-to-speech speech speech-to-speech speech-to-speech-translation

Updated Jun 26, 2024
Python

