Arseniy Gorin gorinars

🛠️

ML Researcher: Audio and Speech

37 followers · 30 following

Lists (1)

🔮 Future ideas

1 repository

Stars

BUTSpeechFIT / DiCoW

Python 18 1 Updated Jan 10, 2025

OpenBMB / UltraEval-Audio

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 27 Updated Jan 24, 2025

espeak-ng / espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.

C 4,541 942 Updated Jan 20, 2025

thewh1teagle / kokoro-onnx

TTS with kokoro and onnx runtime

Python 1,327 111 Updated Jan 25, 2025

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

251 16 Updated Nov 28, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,444 4,440 Updated Jan 18, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,277 23,223 Updated Jan 28, 2025

vocodedev / vocode-core

🤖 Build voice-based LLM agents. Modular + open source.

Python 3,103 515 Updated Nov 15, 2024

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 3,272 209 Updated Jan 22, 2025

Azure-Samples / aoai-realtime-audio-sdk

Azure OpenAI code resources for using gpt-4o-realtime capabilities.

TypeScript 746 144 Updated Jan 22, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

13,679 876 Updated Jan 28, 2025

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,012 147 Updated Jan 21, 2025

openai / openai-realtime-console

React app for inspecting, building and debugging with the Realtime API

JavaScript 2,819 1,018 Updated Jan 2, 2025

chonkie-ai / autotiktokenizer

🧰 The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! ✨

Python 35 3 Updated Jan 3, 2025

VITA-MLLM / Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 263 16 Updated Jan 2, 2025

openai / openai-realtime-api-beta

Node.js + JavaScript reference client for the Realtime API (beta)

JavaScript 846 238 Updated Nov 7, 2024

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 4,461 491 Updated Jan 27, 2025

kyutai-labs / moshi

Python 7,300 573 Updated Jan 27, 2025

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

862 56 Updated Jan 17, 2025

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 21,464 1,627 Updated Jan 22, 2025

michaelhodel / arc-dsl

Domain Specific Language for the Abstraction and Reasoning Corpus

Python 233 48 Updated Oct 11, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,237 1,233 Updated Jan 28, 2025

revdotcom / reverb

Open source inference code for Rev's model

Python 364 25 Updated Jan 17, 2025

deepseek-ai / ESFT

Expert Specialized Fine-Tuning

Python 206 82 Updated Sep 22, 2024

deepseek-ai / DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 1,376 210 Updated Apr 15, 2024

mxbi / arckit

Tools for working with the Abstraction & Reasoning Corpus

Python 178 23 Updated Aug 8, 2024

top-quarks / ARC-solution

Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge

C++ 152 27 Updated Jun 8, 2020

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,951 509 Updated Dec 10, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,198 4,623 Updated Aug 16, 2024

fchollet / ARC-AGI

The Abstraction and Reasoning Corpus

JavaScript 4,181 636 Updated Aug 4, 2024
