Skip to content
View gorinars's full-sized avatar
🛠️
🛠️

Block or report gorinars

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 18 1 Updated Jan 10, 2025

An easy-to-use, fast, and easily integrable tool for evaluating audio LLM

Python 27 Updated Jan 24, 2025

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

C 4,541 942 Updated Jan 20, 2025

TTS with kokoro and onnx runtime

Python 1,327 111 Updated Jan 25, 2025

A Survey of Spoken Dialogue Models (60 pages)

251 16 Updated Nov 28, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,444 4,440 Updated Jan 18, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,277 23,223 Updated Jan 28, 2025

🤖 Build voice-based LLM agents. Modular + open source.

Python 3,103 515 Updated Nov 15, 2024

A fast multimodal LLM for real-time voice

Python 3,272 209 Updated Jan 22, 2025

Azure OpenAI code resources for using gpt-4o-realtime capabilities.

TypeScript 746 144 Updated Jan 22, 2025

✨✨Latest Advances on Multimodal Large Language Models

13,679 876 Updated Jan 28, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,012 147 Updated Jan 21, 2025

React app for inspecting, building and debugging with the Realtime API

JavaScript 2,819 1,018 Updated Jan 2, 2025

🧰 The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! ✨

Python 35 3 Updated Jan 3, 2025

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 263 16 Updated Jan 2, 2025

Node.js + JavaScript reference client for the Realtime API (beta)

JavaScript 846 238 Updated Nov 7, 2024

Open Source framework for voice and multimodal conversational AI

Python 4,461 491 Updated Jan 27, 2025
Python 7,300 573 Updated Jan 27, 2025

Awesome speech/audio LLMs, representation learning, and codec models

862 56 Updated Jan 17, 2025

DSPy: The framework for programming—not prompting—language models

Python 21,464 1,627 Updated Jan 22, 2025

Domain Specific Language for the Abstraction and Reasoning Corpus

Python 233 48 Updated Oct 11, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,237 1,233 Updated Jan 28, 2025

Open source inference code for Rev's model

Python 364 25 Updated Jan 17, 2025

Expert Specialized Fine-Tuning

Python 206 82 Updated Sep 22, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Python 1,376 210 Updated Apr 15, 2024

Tools for working with the Abstraction & Reasoning Corpus

Python 178 23 Updated Aug 8, 2024

Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge

C++ 152 27 Updated Jun 8, 2020

Inference and training library for high-quality TTS models.

Python 4,951 509 Updated Dec 10, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,198 4,623 Updated Aug 16, 2024

The Abstraction and Reasoning Corpus

JavaScript 4,181 636 Updated Aug 4, 2024
Next