Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,201 1,227 Updated Jan 22, 2025

Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 21,235 1,494 Updated Jan 23, 2025

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs

Python 11,213 2,368 Updated Nov 26, 2024

Intelligent (edge and LLM) proxy for agents. Designed with fast ⚡️ LLMs for task routing, rich observability, and the seamless integration of prompts with your APIs for agentic tasks. Built by the …

Rust 1,443 66 Updated Jan 25, 2025

Local realtime voice AI

Python 2,178 118 Updated Jan 22, 2025

Information Retrieval from Audio via Knowledge Graph

Python 87 7 Updated Aug 18, 2024
Python 59 8 Updated Jan 26, 2025

Build a neural network from scratch without using an ML framework

Jupyter Notebook 18 4 Updated Jul 16, 2024

Model for MDX23 music separation contest

Python 681 95 Updated Jun 24, 2024

Noise suppression using deep filtering

Python 2,687 249 Updated Oct 17, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,269 4,427 Updated Jan 18, 2025

A family of diffusion models for text-to-audio generation.

Python 1,136 94 Updated Dec 31, 2024

The official Python library for the OpenAI API

Python 24,187 3,462 Updated Jan 24, 2025

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,166 587 Updated Apr 16, 2024

Rembg is a tool to remove image backgrounds

Python 17,792 1,932 Updated Jan 19, 2025

Let us control diffusion models!

Python 31,295 2,802 Updated Feb 25, 2024