Organizations

@nilc-nlp @Sales-Holding-Equipe-voz

Starred repositories

🗣️🇧🇷 Transcribed audio datasets in Brazilian Portuguese

Shell 54 8 Updated Mar 30, 2023

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,004 969 Updated Jan 26, 2025

Audio Large Language Models

Python 328 20 Updated Jan 15, 2025

Awesome music generation model: MG²

Python 131 11 Updated Jan 21, 2025

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,216 260 Updated Sep 6, 2023

Text-to-Audio/Music Generation

Python 2,361 184 Updated Sep 29, 2024

Interface for OuteTTS models.

Python 897 75 Updated Jan 20, 2025

Local SRT/LLM/TTS Voicechat

Python 600 66 Updated Oct 12, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,207 1,228 Updated Jan 22, 2025

Implementation of Liquid Nets in Pytorch

Python 55 8 Updated Jan 20, 2025

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 68,569 35,380 Updated Jan 24, 2025

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 324 41 Updated Sep 24, 2022
Python 7,276 572 Updated Jan 24, 2025

A HuggingFace compatible Small Language Model trainer.

Python 74 7 Updated Oct 16, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,691 395 Updated Dec 4, 2024

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 138 9 Updated Mar 6, 2024

End-to-end Automatic Guitar Transcription

Python 7 2 Updated Oct 3, 2024

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 927 61 Updated Jan 2, 2025
Python 66 8 Updated Sep 3, 2024

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models

Python 23 1 Updated Aug 11, 2024

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 411 38 Updated Dec 26, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 719 55 Updated Dec 23, 2024

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 314 39 Updated Jul 22, 2024

velvet os - simple script framework to build ubuntu 22.04 lts jammy (in older versions also 20.04 lts focal) and debian 12 bookworm (in older versions also 11 bullseye) bootable usb / sd card image…

Shell 326 48 Updated Jan 22, 2025

Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"

Python 34 Updated Jun 6, 2024

A generative speech model for daily dialogue.

Python 33,916 3,675 Updated Jan 25, 2025

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,261 121 Updated Jul 11, 2024

Inference and training library for high-quality TTS models.

Python 4,941 509 Updated Dec 10, 2024

Official repo for WavCraft, an AI agent for audio creation and editing

Python 518 96 Updated Sep 13, 2024