lucasgris

Lucas Gris lucasgris

Interest in audio processing and deep learning.

51 followers · 73 following

@lucasrafagris

Achievements

Organizations

Lists (1)

Sort

Music generation

3 repositories

Starred repositories

falabrasil / speech-datasets

🗣️🇧🇷 Bases de áudio transcrito em Português Brasileiro

Shell 54 8 Updated Mar 30, 2023

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,004 969 Updated Jan 26, 2025

AudioLLMs / Awesome-Audio-LLM

Audio Large Language Models

Python 328 20 Updated Jan 15, 2025

shaopengw / Awesome-Music-Generation

Awesome music generation model——MG²

Python 131 11 Updated Jan 21, 2025

lucidrains / musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,216 260 Updated Sep 6, 2023

haoheliu / AudioLDM2

Text-to-Audio/Music Generation

Python 2,361 184 Updated Sep 29, 2024

edwko / OuteTTS

Interface for OuteTTS models.

Python 897 75 Updated Jan 20, 2025

lhl / voicechat2

Local SRT/LLM/TTS Voicechat

Python 600 66 Updated Oct 12, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,207 1,228 Updated Jan 22, 2025

kyegomez / LiqudNet

Implementation of Liquid Nets in Pytorch

Python 55 8 Updated Jan 20, 2025

microsoft / generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 68,569 35,380 Updated Jan 24, 2025

keonlee9420 / Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 324 41 Updated Sep 24, 2022

kyutai-labs / moshi

Python 7,276 572 Updated Jan 24, 2025

AI-Guru / helibrunna

A HuggingFace compatible Small Language Model trainer.

Python 74 7 Updated Oct 16, 2024

NoSavedDATA / NoSavedKaleidoscope

Cuda 5 1 Updated Nov 22, 2024

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,691 395 Updated Dec 4, 2024

nii-yamagishilab / ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 138 9 Updated Mar 6, 2024

lucasgris / e2e-agt

End-to-end Automatic Guitar Transcription

Python 7 2 Updated Oct 3, 2024

jishengpeng / WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 927 61 Updated Jan 2, 2025

bfs18 / e2_tts

Python 66 8 Updated Sep 3, 2024

openaudiolab / LLaST

LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models

Python 23 1 Updated Aug 11, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 411 38 Updated Dec 26, 2024

ddlBoJack / emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 719 55 Updated Dec 23, 2024

VinAIResearch / XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 314 39 Updated Jul 22, 2024

hexdump0815 / imagebuilder

velvet os - simple script framework to build ubuntu 22.04 lts jammy (in older versions also 20.04 lts focal) and debian 12 bookworm (in older versions also 11 bullseye) bootable usb / sd card image…

Shell 326 48 Updated Jan 22, 2025