An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
A convolutional neural network trained to classify emotions in singing voices.
Graduation project submitted to the teaching staff of the Electronics and Computing Engineering undergraduate course at the Polytechnic School of the Federal University of Rio de Janeiro as one of the requirements for obtaining the Electronics and Computing Engineer degree.
Convert a Scratch project file to MusicXML for Sinsy (Singing Voice Synthesis)
Multispeaker Community Vocoder Model for DiffSinger
✨ UTAU DEBUG ENGINE | UTAU error-checking engine
Notebook built for Google Colaboratory for fine-tuning the HiFiPLN vocoder
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)
toki pona CVVC reclist for UTAU and other singing synthesizers with CVVC support
Neural network-based singing voice synthesis library for research
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
PyTorch implementation of JDCNet, a singing voice detection and classification network
The code for the MaD TwinNet. Demo page:
Revisiting Singing Voice Detection: a Quantitative Review and the Future Outlook
lessampler is a Singing Voice Synthesizer
A paper and project list about cutting-edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR, etc.).
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…
Official implementation of SawSing (ISMIR'22)