unilight

Wen-Chin Huang (unilight)

Assistant professor at Nagoya University, Japan.

131 followers · 5 following

Nagoya University
Nagoya, Japan
https://unilight.github.io/
@unilightwf
https://scholar.google.com/citations?user=g71mJO4AAAAJ

Stars

sarulab-speech / msr-utmos

sampling frequency independent convolution for MOS prediction

Python 6 2 Updated Jul 22, 2025

carlosfranzreb / spane

Evaluation framework for speech anonymizers

Python 6 1 Updated Sep 9, 2025

carlosfranzreb / private_knnvc

Interpretable anonymizer based on kNN-VC

Jupyter Notebook 6 3 Updated Jul 28, 2025

alessandroragano / scoreq

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 89 6 Updated Aug 1, 2025

huggingface / dataspeech

Python 377 59 Updated Sep 3, 2024

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,816 184 Updated May 15, 2025

Stability-AI / stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 243 20 Updated Jun 17, 2025

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 614 46 Updated Apr 8, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 10,490 1,115 Updated Apr 9, 2025

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,101 140 Updated Sep 5, 2024

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 217 20 Updated Apr 20, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,787 340 Updated Jan 4, 2024

NKU-HLT / MusicEval-baseline

Python 10 2 Updated Apr 18, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,229 1,926 Updated Sep 13, 2025

i-need-sleep / mad

Python 9 1 Updated May 14, 2025

warisqr007 / ppg2ppg

Zero-Shot Foreign Accent Conversion without a Native Reference

Python 34 7 Updated May 1, 2024

BakerBunker / SALT

[ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation

Python 20 2 Updated Aug 13, 2024

lesterphillip / serenade

A Singing Style Conversion Framework Based On Audio Infilling

Python 26 3 Updated Apr 28, 2025

ajd12342 / paraspeechcaps

Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'

Python 135 6 Updated Mar 24, 2025

sarulab-speech / ml-audiocaps

Multi-lingual AudioCaps

11 Updated Nov 20, 2023

facebookresearch / audiobox-aesthetics

Unified automatic quality assessment for speech, music, and sound.

Python 597 39 Updated Jun 5, 2025

wenet-e2e / speech-synthesis-paper

List of speech synthesis papers.

1,057 122 Updated Jul 24, 2023

NKU-HLT / RAMP_MOS

Retrieval-Augmented MOS Prediction with Prior Knowledge Integration

Python 29 3 Updated Mar 23, 2025

r9y9 / pyopenjtalk

Python wrapper for OpenJTalk

Cython 231 79 Updated Apr 8, 2025

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,664 1,156 Updated Nov 14, 2024

Voice-Privacy-Challenge / Voice-Privacy-Challenge-2024

Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software

Python 57 11 Updated Jan 30, 2025

sarulab-speech / yodas-transcription

Modified transcriptions of YODAS dataset

4 Updated Oct 26, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 17,750 1,870 Updated Jul 2, 2025

wavlab-speech / versa

Versatile Evaluation of Speech and Audio

Python 321 39 Updated Sep 10, 2025

NTIA / alignnet

Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.

Python 17 Updated Aug 1, 2025
