-
dify Public
Forked from langgenius/difyDify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
TypeScript Other UpdatedJan 14, 2025 -
VI-SVC Public
Forked from PlayVoice/lora-svcvits singing voice conversion based on ppg & hubert;singing voice clone;
Python MIT License UpdatedSep 25, 2022 -
AdaSpeech2-1 Public
Forked from jonathan-hsu123/AdaSpeech2AdaSpeech2 based on https://github.com/rishikksh20/AdaSpeech2
Jupyter Notebook MIT License UpdatedMar 27, 2022 -
Coqui-TTS Public
Forked from Edresson/Coqui-TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
AdaSpeech Public
Forked from rishikksh20/AdaSpeechAdaSpeech: Adaptive Text to Speech for Custom Voice
Jupyter Notebook Apache License 2.0 UpdatedAug 31, 2021 -
AdaSpeech2 Public
Forked from rishikksh20/AdaSpeech2AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
Jupyter Notebook MIT License UpdatedAug 31, 2021 -
kaldi Public
Forked from kaldi-asr/kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Shell Other UpdatedJan 7, 2021 -
VBx Public
Forked from BUTSpeechFIT/VBxVariational Bayes HMM over x-vectors diarization on DIHARD II
Python UpdatedJan 1, 2021 -
-
athena-signal Public
Forked from athena-team/athena-signalC Apache License 2.0 UpdatedSep 17, 2020 -
SpeakerDiarization_RNN_CNN_LSTM Public
Forked from vishalshar/SpeakerDiarization_RNN_CNN_LSTMSpeaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze …
Python UpdatedJul 31, 2020 -
awesome-diarization Public
Forked from wq2012/awesome-diarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Apache License 2.0 UpdatedJun 14, 2020 -
SpectralCluster Public
Forked from wq2012/SpectralClusterPython re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
Python Apache License 2.0 UpdatedMay 31, 2020 -
quickgitclone Public
Forked from Zenquan/githubHelper这是一个帮助快速克隆git仓库+生成、下载网站二维码+选中右击跳转百度的Chrome插件
HTML UpdatedMay 12, 2020 -
SimpleDER Public
Forked from wq2012/SimpleDERA lightweight library to compute Diarization Error Rate (DER).
Python Apache License 2.0 UpdatedMar 30, 2020 -
PyTorch_Speaker_Verification Public
Forked from HarryVolek/PyTorch_Speaker_VerificationPyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 5, 2020 -
pyannote-audio Public
Forked from pyannote/pyannote-audioNeural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Python MIT License UpdatedNov 18, 2019 -
VGG-Speaker-Recognition Public
Forked from WeidiXie/VGG-Speaker-RecognitionUtterance-level Aggregation For Speaker Recognition In The Wild
Python UpdatedNov 9, 2019 -
Speaker-Diarization Public
Forked from taylorlu/Speaker-Diarizationspeaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Python Apache License 2.0 UpdatedOct 18, 2019 -
pykaldi Public
Forked from pykaldi/pykaldiA Python wrapper for Kaldi
Python Apache License 2.0 UpdatedSep 4, 2019 -
uis-rnn Public
Forked from google/uis-rnnThis is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Python Apache License 2.0 UpdatedJul 2, 2019 -
-
v-vector-tf Public
Forked from CSLT-THU/IS2019-VAETensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
Perl Apache License 2.0 UpdatedMay 26, 2019 -
Deep_Speaker-speaker_recognition_system Public
Forked from Walleclipse/Deep_Speaker-speaker_recognition_systemKeras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
Python UpdatedMar 17, 2019 -
deep-speaker Public
Forked from philipperemy/deep-speakerDeep Speaker: an End-to-End Neural Speaker Embedding System https://arxiv.org/pdf/1705.02304.pdf
Python UpdatedJan 30, 2019 -
3D-convolutional-speaker-recognition Public
Forked from astorfi/3D-convolutional-speaker-recognition🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
Python Apache License 2.0 UpdatedJan 8, 2019 -
pyAudioAnalysis Public
Forked from tyiannak/pyAudioAnalysisPython Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Python Apache License 2.0 UpdatedOct 13, 2018 -
DeepSpeaker-pytorch Public
Forked from qqueing/DeepSpeaker-pytorchSpeaker embedding(verification and recognition) using Pytorch
Python MIT License UpdatedOct 11, 2018 -
dVectorSpeakerRecognition Public
Forked from wangleiai/dVectorSpeakerRecognition基于dVector的说话人识别keras
Python UpdatedSep 21, 2018 -
py-webrtcvad Public
Forked from wiseman/py-webrtcvadPython interface to the WebRTC Voice Activity Detector
C Other UpdatedMay 4, 2018