-
-
Genesis Public
Forked from Genesis-Embodied-AI/GenesisA generative world for general-purpose robotics & embodied AI learning.
Python Apache License 2.0 UpdatedDec 23, 2024 -
Integrated_AEC_NR Public
Forked from Arnout-Roebben/Integrated_AEC_NRMATLAB MIT License UpdatedDec 6, 2024 -
silero-vad Public
Forked from snakers4/silero-vadSilero VAD: pre-trained enterprise-grade Voice Activity Detector
Python MIT License UpdatedNov 7, 2024 -
emotion2vec Public
Forked from ddlBoJack/emotion2vec[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Python UpdatedOct 27, 2024 -
jltr-alignment Public
Forked from irmakbky/jltr-alignmentAudio-to-score alignment with human-labeled repeats
Python MIT License UpdatedOct 25, 2024 -
ScanTalk Public
Forked from miccunifi/ScanTalk[ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans
Python Other UpdatedOct 24, 2024 -
FLAME-Universe Public
Forked from TimoBolkart/FLAME-UniverseSummary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model
UpdatedOct 14, 2024 -
MMHead Public
Forked from wsj-sjtu/MMHeadMMHead: Towards Fine-grained Multi-modal 3D Facial Animation (ACM MM 2024)
UpdatedOct 10, 2024 -
LivePortrait Public
Forked from KwaiVGI/LivePortraitBring portraits to life!
Python Other UpdatedOct 7, 2024 -
ToneLab Public
Forked from YiYang-github/ToneLabA platform designed for lightweight documentation and quantitative analysis in Sino-Tibetan tonal languages
Python MIT License UpdatedOct 3, 2024 -
firecrawl Public
Forked from mendableai/firecrawl🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
TypeScript GNU Affero General Public License v3.0 UpdatedSep 23, 2024 -
LLaMA-Omni Public
Forked from ictnlp/LLaMA-OmniLLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Python Apache License 2.0 UpdatedSep 23, 2024 -
Awesome-Chinese-LLM Public
Forked from HqWu-HITCS/Awesome-Chinese-LLM整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
UpdatedSep 19, 2024 -
-
SPARK Public
Forked from KelianB/SPARKOfficial implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture
UpdatedSep 13, 2024 -
ASTWS-AEC Public
Forked from ZhaoF-i/ASTWS-AECAttention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
Python UpdatedSep 13, 2024 -
-
DEEPTalk Public
Forked from whwjdqls/DEEPTalkOfficial code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation"
Python MIT License UpdatedAug 29, 2024 -
langchain Public
Forked from langchain-ai/langchain🦜🔗 Build context-aware reasoning applications
Jupyter Notebook MIT License UpdatedAug 28, 2024 -
speech-to-speech Public
Forked from huggingface/speech-to-speechSpeech To Speech: an effort for an open-sourced and modular GPT4-o
Python Apache License 2.0 UpdatedAug 27, 2024 -
Prompt-Engineering-Guide Public
Forked from dair-ai/Prompt-Engineering-Guide🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
MDX MIT License UpdatedAug 17, 2024 -
Mandarin-Chinese-Syllable-Dataset Public
Forked from danielwei0214/Mandarin-Chinese-Syllable-Dataset汉语普通话音节数据集 - 覆盖性广,使用频率高,涵盖所有普通话发音。Mandarin Chinese Syllable Dataset - Extensive coverage, high frequency of use, and includes all Mandarin pronunciations.
UpdatedAug 13, 2024 -
Fast-3D-Talking-Face Public
Forked from qwert1887/Fast-3D-Talking-FaceDrive your metahuman to speak within 1 second.
Python UpdatedAug 1, 2024 -
universal-speech-enhancement Public
Forked from nanless/universal-speech-enhancementApply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…
Python MIT License UpdatedJul 29, 2024 -
Speech-Simulation-Tools Public
Forked from YoungJay0612/Speech-Simulation-Tools语音增强领域的相关数据仿真工具和方法汇总--持续更新
UpdatedJul 11, 2024 -
Non-corresponding-and-Topology-free-3D-Face-Expression-Transfer Public
Forked from SEULSH/Non-corresponding-and-Topology-free-3D-Face-Expression-TransferPython UpdatedJul 6, 2024 -
-
NRAEC_vs_NRextAEC Public
Forked from Arnout-Roebben/NRAEC_vs_NRextAECMATLAB MIT License UpdatedJun 14, 2024 -
SyncTalk Public
Forked from ZiqiaoPeng/SyncTalk[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Python Other UpdatedJun 6, 2024