Real time interactive streaming digital human
-
Updated
Nov 16, 2024 - Python
Real time interactive streaming digital human
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
the comfyui custom node of MuseTalk to make audio driven videos!
Add a description, image, and links to the musetalk topic page so that developers can more easily learn about it.
To associate your repository with the musetalk topic, visit your repo's landing page and select "manage topics."