Lists (3)
Sort Name ascending (A-Z)
Stars
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen, Run Chen, and Julia Hirschberg.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)