- Change mask in seamless clone and give it a try
- setup.bat / setup.sh
- create venv
- install requirements inside venv
- CodeFormer arch initialization
- Documentation
- Add directory check in inference in the beginning.
- Make preprocessing optimal.
- Clear ram after no_face_filter.
- Make face coordinates reusable:
- Saving facial coordinates as .npy file.
- Alter code to also include eye coordinates.
- Merge Data Pipeline with preprocessor:
- Remove need to recrop, realign and rewarp the image.
- Merge all data Pipeline:
- Remove the need to recrop, realign, renormalizing etc.
- Devise a way to keep frames without face in the video.
- Understand Mels and working of wav2lip model.
- Gradio UI
- A tab for Video, Audio and Output.
- A tab for Image, Audio and output.
- Inference without restorer
- Model Improvement
- Implement no_face_filter too
- Make it intuitive with proper instructions.
- Optimize Inference.
- Implement Checks.
- Face and Audio wise Lipsync using face recognition.
- A separate tab for TTS.