Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

pawansharmaaaa / Lip_Wise Public

Notifications You must be signed in to change notification settings
Fork 15
Star 64

Code
Issues 7
Pull requests
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Breadcrumbs

Lip_Wise

/

todo.md

Latest commit

History

48 lines (39 loc) · 1.37 KB

Breadcrumbs

Lip_Wise

/

todo.md

File metadata and controls

48 lines (39 loc) · 1.37 KB

📝 TO-DO List:

URGENT REQUIREMENTS

Change mask in seamless clone and give it a try
setup.bat / setup.sh
- create venv
- install requirements inside venv
CodeFormer arch initialization
Documentation

PREPROCESS

Add directory check in inference in the beginning.
Make preprocessing optimal.
Clear ram after no_face_filter.
Make face coordinates reusable:
- Saving facial coordinates as .npy file.
- Alter code to also include eye coordinates.

IMPROVING GAN UPSCALING

Merge Data Pipeline with preprocessor:
- Remove need to recrop, realign and rewarp the image.

IMPROVING WAV2LIP

Merge all data Pipeline:
- Remove the need to recrop, realign, renormalizing etc.
- Devise a way to keep frames without face in the video.
  - Understand Mels and working of wav2lip model.

OPTIONAL

Gradio UI
- A tab for Video, Audio and Output.
- A tab for Image, Audio and output.

FURTHER IMPROVEMENTS

Inference without restorer
Model Improvement
Implement no_face_filter too

COLAB NOTEBOOK

Make it intuitive with proper instructions.
Optimize Inference.
Implement Checks.

FUTURE PLANS

Face and Audio wise Lipsync using face recognition.
A separate tab for TTS.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.