This project performs voice cloning and translation in Python. The input is a search string (for example, "sharukhan tedx" or "samanatha tedx").
Starting from that string, the pipeline runs the following steps:
1. YouTube Video Extraction: The project searches the YouTube Data API for videos matching the input keyword and collects their links for further processing.
2. Video Download and Audio Extraction: It downloads the matching YouTube videos, extracts their audio tracks, and saves the audio as WAV files.
3. Audio Segmentation: The WAV files are split into smaller segments that are processed individually, which keeps long recordings manageable.
4. Speech Recognition: Using the SpeechRecognition library, the project converts each audio segment into text.
5. Translation: The recognized text is translated into other languages. This project demonstrates translation to Telugu and Hindi with Google Translate, accessed through the Googletrans library.
6. Voice Cloning: Custom voice cloning is performed with the Tortoise TTS library, which generates speech in the voice of the user's choice.
Run 'main.py' first to generate the voice segment files, then place all of them alongside 'Voice_Clone with tortoise-tts.ipynb', where they serve as the dataset for voice cloning.
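To make the audio segmentation step concrete, here is a minimal sketch that splits a WAV file into fixed-length clips using only the standard library `wave` module. It is not the code in 'audio_processing.py'; the function name `split_wav`, the 10-second default, and the output naming scheme are assumptions chosen for illustration.

```python
import wave
from pathlib import Path


def split_wav(src_path, out_dir, segment_seconds=10.0):
    """Split a WAV file into consecutive clips of at most segment_seconds each.

    Hypothetical helper: the real project may segment audio differently.
    Returns the list of written clip paths, in order.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []

    with wave.open(str(src_path), "rb") as src:
        params = src.getparams()
        frames_per_segment = int(params.framerate * segment_seconds)
        if frames_per_segment <= 0:
            raise ValueError("segment_seconds is too small for this sample rate")

        index = 0
        while True:
            # readframes returns fewer frames (or b"") near the end of the file
            frames = src.readframes(frames_per_segment)
            if not frames:
                break
            out_path = out_dir / f"{Path(src_path).stem}_{index:04d}.wav"
            with wave.open(str(out_path), "wb") as dst:
                # Copy the audio format so every clip matches the source
                dst.setnchannels(params.nchannels)
                dst.setsampwidth(params.sampwidth)
                dst.setframerate(params.framerate)
                dst.writeframes(frames)
            written.append(out_path)
            index += 1

    return written
```

Calling `split_wav("voice_wav/talk.wav", "voice_segments")` would write `talk_0000.wav`, `talk_0001.wav`, and so on into 'voice_segments/'. The last clip is simply shorter when the recording length is not a multiple of the segment length.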
The project is structured as follows:
- 'main.py': The main script that orchestrates the entire process, from video extraction to voice cloning.
- 'audio_processing.py': Contains functions for audio extraction and segmentation.
- 'translation.py': Handles text translation to Telugu and Hindi.
- 'Voice_Clone with tortoise-tts.ipynb': Manages the custom voice cloning process.
- 'videos_dir/': Directory to store downloaded YouTube videos.
- 'voice_wav/': Directory to store extracted audio in WAV format.
- 'voice_segments/': Directory to store segmented audio clips.
- 'tortoise': Contains custom voice models and related data.
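If the working directories do not exist yet, they can be created up front. The sketch below assumes the three directory names listed above and a hypothetical helper `ensure_dirs`; 'main.py' may already handle this itself.

```python
from pathlib import Path

# Directory names taken from the project layout above
PIPELINE_DIRS = ("videos_dir", "voice_wav", "voice_segments")


def ensure_dirs(root="."):
    """Create the pipeline's working directories under root if they are missing."""
    created = []
    for name in PIPELINE_DIRS:
        path = Path(root) / name
        # exist_ok=True makes repeated runs harmless
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```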
1. Clone the repository to your local machine.
2. Install the required Python libraries listed in the 'requirements.txt' file.
3. Obtain a 'YouTube Data API' key from the Google Developer Console and replace the api_key variable in 'main.py' with your key.
4. Run 'main.py' to execute the entire voice cloning and translation pipeline.
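To show how the API key is used in the video search step, here is a minimal sketch that builds a YouTube Data API v3 search request and turns a response into watch links. It uses only the standard library and makes no network call. The helper names `build_search_url` and `extract_video_links` are assumptions for illustration; 'main.py' may use a client library instead.

```python
from urllib.parse import urlencode

# Search endpoint of the YouTube Data API v3
SEARCH_ENDPOINT = "https://www.googleapis.com/youtube/v3/search"


def build_search_url(query, api_key, max_results=5):
    """Return the search URL for a keyword (hypothetical helper)."""
    params = {
        "part": "snippet",
        "q": query,
        "type": "video",          # restrict results to videos
        "maxResults": max_results,
        "key": api_key,
    }
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


def extract_video_links(response):
    """Turn a decoded search response (a dict) into a list of watch URLs."""
    links = []
    for item in response.get("items", []):
        video_id = item.get("id", {}).get("videoId")
        if video_id:  # skip any result that carries no video ID
            links.append(f"https://www.youtube.com/watch?v={video_id}")
    return links
```

The URL from `build_search_url("sharukhan tedx", api_key)` can be fetched with any HTTP client, and the decoded JSON passed to `extract_video_links`.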
- Python 3.x
- Google YouTube Data API
- Tortoise Voice Cloning Library
- SpeechRecognition Library
- Pytube Library
- Googletrans Library
- MoviePy Library
- NumPy Library