This project is a text-to-speech web application built using the Microsoft Edge TTS model. The user interface is created with the Gradio framework. Users can enter text, set the pitch and speech rate, choose a voice from the voice library, and download the generated audio file. You can use this web app here.
- Text-to-Speech Conversion: Converts user-input text into speech.
- Adjustable Pitch and Speech Rate: Allows users to customize the pitch and speech rate of the generated audio.
- Voice Library: Users can choose from different voices available in the library.
- Audio Download: Users can download the generated audio file.
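As a rough illustration of how text, pitch, and speech rate could be turned into an audio file, here is a minimal sketch. It assumes the `edge-tts` Python package is the client for the Microsoft Edge TTS model; the function names `format_rate`, `format_pitch`, and `synthesize` are hypothetical and are not taken from this repository's `app.py`.

```python
def format_rate(percent: float) -> str:
    """Turn a signed number such as 10 or -20 into '+10%' or '-20%'."""
    return f"{round(percent):+d}%"


def format_pitch(hz: float) -> str:
    """Turn a signed number such as 5 or -5 into '+5Hz' or '-5Hz'."""
    return f"{round(hz):+d}Hz"


async def synthesize(text: str, voice: str, rate: float = 0, pitch: float = 0,
                     out_path: str = "output.mp3") -> str:
    """Generate speech for `text` and write it to `out_path`.

    Requires network access, because the speech is produced by an
    online service.
    """
    # Assumption: the edge-tts package is installed. It is imported lazily
    # so the formatting helpers above work without it.
    import edge_tts

    communicate = edge_tts.Communicate(
        text,
        voice,
        rate=format_rate(rate),     # e.g. "+10%"
        pitch=format_pitch(pitch),  # e.g. "-5Hz"
    )
    await communicate.save(out_path)
    return out_path
```

To try it, something like `asyncio.run(synthesize("Hello", "en-US-AriaNeural", rate=10, pitch=-5))` should write `output.mp3` to the current directory. The signed string format (`+10%`, `-5Hz`) is what `edge-tts` expects for rate and pitch, which is why the helpers always emit an explicit sign.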
- Python: Programming language used for the backend.
- Microsoft Edge TTS Model: Model used for text-to-speech conversion.
- Gradio: Framework used for creating the user interface.
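The sketch below shows one way these pieces might be wired together in Gradio. It is an assumption-laden outline, not the actual contents of `app.py`: `voice_choices` and `build_demo` are hypothetical names, `synthesize_fn` stands for any function that takes `(text, voice, rate, pitch)` and returns a path to an audio file, and the voice entries are assumed to be dictionaries with `ShortName` and `Locale` keys, as returned by the `edge-tts` package's voice listing.

```python
def voice_choices(voices: list, locale_prefix: str = "") -> list:
    """Return sorted voice names, optionally limited to one locale prefix."""
    return sorted(
        v["ShortName"] for v in voices if v["Locale"].startswith(locale_prefix)
    )


def build_demo(synthesize_fn, voices: list):
    """Wire text, voice, rate, and pitch inputs to an audio output."""
    # Assumption: gradio is installed. It is imported lazily so that
    # voice_choices() can be used without it.
    import gradio as gr

    return gr.Interface(
        fn=synthesize_fn,
        # The order of inputs must match synthesize_fn's parameters.
        inputs=[
            gr.Textbox(label="Text", lines=4),
            gr.Dropdown(choices=voice_choices(voices), label="Voice"),
            gr.Slider(minimum=-50, maximum=50, value=0, step=1,
                      label="Speech rate (%)"),
            gr.Slider(minimum=-50, maximum=50, value=0, step=1,
                      label="Pitch (Hz)"),
        ],
        # type="filepath" lets synthesize_fn return a path to the audio file.
        outputs=gr.Audio(type="filepath", label="Generated audio"),
    )
```

Calling `build_demo(...).launch()` would start a local server and print the URL to open in a browser. The slider ranges here are arbitrary illustrative limits, not values taken from the project.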
- Clone the repository:
git clone https://github.com/abhisek-13/text-to-speech.git
cd text-to-speech
- Create and activate a virtual environment:
python -m venv venv
venv\Scripts\activate # On Windows
- Install the required packages:
pip install -r requirements.txt
- Run the Gradio application:
python src\app.py
- Open your web browser and go to the URL provided by Gradio to access the web app.
- Input your text, set the pitch and speech rate, choose a voice, and click the generate button to create the audio file.
- Click the download button to save the generated audio file to your device.
- app.py: Main application file for running the Gradio web app.
- requirements.txt: List of required Python packages.
Contributions are welcome! Please fork the repository and submit a pull request for any enhancements or bug fixes.
This project is licensed under the MIT License. See the LICENSE file for more details.
For any questions or feedback, please contact abhisekmaharana9861@gmail.com.