Easily turn any text, PDF, or web article into high-quality audio files you can listen to while walking, driving, or on the go.
A simple Streamlit app to convert documents and web content into natural-sounding speech using Azure Text-to-Speech.
- Convert PDFs, web pages, or direct text to audio
- Choose from multiple natural-sounding voices
- Adjust speech rate for comfortable listening
- Download audio files for offline use
- Works great for listening while commuting, exercising, or relaxing
- app.py – Streamlit UI & main entry point
- service.py – Core business logic & orchestration layer
- audio_processor.py – Azure TTS integration
- text_processor.py – Text extraction utilities
- llm_client.py – Azure OpenAI integration for text optimization
- audio_output/ – Generated audio files
- Install dependencies:
pip install -r requirements.txt
- Configure Azure Speech Service:
- Create a Speech Service in Azure Portal
- Add your credentials to a
.envfile:AZURE_SPEECH_KEY=your_key AZURE_SPEECH_REGION=your_region
- Run the app:
streamlit run app.py
- Choose input: Upload a PDF, enter a URL, or paste text.
- Pick a voice & settings: Select a voice and adjust speed.
- Convert & listen: Click convert, then listen or download your audio from the app.
- Input: PDF, web page URL, or plain text
- Voices: Multiple Azure neural voices (male/female, US English)
MIT License
