Releases: vancoder1/NarrateAI-webui
v1.0
🎉 NarrateAI - Major Update: Kokoro TTS, Expanded Format Support & UI Enhancements! 🎉
This is a significant update for NarrateAI-webui! We've revamped core components to bring you a more powerful, versatile, and user-friendly audiobook generation experience.
✨ Highlights of this Release:
- 🚀 New TTS Engine: Kokoro TTS!
- We've transitioned from Silero TTS to Kokoro TTS (`hexgrad/Kokoro-82M`). This powerful engine offers a diverse range of languages and voices, aiming for high-quality audio output. Explore the new voice options in the settings!
- 📄 Wider Document Support:
- Your literary world just got bigger! NarrateAI can now convert:
  - Plain Text (`.txt`)
  - PDF (`.pdf`)
  - EPUB (`.epub`)
  - Microsoft Word (`.docx`), which requires `python-docx` to be installed (handled by `install.bat`)
  - HTML (`.html`, `.htm`)
- ⚙️ Interactive Settings Tab in UI:
- Take full control! A new "Settings" tab allows you to:
  - Select Language Code and see available Voices dynamically update.
  - Adjust speaking Speed (from 0.1x to 2.0x).
  - Choose your processing Device (`cpu`, or `cuda` if available).
- 💾 Persistent Configuration:
- Your preferred settings (language, voice, speed, device) are now saved in `config/config.json` and automatically loaded. The UI makes managing these a breeze.
- 💅 Enhanced User Experience:
- Clearer progress reporting during audiobook generation.
- More robust error handling and informative messages.
- 📝 Improved Logging:
- Logs are now more structured and saved to a dedicated `logs/` directory, making troubleshooting easier.
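As a rough illustration of the format support listed under "Wider Document Support", here is a minimal sketch of mapping a file extension to a format label. The `SUPPORTED_FORMATS` table and the `detect_format` function are hypothetical names for this example and are not taken from the NarrateAI codebase; only the extensions themselves come from the list above.

```python
from pathlib import Path

# Hypothetical lookup table; the extensions are the ones named in these notes.
SUPPORTED_FORMATS = {
    ".txt": "Plain Text",
    ".pdf": "PDF",
    ".epub": "EPUB",
    ".docx": "Microsoft Word",
    ".html": "HTML",
    ".htm": "HTML",
}


def detect_format(path: str) -> str:
    """Return a format label for a path, or raise ValueError if unsupported."""
    suffix = Path(path).suffix.lower()  # compare case-insensitively
    try:
        return SUPPORTED_FORMATS[suffix]
    except KeyError:
        raise ValueError(f"Unsupported file type: {suffix or '(none)'}") from None
```

Checking the extension up front lets an application reject an unsupported file with a clear message before any parsing starts.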
⚠️ Important Notes:
- TTS Engine Change: Due to the switch to Kokoro TTS, previously available Silero voices are no longer applicable. You'll find a new set of voices based on Kokoro's capabilities. Please explore the "Settings" tab to configure your new preferred voice.
- Configuration File: The `config/config.json` structure has been updated for Kokoro TTS. If you had a previous version, your settings will be adapted or reset to new defaults.
- Dependencies: The `requirements.txt` has been updated. Ensure you run `install.bat` for a fresh setup or if you encounter issues after updating. The installer now also guides you through PyTorch (CPU/CUDA) installation.
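To make the "adapted or reset to new defaults" behavior concrete, here is a minimal sketch of how a settings file such as `config/config.json` could be loaded defensively. The key names, the default values, and the `load_config` / `save_config` helpers are assumptions for illustration; the actual schema used by NarrateAI may differ. The speed range and device choices follow the Settings tab description above.

```python
import json
from pathlib import Path

# Assumed schema and placeholder defaults; NarrateAI's real values may differ.
DEFAULTS = {"language": "a", "voice": "af_heart", "speed": 1.0, "device": "cpu"}


def load_config(path="config/config.json"):
    """Load settings, keeping known keys and falling back to defaults."""
    config = dict(DEFAULTS)
    try:
        stored = json.loads(Path(path).read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        return config  # missing or unreadable file: reset to defaults
    if not isinstance(stored, dict):
        return config  # unexpected structure: reset to defaults
    # Adapt: carry over only the keys that the current schema knows about.
    for key in DEFAULTS:
        if key in stored:
            config[key] = stored[key]
    # Keep speed inside the 0.1x to 2.0x range offered in the Settings tab.
    try:
        config["speed"] = min(2.0, max(0.1, float(config["speed"])))
    except (TypeError, ValueError):
        config["speed"] = DEFAULTS["speed"]
    if config["device"] not in ("cpu", "cuda"):
        config["device"] = DEFAULTS["device"]
    return config


def save_config(config, path="config/config.json"):
    """Write settings as JSON, creating the config directory if needed."""
    target = Path(path)
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(json.dumps(config, indent=2), encoding="utf-8")
```

Merging stored values over a copy of the defaults means an older file with missing or unknown keys still yields a complete, valid configuration.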
🚀 Getting Started:
- New Users:
- Clone the repository.
- Run `install.bat` to set up the Conda environment and install all dependencies, including your choice of PyTorch (CPU/CUDA).
- Run `start.bat` to launch the application.
- Existing Users (Updating):
- Ensure your local repository is clean (commit or stash changes).
- Run `update.bat` to fetch the latest changes and update dependencies.
- (Recommended) If you encounter any issues after updating, consider running `install.bat` again to ensure your environment is fully aligned with the new requirements.
- Run `start.bat`.
🐛 Bug Fixes & Minor Improvements:
- Improved handling of empty or invalid input files.
- More resilient audio chunk processing and combining.
- UI elements are now more responsive to configuration changes.
- Codebase refactoring for better maintainability and stability.
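For readers curious what "resilient audio chunk processing and combining" can look like, here is a minimal sketch using only the Python standard library. It is not NarrateAI's implementation: the `combine_chunks` and `write_wav` helpers are hypothetical, and the mono, 16-bit, 24 kHz format is an assumption made for the example.

```python
import wave


def combine_chunks(chunks):
    """Join raw PCM byte chunks, skipping failed (None) or empty ones."""
    usable = [chunk for chunk in chunks if chunk]
    if not usable:
        # Nothing usable, for example because the input file was empty.
        raise ValueError("No audio was generated; the input may be empty or invalid.")
    return b"".join(usable)


def write_wav(path, pcm, sample_rate=24000):
    """Write mono 16-bit PCM bytes to a WAV file (assumed audio format)."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
```

Skipping unusable chunks lets one failed segment drop out without aborting the whole audiobook, while the explicit error covers the case where no audio was produced at all.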
We're incredibly excited for you to try out this new version of NarrateAI-webui! Your feedback is invaluable to us. Please report any bugs, suggest features, or share your experience by opening an issue on GitHub.
Happy Narrating! 🎙️📚