-
Notifications
You must be signed in to change notification settings - Fork 849
Description
I would like to contribute a new project titled Image to Speech Conversion System, which focuses on enhancing accessibility for visually impaired individuals by converting visual content into auditory information.
Key Features:
Image Recognition: Using Optical Character Recognition (OCR) technology to extract text from images.
Speech Synthesis: Generating natural-sounding speech from the extracted text.
User Interface: A simple, interactive UI for users to upload images and listen to the output (using Python Tkinter).
File Management: Options to save the generated speech as an MP3 file.
Technologies Used:
Python: Core programming language.
Tesseract OCR: For text extraction from images.
gTTS (Google Text-to-Speech): For converting text to speech.
Tkinter: For creating a user-friendly interface.
Let me know if this aligns with the repository’s goals. I’d be happy to implement and provide detailed documentation for the project. Looking forward to your feedback!