Visual Voice is an inclusive platform specifically designed for deaf and hard-of-hearing individuals. The platform allows users to upload videos without transcripts, automatically generating accurate transcripts for the video. During playback, users can view large, clear subtitles and interactive, colorful sticky notes that highlight key topics discussed in the video.
Visual Voice aims to bridge communication gaps and provide a more accessible, interactive learning and entertainment experience.
visual.voice.mp4
- Upload any video, and Visual Voice will generate an accurate transcript for the content.
- The transcription is displayed clearly below the video as large, easy-to-read subtitles.
- While the video plays, sticky notes appear on the right side of the video.
- The top note always highlights the current topic being discussed, providing a quick overview of the content.
- Notes are designed in a colorful sticky note format to make them visually engaging and easy to follow.
- Upload a Video: Upload a video without any transcripts.
- Automatic Transcript: The platform will automatically generate a transcript of the video.
- Video Playback: While the video plays:
- Clear Subtitles: The transcript will be displayed below the video in large, readable fonts.
- Sticky Notes: Interactive sticky notes will appear on the right side, showing the current topic and relevant key points in a colorful format.
-
Clone the Repository
git clone https://github.com/sarfarajansari/visualvoice.git cd visual-voice
-
Install Dependencies
- Make sure you have Node.js and npm installed.
- Run the following command to install all required dependencies:
npm install
-
Run the Application
npm start
- The platform will be available at
http://localhost:3000
.
- The platform will be available at
- Frontend: React, CSS, JavaScript
- Backend: FastAPI
- Speech-to-Text API: transformers, pytorch