Visual Voice is an inclusive platform for deaf and hard-of-hearing individuals that automatically generates accurate transcripts for uploaded videos. It provides large, clear subtitles and interactive sticky notes during playback to highlight key topics, enhancing the video-watching experience.

sarfarajansari/visualvoice

Visual Voice 🗣️🎥

Visual Voice is an inclusive platform designed for deaf and hard-of-hearing individuals. Users upload a video that has no transcript, and the platform generates one automatically. During playback, users see large, clear subtitles and interactive, colorful sticky notes that highlight the key topics discussed in the video.

Visual Voice aims to bridge communication gaps and provide a more accessible, interactive learning and entertainment experience.

Demo video: visual.voice.mp4

Features

1. Automatic Transcript Generation

  • Upload any video, and Visual Voice will generate an accurate transcript for the content.
  • The transcription is displayed clearly below the video as large, easy-to-read subtitles.
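Choosing which subtitle to show at a given playback time can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes the transcript is a list of timed segments sorted by start time, and the `Segment` class and `active_subtitle` function are hypothetical names introduced here.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Segment:
    """One timed line of the transcript (hypothetical structure)."""
    start: float  # seconds from the start of the video
    end: float    # seconds from the start of the video
    text: str


def active_subtitle(segments: List[Segment], t: float) -> Optional[str]:
    """Return the subtitle text for playback time t, or None between segments.

    Assumes `segments` is sorted by start time and segments do not overlap.
    """
    starts = [s.start for s in segments]
    # Index of the last segment that starts at or before t.
    i = bisect_right(starts, t) - 1
    if i >= 0 and t < segments[i].end:
        return segments[i].text
    return None


segments = [
    Segment(0.0, 2.5, "Welcome to the lecture."),
    Segment(3.0, 6.0, "Today we cover photosynthesis."),
]
print(active_subtitle(segments, 4.2))  # Today we cover photosynthesis.
print(active_subtitle(segments, 2.7))  # None (gap between segments)
```

A binary search keeps the lookup cheap even for long transcripts, which matters if the player calls it on every time update.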

2. Interactive Notes and Sticky Notes

  • While the video plays, sticky notes appear on the right side of the video.
  • The top note always highlights the current topic being discussed, providing a quick overview of the content.
  • Notes are designed in a colorful sticky note format to make them visually engaging and easy to follow.
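One way to keep the current topic on top is to show every note whose topic has already started, newest first. The sketch below assumes each note carries the time at which its topic begins. The `Note` fields and the `visible_notes` function are hypothetical and only illustrate the ordering rule described above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Note:
    """A sticky note tied to a point in the video (hypothetical structure)."""
    start: float  # second at which the topic begins
    topic: str
    color: str    # display color of the sticky note


def visible_notes(notes: List[Note], t: float) -> List[Note]:
    """Return the notes to display at time t, with the current topic first."""
    shown = [n for n in notes if n.start <= t]
    # Newest topic first, so the top note is the one being discussed now.
    return sorted(shown, key=lambda n: n.start, reverse=True)


notes = [
    Note(0.0, "Introduction", "yellow"),
    Note(45.0, "Light reactions", "pink"),
    Note(120.0, "Calvin cycle", "green"),
]
for note in visible_notes(notes, 60.0):
    print(note.topic)
# Light reactions
# Introduction
```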

How It Works

  1. Upload a Video: Upload a video that has no transcript.
  2. Automatic Transcript: The platform will automatically generate a transcript of the video.
  3. Video Playback: While the video plays:
    • Clear Subtitles: The transcript will be displayed below the video in large, readable fonts.
    • Sticky Notes: Interactive sticky notes will appear on the right side, showing the current topic and relevant key points in a colorful format.

Installation

  1. Clone the Repository

    git clone https://github.com/sarfarajansari/visualvoice.git
    cd visualvoice
  2. Install Dependencies

    • Make sure you have Node.js and npm installed.
    • Run the following command to install all required dependencies:
      npm install
  3. Run the Application

    npm start
    • The platform will be available at http://localhost:3000.

Technologies Used

  • Frontend: React, CSS, JavaScript
  • Backend: FastAPI
  • Speech-to-Text: Hugging Face Transformers, PyTorch
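As a rough sketch of how the speech-to-text step could be done with this stack, the example below uses the Transformers automatic-speech-recognition pipeline. The README does not say which model or settings the project uses, so the `openai/whisper-small` checkpoint, the 30-second chunking, and the helper names `transcribe` and `chunks_to_segments` are all assumptions. Decoding audio straight from a media file path also assumes ffmpeg is installed.

```python
from typing import Dict, List


def chunks_to_segments(chunks: List[dict]) -> List[Dict[str, object]]:
    """Convert pipeline chunks into flat {start, end, text} segments.

    Each chunk is expected to look like
    {"timestamp": (start, end), "text": "..."}; the end time of the
    final chunk may be None if the audio is cut off mid-sentence.
    """
    segments = []
    for chunk in chunks:
        start, end = chunk["timestamp"]
        segments.append({"start": start, "end": end, "text": chunk["text"].strip()})
    return segments


def transcribe(media_path: str, model_name: str = "openai/whisper-small") -> List[Dict[str, object]]:
    """Transcribe a media file into timed segments (assumed model choice)."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_name,
        chunk_length_s=30,  # process long recordings in 30-second windows
    )
    result = asr(media_path, return_timestamps=True)
    return chunks_to_segments(result["chunks"])
```

Calling `transcribe("lecture.mp4")` (a hypothetical file name) downloads the model on first use and returns a list of timed segments.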
