Skip to content

A responsive speech recognition web app that converts spoken words to text in real time. Features include multi-language support, dark mode toggle, text-to-speech playback, clipboard copy, and transcript highlighting. Built with React and Material UI for an interactive and user-friendly experience.

Notifications You must be signed in to change notification settings

softenrj/speech-recognition-app

Repository files navigation

Speech Recognition and Dark Mode Web Application

Table of Contents

Project Overview

This web application leverages speech recognition and synthesis technologies to provide real-time transcription of speech input and text-to-speech functionality. It includes additional features such as dark mode, clipboard copying, and multi-language support, enhancing user accessibility and customization.

Features

  • Speech-to-Text (Real-Time Transcription)

    • Supports multiple languages including English, Hindi, Arabic, Japanese, and Chinese.
  • Text-to-Speech Playback

    • Read aloud the transcribed text with adjustable language settings.
  • Highlight Transcript Text

    • Highlight specific words or phrases in the transcript for easy reference and interaction.
    • Click on highlighted text to remove the highlight.
  • Dark Mode Toggle

    • Switch between light and dark themes for better user experience.
  • Copy Transcript to Clipboard

    • Copy the transcribed text with a single click.
  • User Controls

    • Reset, start, stop listening, and speak text commands for improved interactivity.
  • Language Selection

    • Easily change languages for speech recognition and synthesis.

Technologies Used

  • Frontend: React, Material UI, CSS
  • Speech Recognition: Web Speech API
  • Icons: Material UI Icons
  • State Management: React Hooks

Installation

Clone the repository:

git clone <repository_url>
cd <project_directory>

Install dependencies:

npm install

Start the development server:

npm start

Usage

  • Start Transcription: Click the "Start" button to begin real-time speech-to-text transcription.
  • Speak Text: Click the "Speak" button to hear the transcript read aloud.
  • Dark Mode Toggle: Click the moon/sun icon to switch between light and dark modes.
  • Copy Transcript: Click the clipboard icon to copy the text to the clipboard.
  • Language Selection: Use the dropdown to select the language for speech recognition.

Screenshots

  • Light Mode Interface: Light Mode Interface

  • Dark Mode Interface: Dark Mode Interface

Future Enhancements

  • Add pitch and rate controls for text-to-speech playback.
  • Save and manage transcript history.
  • Improve mobile responsiveness for better user experience on smaller screens.
  • Integrate pause and resume functionality for speech synthesis.

Contributing

Contributions are welcome! Please follow these steps:

  1. Fork the project.
  2. Create a new branch:
    git checkout -b feature/YourFeature
  3. Commit your changes:
    git commit -m 'Add new feature'
  4. Push the changes:
    git push origin feature/YourFeature
  5. Submit a pull request.

About

A responsive speech recognition web app that converts spoken words to text in real time. Features include multi-language support, dark mode toggle, text-to-speech playback, clipboard copy, and transcript highlighting. Built with React and Material UI for an interactive and user-friendly experience.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published