Skip to content

A simple Python project, using Text to Speech to copy what is being said from the microphone.

Notifications You must be signed in to change notification settings

IbrahimIF/CopyCat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

CopyCat

Project Status

Static Badge

Description:

A simple Python project that listens to your microphone, processes the speech, and responds using Text-to-Speech (TTS). CopyCat records audio from your microphone, converts it to text, processes the text (with special responses to phrases like "copy"), and generates a spoken reply using ElevenLabs' TTS API.


Getting started:

Prerequisites

  1. Python 3.10+
  2. FFmpeg (Required for audio processing)
   - Install via:
     - macOS: `brew install ffmpeg`
     - Linux: `sudo apt install ffmpeg`
     - Windows: Download from [FFmpeg's official site](https://ffmpeg.org/)

Installation

  1. Clone the repository:
git clone https://github.com/yourusername/CopyCat.git
cd CopyCat
  1. Install dependencies:
pip install -r requirements.txt
  1. Add your ElevenLabs API key to a .env file:
ELEVENLABS_API_KEY=your_api_key_here

Features:

This project includes several exciting features for you to explore:

  • Microphone recording: Real-time microphone recording (6-second clips).
  • Speech-to-text conversion: Speech-to-text conversion via Google's Speech Recognition.
  • Smart text processing: Copies what you say and provides special responses based on your input (e.g., replies "Ok I won't then" if you say "copy").
  • TTS Responses: Natural-sounding TTS responses using ElevenLabs.
  • Automatic responses: Plays responses aloud automatically.

Technologies and Tools Utilised:

My Skills


Acknowledgment:

About

A simple Python project, using Text to Speech to copy what is being said from the microphone.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages