An AI-powered voice assistant that authenticates users via face recognition and executes voice commands to open applications, search the web, automate tasks, and more. It is built with Python libraries such as SpeechRecognition, pyttsx3, and OpenCV, and integrates with system tools for desktop automation.

Dhruvcodings/Voice-Based-Desktop-Assistant

🗣️ Voice-Based Desktop Assistant

A smart desktop assistant that uses speech recognition and face authentication to interact with users and execute system-level commands.


🚀 Steps to Execute the Project

1. ✅ Create a Virtual Environment

Create a virtual environment to isolate dependencies:

python -m venv venv

Activate it:

  • Windows:

    venv\Scripts\activate
  • macOS/Linux:

    source venv/bin/activate
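
To confirm that activation took effect before installing anything, a small check like the one below can help. It is a sketch that is not part of the repository, and the helper name `in_virtualenv` is hypothetical; it relies on the fact that inside a venv `sys.prefix` points at the environment while `sys.base_prefix` points at the base interpreter.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Outside a venv both prefixes are identical; inside one they differ.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


print("virtual environment active:", in_virtualenv())
```

If this prints `False`, the activation command for your platform probably was not run in the current shell.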

2. 📦 Install Required Packages

Install the following packages in the virtual environment:

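pip install SpeechRecognition pyttsx3 pywhatkit opencv-python pandas numpy Pillow psutil pyautogui pyperclip

Note: Modules such as sys, subprocess, datetime, webbrowser, and urllib.parse are part of Python's standard library, so they do not need to be installed via pip. Depending on how the project uses them, microphone input through SpeechRecognition may additionally require PyAudio, and reading Path.xlsx with pandas may require openpyxl.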
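
To verify the installation, a short check like the following can report which third-party packages are still missing. This is a sketch and not part of the repository; `missing_modules` is a hypothetical helper. Note that several import names differ from the pip names: `speech_recognition` for SpeechRecognition, `cv2` for opencv-python, and `PIL` for Pillow.

```python
import importlib.util

# Import names (not pip names) of the third-party packages listed above.
REQUIRED_MODULES = [
    "speech_recognition", "pyttsx3", "pywhatkit", "cv2", "pandas",
    "numpy", "PIL", "psutil", "pyautogui", "pyperclip",
]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    # find_spec locates a top-level module without importing it.
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
print("missing packages:", missing if missing else "none")
```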


3. 🔐 User Authentication via Face Recognition

Go to the Face2 directory and follow these steps:

  1. Generate Face Samples

    python sample_generator.py
  2. Train the Model

    python model_trainer.py
  3. Run Face Recognition for Authentication

    python face_recognition.py
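
The three steps above can also be chained from one script. The sketch below is an assumption about how one might automate them, not something the repository provides; `build_commands` and `run_pipeline` are hypothetical names. It assumes each script exits with a non-zero status on failure, which the README does not confirm.

```python
import subprocess
import sys

# Script names as listed in the steps above, in the order they must run.
STEPS = ["sample_generator.py", "model_trainer.py", "face_recognition.py"]


def build_commands(python=sys.executable):
    """Return one command line per step, using the given interpreter."""
    return [[python, script] for script in STEPS]


def run_pipeline(face_dir="Face2"):
    """Run each step inside face_dir, stopping at the first failure."""
    for command in build_commands():
        # check=True raises CalledProcessError if a script exits non-zero.
        subprocess.run(command, cwd=face_dir, check=True)
```

Calling `run_pipeline()` from the project root would run sample generation, training, and recognition in sequence. The first step most likely needs a working webcam.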

4. 🗂️ Configure Application Paths

Inside the Engine folder, update Path.xlsx with:

  • Application names
  • Their corresponding absolute file paths

Example: Chrome | C:\Program Files\Google\Chrome\Application\chrome.exe
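
As an illustration of how such a name-to-path table might be consumed, here is a minimal sketch. It is not the repository's code: the function names are hypothetical, and it assumes the sheet has no header row, with names in the first column and paths in the second. Reading .xlsx files with pandas also requires openpyxl.

```python
import subprocess


def load_rows_from_excel(xlsx_path):
    """Read (name, path) rows from the workbook (assumed layout, no header)."""
    import pandas as pd  # imported lazily so the other helpers work without it

    frame = pd.read_excel(xlsx_path, header=None, usecols=[0, 1])
    return list(frame.itertuples(index=False, name=None))


def build_lookup(rows):
    """Map lower-cased application names to their executable paths."""
    return {str(name).strip().lower(): str(path).strip() for name, path in rows}


def launch(lookup, spoken_name):
    """Start the application if its name is known; return whether it was found."""
    path = lookup.get(spoken_name.strip().lower())
    if path is None:
        return False
    subprocess.Popen([path])  # start the program without waiting for it to exit
    return True
```

Lower-casing the names makes the lookup tolerant of how the speech recognizer capitalizes the spoken application name.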


5. ▶️ Run the Assistant

Start the assistant by executing:

python main.py

🔊 Supported Commands

To explore the full list of voice commands the assistant can respond to, refer to:

run.py
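
The actual phrases are defined in run.py. Purely as an illustration of how recognized text might be split into an action and a target, consider the sketch below. The `open` and `search` keywords and the `parse_command` helper are assumptions based on the capabilities described above, not the project's real command set.

```python
KNOWN_ACTIONS = ("open", "search")  # hypothetical keywords, see run.py for the real ones


def parse_command(text):
    """Split recognized speech into an (action, target) pair."""
    cleaned = text.strip().lower()
    words = cleaned.split(maxsplit=1)
    if len(words) == 2 and words[0] in KNOWN_ACTIONS:
        return words[0], words[1]
    # Anything else is passed through so the caller can decide what to do.
    return "unknown", cleaned


print(parse_command("Open Chrome"))
```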
