🎙️ GPT-OSS Local Voice Agent Demo

Modern local voice assistant with React frontend and Python backend using Ollama, Whisper, and XTTS

📺 Watch the tutorial • 💬 Get free consultation • 📞 AI phone solutions

Live Demo Features: Chat Interface • Voice Mode • Real-time Processing • Modern UI • Completely Local

✨ Features

💬 Chat Interface: Text conversation with local AI models
🎤 Voice Mode: Speech-to-text with natural voice responses
🔄 Real-time Status: Live feedback during processing
🎨 Modern UI: Beautiful React interface with smooth animations
🌐 Completely Local: No cloud services, full privacy
🔧 Open Source: Extend and customize as needed

🚀 Quick Start

Prerequisites

Python 3.8+ (3.11+ recommended)
Node.js 16+
Ollama (Install here)

Installation

# 1. Clone the repository
git clone https://github.com/everlastconsulting/gpt-oss-local-voice-agent-demo.git
cd gpt-oss-local-voice-agent-demo

# 2. Backend setup
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

# 3. Frontend setup
npm install

# 4. Configure Ollama
ollama pull gpt-oss:20b

# 5. Start the application
./start.sh  # Starts both backend and frontend

Access the app at: http://localhost:3000

🎯 Usage

Chat Mode

Type your message and press Enter
Get AI responses in real-time

Voice Mode

Click "Voice Mode" button
Click microphone and speak your question
Watch real-time transcription
Listen to AI voice response

⚙️ Configuration

Create .env file for custom settings:

# Ollama Model
OLLAMA_MODEL=gpt-oss:20b

# Audio Settings
RECORD_DURATION=4
TTS_LANGUAGE=de

# Server
FLASK_RUN_PORT=8080

🛠️ Tech Stack

Frontend: React, Tailwind CSS, Framer Motion
Backend: Flask, Python
AI/ML: Ollama (LLM), Whisper (STT), XTTS (TTS)
Audio: SoundDevice, NumPy, SciPy

🤝 Contributing

This is a demo project - feel free to:

🌟 Star if you find it useful
🐛 Report issues you encounter
💡 Suggest features in discussions
🔧 Submit pull requests for improvements

Quick Ideas

🌍 Add more languages
🎨 Improve UI/UX
📱 Mobile responsiveness
⚡ Performance optimizations

🎥 Learn More

This project was created as part of our AI development series. Check out:

📺 YouTube Channel: EverLast AI - AI tutorials and demos
💬 Free Consultation: kiberatung.de - Get expert AI advice
📞 AI Phone Assistants: kitelefonagent.de - Professional AI phone solutions

📝 License

MIT License - see LICENSE file

🙏 Acknowledgments

Ollama for local LLM hosting
OpenAI Whisper for speech recognition
Coqui TTS for text-to-speech

⭐ Star this repo if you like it! ⭐

Built with ❤️ for the open source community

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
public		public
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
backend_server.py		backend_server.py
package.json		package.json
postcss.config.js		postcss.config.js
requirements.txt		requirements.txt
start-backend.sh		start-backend.sh
start.sh		start.sh
tailwind.config.js		tailwind.config.js
voice_assistant.py		voice_assistant.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🎙️ GPT-OSS Local Voice Agent Demo

✨ Features

🚀 Quick Start

Prerequisites

Installation

🎯 Usage

Chat Mode

Voice Mode

⚙️ Configuration

🛠️ Tech Stack

🤝 Contributing

Quick Ideas

🎥 Learn More

📝 License

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

everlastconsulting/gpt-oss-local-voice-agent-demo

Folders and files

Latest commit

History

Repository files navigation

🎙️ GPT-OSS Local Voice Agent Demo

✨ Features

🚀 Quick Start

Prerequisites

Installation

🎯 Usage

Chat Mode

Voice Mode

⚙️ Configuration

🛠️ Tech Stack

🤝 Contributing

Quick Ideas

🎥 Learn More

📝 License

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages