A simple web application that allows users to record their voice and transcribe it using GROQ's API.
- Record audio directly from the browser
- Send audio to the server for processing
- Display transcription results
- Modern, responsive UI with Tailwind CSS
- Python 3.8+
- pip (Python package manager)
- A GROQ API key (get it from GROQ Console)
-
Clone the repository
git clone <repository-url> cd voice-agent
-
Create and activate a virtual environment
# Windows python -m venv venv .\venv\Scripts\activate # macOS/Linux python3 -m venv venv source venv/bin/activate
-
Install dependencies
pip install -r requirements.txt
-
Set up environment variables Create a
.envfile in the project root with the following content:DEBUG=True SECRET_KEY='your-secret-key-here' GROQ_API_KEY='your-groq-api-key-here' -
Run migrations
python manage.py migrate
-
Create a superuser (optional, for admin access)
python manage.py createsuperuser
-
Start the development server:
python manage.py runserver
-
Open your web browser and navigate to:
http://127.0.0.1:8000/
voice_agent/- Main project configurationvoice/- Voice app containing the core functionalitytemplates/voice/- HTML templatesviews.py- View functionsurls.py- URL routing
- This is a development setup. For production, you'll need to:
- Set
DEBUG=Falsein.env - Configure a production web server (e.g., Gunicorn with Nginx)
- Set up proper static file serving
- Use a production database (PostgreSQL recommended)
- Set
MIT