Real Audio - AGI SDK Agent

A demonstration agent using voice to perform automated web actions using the AGI SDK framework.

This uses REAL Evals to evaluate your agent and give you a score and win #1 on the REAL leaderboard for a $1000 prize.

Prerequisites

Python 3.11 or higher
OpenAI API key (or compatible provider)

Installation

Clone the repository:

git clone https://github.com/agi-inc/real-audio
cd real-audio

Install the required dependencies:
```
pip install -r requirements.txt
```

Set up your API key:

export OPENAI_API_KEY="your-api-key"   # any supported provider key works

Set up playwright:
```
playwright install chromium --force
```

Usage

Run the agent with default settings:

python run_agent.py

Command Line Options

--model: Model to use (default: gpt-4o)
--task: Task to run (default: webclones.omnizon-1)
--headless: Run in headless mode (default: False)

Examples

# Run with a specific task
python run_agent.py --task webclones.dashdish-1

# Run in headless mode
python run_agent.py --headless true

Available Tasks

The project includes audio tasks for various web applications:

dashdish: Restaurant/food delivery platform tasks
fly-unified: Flight booking platform tasks
gocalendar: Calendar application tasks
gomail: Email application tasks
networkin: Professional networking platform tasks
omnizon: E-commerce platform tasks
opendining: Restaurant reservation platform tasks
staynb: Accommodation booking platform tasks
topwork: Job platform tasks
udriver: Ride-sharing platform tasks
zilloft: Real estate platform tasks

Troubleshooting

Agent logs are stored in the /results directory. If your agent crashes, it will be logged there, so look there first.

API Key Issues: Ensure your OPENAI_API_KEY is properly set and has sufficient credits
Dependencies: Make sure all requirements are installed with pip install -r requirements.txt

Dependencies

agisdk: AGI SDK framework
playwright: Browser automation
numpy: Numerical computing
openai: OpenAI API client
Pillow: Image processing

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
audio_tasks		audio_tasks
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
run_agent.py		run_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Real Audio - AGI SDK Agent

Prerequisites

Installation

Usage

Command Line Options

Examples

Available Tasks

Troubleshooting

Dependencies

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

agi-inc/real-audio

Folders and files

Latest commit

History

Repository files navigation

Real Audio - AGI SDK Agent

Prerequisites

Installation

Usage

Command Line Options

Examples

Available Tasks

Troubleshooting

Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages