Skip to content

Secure AI computer use powered by E2B Desktop Sandbox

License

Notifications You must be signed in to change notification settings

e2b-dev/open-computer-use

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Open Computer Use

A secure cloud Linux computer powered by E2B Desktop Sandbox and controlled by open-source LLMs.

Desktop.Use.+.Streaming.mp4

Features

  • Uses E2B for secure Desktop Sandbox
  • Supports Meta Llama, OS-Atlas and any LLM you want to integrate!
  • Operates the computer via the keyboard, mouse, and shell commands
  • Live streams the display of the sandbox on the client computer
  • User can pause and prompt the agent at any time
  • Uses Ubuntu, but designed to work with any operating system

Design

Open Computer Use Architecture Open Computer Use Architecture

The details of the design are laid out in this article: How I taught an AI to use a computer

LLM support

Open Computer Use is designed to easily support new LLMs. The LLM and provider combinations are are defined in models.py. Following the comments in this file, one can easily add any LLM and provider that adheres to the OpenAI API specification.

The list of tested models and providers currently includes:

Type Model Providers
Vision Llama 3.2 Fireworks, OpenRouter, Llama API
Action Llama 3.2 Fireworks, Llama API
Action DeepSeek DeepSeek
Grounding OS-Atlas HuggingFace Spaces

The following lines of code in models.py define the default LLMs and providers:

vision_model = FireworksProvider(model_names["fireworks"]["llama3.2"])
action_model = FireworksProvider(model_names["fireworks"]["llama3.3"])

If you add a new model or provider, please make a PR to this repository!

Get started

Prerequisites

1. Install the prerequisites

In your terminal:

brew install poetry ffmpeg

2. Clone the repository

In your terminal:

git clone https://github.com/e2b-dev/open-computer-use/

3. Set the environment variables

Enter the project directory:

cd open-computer-use

Create a .env file in open-computer-use and set the following:

# Get your API key here - https://e2b.dev/
E2B_API_KEY="your-e2b-api-key"
FIREWORKS_API_KEY="your-fireworks-api-key"

4. Start the web interface

Run the following command to start the agent:

poetry install
poetry run start

The agent will start and prompt you for its first instruction.