Gemini Browser Agent

A research experiment and browser automation project scaffolding. Run agent tasks right in your Chrome browser.

Overview

Gemini Browser Agent is an automation agent that bridges a Chrome extension with Google’s Gemini Computer Use API. It observes the active tab, exchanges screenshots and events with the model, and performs actions directly in your own browser, no sandbox or virtual machine required.

Setup

Install Python 3.10+ and Chrome (or Chromium-based) browser.
Clone this repository and open a terminal in the project directory.

(Optional) Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate

Run the setup helper to install dependencies and scaffold .env:
```
python setup.py
```
Visit https://aistudio.google.com/api-keys to create a Gemini API key, then place it in the generated .env file as GEMINI_API_KEY=....

Usage

Start the Python WebSocket bridge:
```
python websocket_agent.py
```
Open Chrome and navigate to chrome://extensions.
Enable Developer mode, choose Load unpacked, and select the widget/ directory from this project.
Open the sidebar, click Connect to link the extension with the Python agent, and provide your automation goal.
Press Start AI Agent to let Gemini plan, execute actions, and stream log updates directly in your browser.

For sandboxed automation, use this repo https://github.com/pmbstyle/gemini-computer-use

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
widget		widget
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py
websocket_agent.py		websocket_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Gemini Browser Agent

Overview

Setup

Usage

About

Uh oh!

Languages

pmbstyle/gemini-browser-agent

Folders and files

Latest commit

History

Repository files navigation

Gemini Browser Agent

Overview

Setup

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages