Yiwei Mao edited this page Jul 15, 2025 · 10 revisions

Refer to the Docker setup guide for instructions on installing Docker on your machine.

Evaluation Quick Start

  1. Create a new empty folder and add two files to it:
./config.json5
./docker-compose.yml
  2. For config.json5, copy the JSON5 below and edit it according to Config Parameters:
{
  "models": [
    "openai/gpt-4o", 
    // You can add more models here
    // "claude-sonnet-4-20250514"
  ],
  // Eval one project only
  // "projects": ["@web-bench/react"]
}
  3. For docker-compose.yml, copy the YAML below and set the environment variables:
services:
  web-bench:
    image: maoyiweiebay777/web-bench:latest
    volumes:
      - ./config.json5:/app/apps/eval/src/config.json5
      - ./report:/app/apps/eval/report
    environment:
      # Add environment variables according to apps/src/model.json
      - OPENROUTER_API_KEY=your_api_key
      # Add API keys for other models
      # - ANTHROPIC_API_KEY=your_api_key
  4. Run Docker Compose:
docker compose up
  5. The evaluation report will be generated under ./report/.
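
The steps above can be sketched as a small shell script. This is only an illustration, not part of the official setup: the folder name `web-bench-eval` is a hypothetical choice, and the script assumes you paste the config.json5 and docker-compose.yml contents shown above into the created files yourself. It declines to start the container while docker-compose.yml is still empty or still contains the `your_api_key` placeholder.

```shell
#!/bin/sh
# Hypothetical folder name; any new empty folder works.
WORKDIR="web-bench-eval"

# Step 1: create the folder and the two files (left empty here).
mkdir -p "$WORKDIR"
touch "$WORKDIR/config.json5" "$WORKDIR/docker-compose.yml"

# Steps 2-3 are manual: paste the JSON5 and YAML from this page into the
# two files and replace your_api_key with a real key.

# Step 4: start the evaluation only once docker-compose.yml has been filled in.
if [ ! -s "$WORKDIR/docker-compose.yml" ] || \
   grep -q "your_api_key" "$WORKDIR/docker-compose.yml"; then
  echo "Edit $WORKDIR/config.json5 and $WORKDIR/docker-compose.yml, then rerun."
else
  # Run in a subshell so the caller's working directory is unchanged.
  (cd "$WORKDIR" && docker compose up)
fi
```

Running the script a second time, after both files are filled in, starts the evaluation; the report then appears under `web-bench-eval/report/` because `./report` in docker-compose.yml is resolved relative to that folder.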

Note

The current mode only supports evaluation, not development.
