-
Notifications
You must be signed in to change notification settings - Fork 16
Web Agent Contribution
guanxinyi edited this page Jun 5, 2025
·
9 revisions
Dev env see Installation.
Complete Configuration and run:
rush dev:eval
Main structure of web-bench could see Evaluator
├─ tools
│ ├─ bench-agent
│ │ ├─ src
│ │ │ ├─ llm
│ │ │ ├─ prompt
│ │ │ ├─ agent.ts
│ │ │ ├─ schedule.ts
Web-Agent is inspired by Continue.
- agent: Agent entry, process task description, file content, and error passed by the Evaluator into LLMs messages.
- schedule: Provide WebAgent with scheduling capabilities to implement rate limiting based on each LLM's constraints.
- llm:Provider of LLMs.If you want to add provider, you can refer to the files in this folder.
Execute the following command to run evaluations and view the results in apps/eval/report
:
rush eval
In the development environment, configure parameters in apps/eval/src/config.json5
:
- logLevel: 'debug', get more information.
- projects: ['@web-bench/xxxx'], not process all projects.
More details in Config Parameters.