Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
-
Updated
Sep 22, 2025 - Python
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
Add a description, image, and links to the repomaster topic page so that developers can more easily learn about it.
To associate your repository with the repomaster topic, visit your repo's landing page and select "manage topics."