You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
docs: update sg_only documentation for v2 clone-at-verify pattern
Replace all references to the old /repo_full/ backup pattern with the
new clone-at-verify approach using clone manifests. Updates:
- docs/CONFIGS.md: rewrite MCP-Full Docker Environment section
- CLAUDE.md + AGENTS.md: update verifier description
- templates/mcp_unique_task/Dockerfile.sg_only.j2: fix stale comment
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
MCP configs use `Dockerfile.sg_only` or `Dockerfile.artifact_only` so the
65
-
agent must discover code via MCP tools. The verifier restores the full repo
66
-
before scoring. See `docs/CONFIGS.md` for the full config matrix.
65
+
agent must discover code via MCP tools. The verifier clones the mirror repo
66
+
at verification time and overlays agent changes before scoring.
67
+
See `docs/CONFIGS.md` for the full config matrix.
67
68
68
69
## Standard Workflow
69
70
0.**Before commit or push:** Run `python3 scripts/repo_health.py` (or `--quick`). Fix any failures so main stays clean and drift is caught early (see `docs/REPO_HEALTH.md`).
MCP configs use `Dockerfile.sg_only` or `Dockerfile.artifact_only` so the
65
-
agent must discover code via MCP tools. The verifier restores the full repo
66
-
before scoring. See `docs/CONFIGS.md` for the full config matrix.
65
+
agent must discover code via MCP tools. The verifier clones the mirror repo
66
+
at verification time and overlays agent changes before scoring.
67
+
See `docs/CONFIGS.md` for the full config matrix.
67
68
68
69
## Standard Workflow
69
70
0.**Before commit or push:** Run `python3 scripts/repo_health.py` (or `--quick`). Fix any failures so main stays clean and drift is caught early (see `docs/REPO_HEALTH.md`).
@@ -149,7 +172,7 @@ enterprise, etc.) need the full repo for compilation/test execution.
149
172
The configuration is controlled by the `BASELINE_MCP_TYPE` environment variable in `claude_baseline_agent.py`:
150
173
151
174
-**Baseline (`none`):** No MCP config is loaded. Uses the task's regular `Dockerfile`. The system prompt contains only the evaluation context. No `--tools` or `--disallowedTools` flags are applied.
152
-
-**MCP-Full (`sourcegraph_full`):** Uses `Dockerfile.sg_only` (truncated local source). The Sourcegraph MCP config is loaded (`.api/mcp/v1` endpoint). All local tools remain available but return empty results for source files. The system prompt instructs MCP-first usage with all 13 Sourcegraph MCP tools.
175
+
-**MCP-Full (`sourcegraph_full`):** Uses `Dockerfile.sg_only` (empty or truncated local source). The Sourcegraph MCP config is loaded (`.api/mcp/v1` endpoint). All local tools remain available but return empty results for source files. The verifier clones mirrors at verification time via clone manifest. The system prompt instructs MCP-first usage with all 13 Sourcegraph MCP tools.
153
176
154
177
Both configs use `--dangerously-skip-permissions` for autonomous operation and deliver evaluation context via `--append-system-prompt`.
0 commit comments