Add ATIF conversion pipeline by neubig · Pull Request #256 · neulab/agent-data-protocol

neubig · 2026-06-03T12:02:35Z

Summary

Adds ATIF v1.7 schema models plus ADP↔ATIF adapter helpers, with ATIF normalization kept ATIF-in/ATIF-out.
Keeps raw_to_atif.py independent of the legacy ADP converters; the shared raw converter now maps common raw chat/tool-call shapes directly into ATIF and preserves raw tool names before normalization.
Adds per-dataset atif_to_std.py entrypoints for ATIF-to-ATIF tool normalization, backed by shared normalization utilities for common shell/Python/code aliases.
Updates all shared std_to_sft.py converters (OpenHands v0, SWE-agent, AgentLab, OpenHands SDK) to accept ATIF input through a shared loader.
Updates docs, sample/full generation scripts, and tests to route SFT generation through the dataset-specific ATIF normalization stage.

Fixes #243.

Tests

PATH="$HOME/.local/bin:$PATH" pre-commit run --all-files
python -m pytest tests/test_atif_schemas.py tests/test_dataset_structure.py tests/test_std_to_sft_conversion.py -q (254 passed, 12 skipped)
PATH="$HOME/.local/bin:$PATH" python -m pytest tests/ -q (602 passed, 97 skipped)

Notes / design decisions

Raw-to-ATIF separation: raw_to_atif.py no longer shells out to raw_to_standardized.py or adapts through ADP. It performs a lightweight raw transcript/tool-call projection into ATIF and leaves cross-dataset tool smoothing to the next stage.
Dataset ATIF normalization: Every dataset now has an atif_to_std.py wrapper. The current wrappers share common ATIF normalization logic, so datasets with no special smoothing are effectively no-ops beyond common alias cleanup; future dataset-specific scripts can diverge where needed.
Normalization output: atif_to_std.py emits ATIF-compatible JSONL, not legacy ADP sample_std.json records.
SFT conversion: Shared SFT converters now accept ATIF input. Internally they use the existing event-based conversion path for compatibility while the pipeline entrypoint is normalized ATIF.

This PR description was updated by an AI agent (OpenHands) on behalf of the user.

Co-authored-by: openhands <openhands@all-hands.dev>

neubig · 2026-06-03T12:25:43Z

@OpenHands /iterate

openhands-ai · 2026-06-03T12:26:28Z

I'm on it! neubig can track my progress at all-hands.dev

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — the pipeline wires up correctly and all 596 tests pass. Three issues worth addressing before merge.

is ATIF-in / ATIF-out (normalization only), but the name and the README pipeline diagram () imply it emits ADP std format. This naming mismatch will mislead future contributors. Rename it (or clarify in a comment and the diagram).
does a wasteful round-trip through at the end; is sufficient since the copy is already a valid .
The fallback in is a fragile heuristic — see inline comment.

[RISK ASSESSMENT]

[Overall PR] Risk Assessment: 🟢 LOW — additive pipeline layer, backward-compatible, no existing behavior broken.

VERDICT: ✅ Worth merging with the heuristic fixed.

KEY INSIGHT: delegates to internally, so the ATIF stage currently inherits ADP normalization rather than preserving raw tool shapes — acceptable for now but worth a follow-up if per-dataset ATIF fidelity is ever needed.

This review was generated by an AI agent (OpenHands) on behalf of the user.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26885141828

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — The three previously flagged issues are partially addressed: normalize_atif_trajectory no longer re-validates, and the schema_version fallback is now exception-based. Two new issues and one outstanding naming concern are below.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26885838005

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — pipeline wires up correctly and previous review items are resolved. One new correctness bug found.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26886419503

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — pipeline wires up correctly, all 596 tests pass, and previous review items are resolved. Two new issues below.

This review was generated by an AI agent (OpenHands) on behalf of the user.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26887170488

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — pipeline wires up correctly and all 596 tests pass. Two new issues found.

Was this automated review useful? React with 👍 or 👎 to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26888057010

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — previous review items are all resolved and all 596 tests pass. One new documentation inconsistency below.

AGENTS.md line 146 — SFT generation command doesn't use the ATIF pipeline

The pipeline diagram (line 30) shows the canonical path as sample_atif.json → scripts/atif_to_std.py → agents/*/std_to_sft.py → sample_sft/, but the "Generate sample files" command at line 146 still reads from sample_std.json directly:

cat datasets/$MY_DATASET/sample_std.json | ... | python agents/openhands_v0/std_to_sft.py ...

Both paths produce identical output (since std_to_sft.py auto-detects format), but contributors following AGENTS.md will never exercise the new ATIF-based SFT generation path. Consider updating to:

cat datasets/$MY_DATASET/sample_atif.json | python scripts/json_to_jsonl.py | python scripts/atif_to_std.py | python agents/openhands_v0/std_to_sft.py --is_web=no --api_env=execute_bash | python scripts/jsonl_to_json.py > datasets/$MY_DATASET/sample_sft/openhands_v0.json

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26888896141

Co-authored-by: openhands <openhands@all-hands.dev>

github-actions

🟡 Acceptable — pipeline wires up correctly, tests pass (596 passed, 97 skipped), and previous review items are resolved. One new issue found.

full_atif.jsonl is not gitignored — script_full.sh (line 9) generates datasets/$MY_DATASET/full_atif.jsonl, but .gitignore only covers full_raw.jsonl, full_std.jsonl, and full_sft.jsonl. A contributor who runs the full pipeline locally could accidentally commit large ATIF JSONL files. Add full_atif.jsonl to .gitignore beside the other full_*.jsonl patterns, and add it to the AGENTS.md "do not commit" list (line 44) alongside the existing full-corpus artifact names.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/26889796205

github-actions

🟡 Acceptable — both previously unresolved threads are resolved in the current code, and 602 tests pass. One minor documentation inconsistency found.

Resolved threads (verified in workspace):

agents/openhands_sdk/std_to_sft.py: no agenttuning_os-specific logic remains in the shared converter; correctly moved to datasets/agenttuning_os/std_to_sft.py.
AGENTS.md sample_std.json generation command now correctly routes through atif_to_std.py.

CONTRIBUTING.md lines 28–39 — "Standardized Schema Components" section not updated

The pipeline diagram and generation commands were updated to ATIF, but this section still describes the primary standardized format as MessageAction/CodeAction/ApiAction/TextObservation/WebObservation. Since sample_std.json is now ATIF, new contributors reading top-down will have incorrect expectations about the schema. Consider updating this section to describe ATIF steps/tool_calls/observations, or add a note that the listed ADP types belong to the legacy raw_to_standardized.py compatibility path.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/27141347753

github-actions

🟡 Acceptable — pipeline wires up correctly, both previously-unresolved threads are resolved in the current code, and 602 tests pass. No new correctness issues found.

Previously-unresolved threads (verified in workspace):

agents/openhands_sdk/std_to_sft.py: no agenttuning_os-specific logic in the shared converter; correctly moved to datasets/agenttuning_os/std_to_sft.py.
AGENTS.md: generation commands correctly route through raw_to_atif.py → atif_to_std.py.

Quick spot-checks:

agents/openhands_v0/api.py: execute_ipython_cell required/optional key typo fix is correct.
scripts/atif_input.py: exception-based dispatch is clean; a schema_version that is neither None nor ATIF_SCHEMA_VERSION routes to Trajectory(**data), which will raise a clear ADP version-mismatch error.
agents/openhands_sdk/condensation_sft.py: json.loads runs twice on the error path (once inside load_atif_aware_trajectory, once in the except block). Not a correctness bug, but a minor cleanup opportunity.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/27142080328

github-actions

🟡 Acceptable — pipeline wires up correctly, all previously-raised issues are resolved, and 602 tests pass. One minor inconsistency found.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/27145041970

github-actions · 2026-06-08T14:40:05Z

 import traceback
+from pathlib import Path
+
+sys.path.insert(0, str(Path(__file__).resolve().parents[2]))


🟡 Suggestion: sys.path.insert added here but not in sibling converters that have the same new import.

agents/agentlab/std_to_sft.py and agents/sweagent/std_to_sft.py both gained from scripts.atif_input import load_trajectory in this PR but have no corresponding sys.path.insert. Either add the same guard to those files for consistency, or remove it here (all commands in AGENTS.md already set PYTHONPATH).

github-actions

🟡 Acceptable — all three previously-unresolved threads are verified as resolved in the current code, and 602 tests pass. One minor new observation below.

Verified resolved threads:

agents/openhands_v0/std_to_sft.py:10 (sys.path.insert): all four sibling converters (openhands_v0, agentlab, sweagent, openhands_sdk) now include the sys.path.insert call consistently.
agents/openhands_sdk/std_to_sft.py: no agenttuning_os-specific logic in the shared converter.
AGENTS.md: generation commands correctly route through raw_to_atif.py → atif_to_std.py, and the SFT step reads from sample_std.json (which is now ATIF format).

Minor new observation — scripts/atif_input.py version dispatch:
The dispatch uses exact equality (== ATIF_SCHEMA_VERSION). If a future ATIF version (e.g., "ATIF-v1.8") is introduced, records with the old "ATIF-v1.7" version string fall through to return Trajectory(**data) (ADP path) and fail with a confusing Pydantic ValidationError rather than a clear "unsupported ATIF version" message. The ATIFTrajectory field validator already produces a clean error, so routing any "ATIF-" prefixed schema_version through the ATIF parse path would give a better failure signal. Not a blocker.

Was this automated review useful? React with 👍 or 👎 to this review to help us measure review quality.
Workflow run: https://github.com/neulab/agent-data-protocol/actions/runs/27145768889

neubig · 2026-06-11T00:39:41Z

 import sys
+from pathlib import Path
+
+sys.path.insert(0, str(Path(__file__).resolve().parents[2]))


This import style should not be necessary anywhere in the repo

neubig · 2026-06-11T00:42:08Z

@OpenHands since we are now relying on the ATIF schema, we can delete the ADP schema. Do so and adjust all scripts accordingly

openhands-ai · 2026-06-11T00:42:46Z

I'm on it! neubig can track my progress at all-hands.dev

Co-authored-by: openhands <openhands@all-hands.dev>

neubig · 2026-06-11T01:04:02Z

Addressed the ADP schema removal request in ff580a6. Summary:

Deleted the legacy ADP schema package, schema docs, ADP↔ATIF adapter, and schema-version release workflow/check script.
Kept ATIF as the only standardized schema surface via schema/atif.py and updated validation tests accordingly.
Updated SFT converters, generated-thoughts tooling, raw_to_standardized compatibility wrappers, docs, and role-preservation tests to consume normalized ATIF.

Validation:

PATH="$HOME/.local/bin:$PATH" pre-commit run --all-files
PATH="$HOME/.local/bin:$PATH" python -m pytest tests/ -q (626 passed, 98 skipped)

This PR comment was created by an AI agent (OpenHands) on behalf of the user.

neubig · 2026-06-11T14:57:26Z

@OpenHands Can raw_to_standardized.py be deleted now given that the method for creating standardized format is now raw_to_atif and atif_to_std? Also, check to make sure that the changes to the resulting sft data are minimal compared to when the sft data was generated from the legacy adp format.

openhands-ai · 2026-06-11T14:58:08Z

I'm on it! neubig can track my progress at all-hands.dev

Co-authored-by: openhands <openhands@all-hands.dev>

neubig · 2026-06-11T15:10:26Z

Addressed the raw_to_standardized.py cleanup in 95e27d6:

Deleted all per-dataset raw_to_standardized.py compatibility wrappers and the shared scripts/raw_to_standardized_common.py wrapper.
Removed the obsolete go-browse-wa parse-action test that imported the deleted legacy converter.
Updated dataset-structure validation, deterministic-converter lint coverage, README/CONTRIBUTING/AGENTS guidance, and affected dataset README snippets to use the canonical raw_to_atif.py -> atif_to_std.py path.
Confirmed there are no remaining raw_to_standardized references in the repository.

SFT comparison against the legacy ADP-generated data on origin/main:

Common sample_sft/*.json files compared: 86
Unchanged exactly: 76
Changed: 10
New/deleted SFT files: 0/0

Changed SFT files observed in the existing ATIF PR output:

datasets/agenttuning_alfworld/sample_sft/openhands_sdk.json     base_records=5   atif_records=5   changed_records=5
datasets/agenttuning_kg/sample_sft/openhands_sdk.json           base_records=5   atif_records=5   changed_records=5
datasets/agenttuning_mind2web/sample_sft/openhands_sdk.json     base_records=5   atif_records=5   changed_records=5
datasets/agenttuning_os/sample_sft/openhands_sdk.json           base_records=5   atif_records=5   changed_records=5
datasets/agenttuning_webshop/sample_sft/openhands_sdk.json      base_records=5   atif_records=5   changed_records=5
datasets/android_in_the_wild/sample_sft/openhands_v0.json       base_records=1   atif_records=3   changed_records=3
datasets/go-browse-wa/sample_sft/openhands_v0.json              base_records=5   atif_records=100 changed_records=100
datasets/nemotron_terminal_corpus/sample_sft/openhands_v0.json  base_records=4   atif_records=5   changed_records=5
datasets/nnetnav-live/sample_sft/openhands_v0.json              base_records=5   atif_records=30  changed_records=30
datasets/nnetnav-wa/sample_sft/openhands_v0.json                base_records=5   atif_records=68  changed_records=68

Validation run:

PATH="$HOME/.local/bin:$PATH" python -m pytest tests/test_dataset_structure.py tests/test_atif_schemas.py tests/test_deterministic_terminal_messages.py -q — 215 passed
PATH="$HOME/.local/bin:$PATH" pre-commit run --all-files — passed

Note: I first attempted python -m pip install -r requirements.txt, but the full install failed while building greenlet for Python 3.13 via the browsergym-core/playwright==1.44 dependency path. I then installed the minimal validation dependencies needed for the commands above.

This PR comment was created by an AI agent (OpenHands) on behalf of the user.

openhands-ai · 2026-06-11T15:11:14Z

Since my last summary, there were no additional code changes or follow-up actions.

Final status:

✅ The PR comment request was fully addressed.
✅ raw_to_standardized.py compatibility wrappers were removed.
✅ References and validation logic were updated to use the canonical raw_to_atif.py -> atif_to_std.py pipeline.
✅ SFT output differences versus the legacy ADP path were checked and summarized on the PR.
✅ Changes were committed, pushed, and documented in a PR comment with AI disclosure.
✅ Validation passed for the focused pytest suite and pre-commit run --all-files.

Conciseness:

The changes are scoped to the request: removing the obsolete legacy pipeline entrypoints and updating only the affected docs/tests/references.
I did not identify extraneous changes that need reverting.

The work is complete.

Co-authored-by: openhands <openhands@all-hands.dev>

neubig · 2026-06-11T17:24:03Z

I audited and regenerated the samples in f8d9597.

Answer: no, the committed sample_sft/* artifacts were not all reproducible from the new sample_raw.json -> raw_to_atif.py -> sample_atif.json -> atif_to_std.py -> sample_std.json -> agents/*/std_to_sft.py path. I regenerated them and committed the resulting artifacts plus the converter fixes needed to make the full sample pipeline run end-to-end.

What changed:

Regenerated all 52 sample_atif.json / sample_std.json files from their committed sample_raw.json inputs.
Regenerated all 86 committed sample_sft/*.json files from sample_std.json through the shared SFT converters.
Added raw-turn parsing for turns/tool_use/tool_result shaped raw records so datasets like SWE-chat no longer fall back to raw JSON blobs.
Normalized finish/submit-style terminal tool calls and made SFT converters tolerate invalid source tool-call attempts that are present in raw trajectories.
Adjusted the SFT turn-count sanity check to account for ATIF grouping/copy-context differences observed in regenerated data.

SFT audit versus origin/main legacy ADP-generated data after true regeneration:

Common SFT files: 86
Unchanged exactly: 1
Changed: 85
New/deleted SFT files: 0/0

So the regenerated output is not minimal compared to legacy ADP output. The earlier audit showing 76/86 unchanged was against the pre-regeneration artifacts; after forcing regeneration through the ATIF path, the deltas are much larger. The biggest count changes are in flattened web/AgentLab-style samples and datasets whose ATIF sample now preserves all raw sample rows (for example go-browse-wa, nnetnav-*, android_in_the_wild, and AgentLab outputs).

Validation run:

PATH="$HOME/.local/bin:$PATH" python -m pytest tests/test_dataset_structure.py tests/test_atif_schemas.py tests/test_std_to_sft_conversion.py tests/test_deterministic_terminal_messages.py -q — 307 passed, 12 skipped
PATH="$HOME/.local/bin:$PATH" pre-commit run --all-files — passed

Dependency note: full pip install -r requirements.txt is still not clean on this Python 3.13 environment due the pinned browser/playwright/greenlet path, so I installed the missing validation/regeneration dependencies directly. For browser prompt imports, I used a modern Playwright/greenlet wheel plus the lightweight browsergym pieces needed by the converter.

This PR comment was created by an AI agent (OpenHands) on behalf of the user.

Add ATIF conversion pipeline

a091e83

Co-authored-by: openhands <openhands@all-hands.dev>

openhands-ai Bot mentioned this pull request Jun 3, 2026

Compare ADP and ATIF for possible format unification #243

Open

fix: bump schema version for ATIF changes

1c6a153

Co-authored-by: openhands <openhands@all-hands.dev>

neubig marked this pull request as ready for review June 3, 2026 12:36

github-actions Bot reviewed Jun 3, 2026

View reviewed changes

Comment thread agents/openhands_v0/std_to_sft.py Outdated

Comment thread schema/atif.py Outdated

chore: address PR review feedback (#256)

7a121b6

Co-authored-by: openhands <openhands@all-hands.dev>

neubig requested a review from openhands-agent June 3, 2026 12:49

github-actions Bot reviewed Jun 3, 2026

View reviewed changes

Comment thread scripts/raw_to_atif_common.py Outdated

Comment thread scripts/atif_to_std.py

Comment thread tests/test_atif_schemas.py

chore: clarify ATIF normalization pipeline

4b9a0f6

Co-authored-by: openhands <openhands@all-hands.dev>

neubig requested review from openhands-agent and removed request for openhands-agent June 3, 2026 13:00

github-actions Bot requested changes Jun 3, 2026

View reviewed changes

Comment thread schema/atif.py Outdated

Comment thread tests/test_atif_schemas.py Outdated

Comment thread README.md Outdated

fix: preserve environment observations in ATIF roundtrip

1a6e588

Co-authored-by: openhands <openhands@all-hands.dev>

neubig requested review from openhands-agent and removed request for openhands-agent June 3, 2026 13:14

github-actions Bot requested changes Jun 3, 2026

View reviewed changes

Comment thread AGENTS.md Outdated

Comment thread schema/atif.py Outdated

fix: preserve available APIs through ATIF conversion

2773fea

Co-authored-by: openhands <openhands@all-hands.dev>

neubig requested review from openhands-agent and removed request for openhands-agent June 3, 2026 13:30

github-actions Bot requested changes Jun 3, 2026

View reviewed changes

Comment thread script_full.sh

Comment thread schema/atif.py Outdated

fix: preserve ATIF standalone observation results

8e570b5

Co-authored-by: openhands <openhands@all-hands.dev>

neubig requested review from openhands-agent and removed request for openhands-agent June 3, 2026 13:45

github-actions Bot reviewed Jun 3, 2026

View reviewed changes

docs: align OpenHands sample SFT generation with ATIF

a28b498

Co-authored-by: openhands <openhands@all-hands.dev>

neubig requested review from openhands-agent and removed request for openhands-agent June 3, 2026 14:00

github-actions Bot requested changes Jun 3, 2026

View reviewed changes

Graham Neubig added 2 commits June 8, 2026 09:28

fix: address ATIF docs and OS SDK wrapper

5b2af5b

fix: satisfy ruff import order for OS wrapper

7d0dc3c

neubig added review-this Trigger the OpenHands PR review workflow and removed review-this Trigger the OpenHands PR review workflow labels Jun 8, 2026

github-actions Bot reviewed Jun 8, 2026

View reviewed changes

docs: align contributing schema overview with ATIF

50df06c

neubig added review-this Trigger the OpenHands PR review workflow and removed review-this Trigger the OpenHands PR review workflow labels Jun 8, 2026

github-actions Bot approved these changes Jun 8, 2026

View reviewed changes

Graham Neubig added 2 commits June 8, 2026 10:21

docs: restore AGENTS repository guide

bb0b1ab

fix: move OS normalization into ATIF std

c463092

neubig added review-this Trigger the OpenHands PR review workflow and removed review-this Trigger the OpenHands PR review workflow labels Jun 8, 2026

github-actions Bot reviewed Jun 8, 2026

View reviewed changes

fix: align shared converter execution guards

396133a

neubig added review-this Trigger the OpenHands PR review workflow and removed review-this Trigger the OpenHands PR review workflow labels Jun 8, 2026

github-actions Bot approved these changes Jun 8, 2026

View reviewed changes

neubig commented Jun 11, 2026

View reviewed changes

Remove legacy ADP schema

ff580a6

Co-authored-by: openhands <openhands@all-hands.dev>

chore: remove raw_to_standardized wrappers

95e27d6

Co-authored-by: openhands <openhands@all-hands.dev>

chore: regenerate samples through ATIF pipeline

f8d9597

Co-authored-by: openhands <openhands@all-hands.dev>

Conversation

neubig commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Tests

Notes / design decisions

Uh oh!

neubig commented Jun 3, 2026

Uh oh!

openhands-ai Bot commented Jun 3, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 8, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

neubig Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

neubig commented Jun 11, 2026

Uh oh!

openhands-ai Bot commented Jun 11, 2026

Uh oh!

neubig commented Jun 11, 2026

Uh oh!

neubig commented Jun 11, 2026

Uh oh!

openhands-ai Bot commented Jun 11, 2026

Uh oh!

neubig commented Jun 11, 2026

Uh oh!

openhands-ai Bot commented Jun 11, 2026

Uh oh!

neubig commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

neubig commented Jun 3, 2026 •

edited

Loading