[Bug]: Model extremely slow on terminal-bench + fails sanity check #66

### Bug Description

When running grok-cli against [terminal-bench](https://github.com/stanford-crfm/terminal-bench), the model takes an unusually long time to start completing tasks and fails even the basic hello-world sanity check.

### Steps to reproduce

1. Install and set up terminal-bench as per the README.
2. Run the command: `tb run -a grok-cli -m grok-4-latest -d terminal-bench-core==0.1.1`
3. Observe that progress is extremely slow (around 6% in 1 hour) and that the hello-world task does not pass.

### Expected Behavior

The model should quickly start producing completions for tasks.

The hello-world task should pass as a basic sanity check.

### Actual Behavior

The run hangs for long periods between actions.

Progress is very slow (6% in ~1hr).

This is a regression: the same run worked before the fix for the hanging issue I reported previously.
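
To put numbers on the hangs described above, a small wrapper along these lines could timestamp each line of output from the run and list the gaps that exceed a threshold. This is only a sketch: `find_stalls`, `run_and_time`, `report`, and the 60-second threshold are hypothetical names and values chosen for illustration, and it assumes `tb` prints line-oriented output when its stdout is piped (a progress bar or output buffering would hide some gaps).

```python
import subprocess
import sys
import time

# The command from the reproduction steps, split into arguments.
TB_CMD = ["tb", "run", "-a", "grok-cli", "-m", "grok-4-latest",
          "-d", "terminal-bench-core==0.1.1"]


def find_stalls(timestamps, threshold_s=60.0):
    """Return (line_index, gap_seconds) for gaps longer than threshold_s."""
    stalls = []
    for i in range(1, len(timestamps)):
        gap = timestamps[i] - timestamps[i - 1]
        if gap > threshold_s:
            stalls.append((i, gap))
    return stalls


def run_and_time(cmd):
    """Run cmd, echo its output, and record one timestamp per output line.

    The first timestamp is the process start, so index i in the result
    corresponds to the i-th line of output (1-based).
    """
    timestamps = [time.monotonic()]
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT, text=True)
    for line in proc.stdout:
        timestamps.append(time.monotonic())
        sys.stdout.write(line)
    proc.wait()
    return timestamps


def report(cmd, threshold_s=60.0):
    """Run cmd and print every stall longer than threshold_s (assumed value)."""
    for index, gap in find_stalls(run_and_time(cmd), threshold_s):
        print(f"stall before output line {index}: {gap:.0f}s")
```

Calling `report(TB_CMD)` from the terminal-bench environment would print one line per stall, which could be attached to this issue alongside the run logs.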



