Add RolloutGateway to enable native CLI agent training #1757
Open
samsja
reviewed
Feb 16, 2026
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Adds a rollout gateway to the vLLM inference server, enabling server-side multi-turn rollout execution.
Depends on PrimeIntellect-ai/verifiers#954.
- Rollout gateway (`rollout_gateway.py`, 542 lines, new file)
- Inference server (`server.py`)
- Config (`config.py`, `inference/config.py`)
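To make the gateway's role concrete, here is a minimal sketch of the server-side bookkeeping it implies: registering a multi-turn rollout, pinning it to a data-parallel rank, accumulating per-turn token ids/masks/logprobs, and returning the trajectory on fetch. The class and method names (`RolloutGatewaySketch`, `register`, `append_turn`, `fetch`) and the round-robin rank assignment are assumptions for illustration, not the actual `rollout_gateway.py` API.

```python
from dataclasses import dataclass, field
from itertools import count


@dataclass
class RolloutState:
    """In-memory state for one registered multi-turn rollout (hypothetical shape)."""
    rollout_id: int
    dp_rank: int  # data-parallel rank this rollout is routed to
    token_ids: list = field(default_factory=list)
    masks: list = field(default_factory=list)      # 1 = trainable (assistant) token
    logprobs: list = field(default_factory=list)


class RolloutGatewaySketch:
    """Sketch of the gateway's per-rollout state management: register a rollout,
    pin it to a DP rank (round-robin here), accumulate turns, fetch the trajectory."""

    def __init__(self, dp_size: int):
        self.dp_size = dp_size
        self._ids = count()
        self._rollouts: dict[int, RolloutState] = {}

    def register(self) -> int:
        """Register a new rollout and return its id."""
        rid = next(self._ids)
        self._rollouts[rid] = RolloutState(rollout_id=rid, dp_rank=rid % self.dp_size)
        return rid

    def append_turn(self, rid: int, token_ids: list, masks: list, logprobs: list) -> None:
        """Append one turn's token-level data to the rollout's trajectory."""
        st = self._rollouts[rid]
        st.token_ids += token_ids
        st.masks += masks
        st.logprobs += logprobs

    def fetch(self, rid: int) -> dict:
        """Return the accumulated trajectory and release the rollout's state."""
        st = self._rollouts.pop(rid)
        return {"ids": st.token_ids, "masks": st.masks, "logprobs": st.logprobs}
```

In the real PR these operations would sit behind FastAPI endpoints under `/v1/rollouts/...`; the sketch only shows the state machine a client would drive.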
Note
Medium Risk
Adds new FastAPI endpoints and per-rollout state management inside the vLLM inference server, plus tweaks DP/api-server scaling behavior; issues here could break inference serving or rollout collection in production.
Overview
Introduces a new inference-side rollout gateway (`/v1/rollouts/...`) that lets clients register multi-turn rollouts, route requests to a data-parallel rank, and fetch token-level trajectories (ids/masks/logprobs), with optional verbose turn logging. Updates inference config to add `log_rollout_gateway_turns` and `auto_scale_api_servers` (gating the existing dp → `api_server_count` auto-scaling), and mounts/initializes the gateway in the vLLM server (disabled when `api_server_count > 1`). Also updates orchestrator metric logging to pass an explicit `step`.

Written by Cursor Bugbot for commit 27806a4.