RLM: remove code jail -> simplify code by snimu · Pull Request #800 · PrimeIntellect-ai/verifiers

snimu · 2026-01-28T19:40:33Z

Description

There previously was a very basic code jail for the RLM, but this adds a lot of complexity for very little gain. I made the decision that users simply need to accept risks if they run the RLM locally, and if they don't want that they should run it on the sandbox. This PR removes the jail, which allows for a strong simplification of the RLM code. Tested with vf-eval (both execution backends and both repl languages).

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Note

Medium Risk
Removes local execution guardrails for RLMEnv, which expands what locally-executed user code can access and could increase the impact of running untrusted tasks outside a sandbox.

Overview
RLMEnv no longer applies the best-effort local filesystem/code jail: the FilesystemJail utility and its unit tests are removed, and the docs drop the claim that local Python execution is jailed.

The Python worker script generation in rlm_env.py is simplified and unified between local and sandbox modes (single context loader, single sub-LLM timeout config, no restriction/env-var plumbing, and consistent root-tool HTTP timeout behavior).

^{Written by Cursor Bugbot for commit b73e858. This will update automatically on new commits. Configure here.}

* Remove RLM local guardrails * Unify RLM worker context setup * ruff format * Unify RLM worker timeouts

snimu added 4 commits January 28, 2026 11:10

Remove RLM local guardrails

a437c80

Unify RLM worker context setup

7ace150

ruff format

7bb1eda

Unify RLM worker timeouts

b73e858

snimu changed the title ~~Sebastian/rlm remove code jail 2026 01 28~~ RLM: remove code jail -> simplify code Jan 28, 2026

snimu merged commit 6ba6195 into main Jan 28, 2026
6 checks passed

mikasenghaas pushed a commit that referenced this pull request Jan 29, 2026

RLM: remove code jail -> simplify code (#800)

8145653

* Remove RLM local guardrails * Unify RLM worker context setup * ruff format * Unify RLM worker timeouts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

RLM: remove code jail -> simplify code#800

RLM: remove code jail -> simplify code#800
snimu merged 4 commits intomainfrom
sebastian/rlm-remove-code-jail-2026-01-28

snimu commented Jan 28, 2026 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

snimu commented Jan 28, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Testing

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

snimu commented Jan 28, 2026 •

edited by cursor bot

Loading