Skip to content

Comments

RLM: remove code jail -> simplify code#800

Merged
snimu merged 4 commits intomainfrom
sebastian/rlm-remove-code-jail-2026-01-28
Jan 28, 2026
Merged

RLM: remove code jail -> simplify code#800
snimu merged 4 commits intomainfrom
sebastian/rlm-remove-code-jail-2026-01-28

Conversation

@snimu
Copy link
Contributor

@snimu snimu commented Jan 28, 2026

Description

There previously was a very basic code jail for the RLM, but this adds a lot of complexity for very little gain. I made the decision that users simply need to accept risks if they run the RLM locally, and if they don't want that they should run it on the sandbox. This PR removes the jail, which allows for a strong simplification of the RLM code. Tested with vf-eval (both execution backends and both repl languages).

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update
  • Test improvement

Testing

  • All existing tests pass when running uv run pytest locally.
  • New tests have been added to cover the changes

Checklist

  • My code follows the style guidelines of this project as outlined in AGENTS.md
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • Any dependent changes have been merged and published

Note

Medium Risk
Removes local execution guardrails for RLMEnv, which expands what locally-executed user code can access and could increase the impact of running untrusted tasks outside a sandbox.

Overview
RLMEnv no longer applies the best-effort local filesystem/code jail: the FilesystemJail utility and its unit tests are removed, and the docs drop the claim that local Python execution is jailed.

The Python worker script generation in rlm_env.py is simplified and unified between local and sandbox modes (single context loader, single sub-LLM timeout config, no restriction/env-var plumbing, and consistent root-tool HTTP timeout behavior).

Written by Cursor Bugbot for commit b73e858. This will update automatically on new commits. Configure here.

@snimu snimu changed the title Sebastian/rlm remove code jail 2026 01 28 RLM: remove code jail -> simplify code Jan 28, 2026
@snimu snimu merged commit 6ba6195 into main Jan 28, 2026
6 checks passed
mikasenghaas pushed a commit that referenced this pull request Jan 29, 2026
* Remove RLM local guardrails

* Unify RLM worker context setup

* ruff format

* Unify RLM worker timeouts
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant