Merged
Conversation
mikasenghaas
pushed a commit
that referenced
this pull request
Jan 29, 2026
* Remove RLM local guardrails * Unify RLM worker context setup * ruff format * Unify RLM worker timeouts
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Description
There previously was a very basic code jail for the RLM, but this adds a lot of complexity for very little gain. I made the decision that users simply need to accept risks if they run the RLM locally, and if they don't want that they should run it on the sandbox. This PR removes the jail, which allows for a strong simplification of the RLM code. Tested with vf-eval (both execution backends and both repl languages).
Type of Change
Testing
uv run pytestlocally.Checklist
Note
Medium Risk
Removes local execution guardrails for
RLMEnv, which expands what locally-executed user code can access and could increase the impact of running untrusted tasks outside a sandbox.Overview
RLMEnvno longer applies the best-effort local filesystem/code jail: theFilesystemJailutility and its unit tests are removed, and the docs drop the claim that local Python execution is jailed.The Python worker script generation in
rlm_env.pyis simplified and unified between local and sandbox modes (single context loader, single sub-LLM timeout config, no restriction/env-var plumbing, and consistent root-tool HTTP timeout behavior).Written by Cursor Bugbot for commit b73e858. This will update automatically on new commits. Configure here.