Expose render_completion + add_trajectory_step, private/final guardrails by willccbb · Pull Request #679 · PrimeIntellect-ai/verifiers

willccbb · 2026-01-04T06:32:06Z

Description

Elevates render_completion to an overridable MultiTurnEnv method, and adds an overridable method add_trajectory_step for setting step-level rewards/advantages/extras. Logic not intended for overriding is marked as final if public-facing, private if internal. Additional methods in Environment/EnvGroup are marked as private/final (or removed if no longer needed).

Minor reorganizations + docstring improvements.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Test improvement

Testing

All existing tests pass when running uv run pytest locally.
New tests have been added to cover the changes

Checklist

My code follows the style guidelines of this project as outlined in AGENTS.md
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Additional Notes

Note

API additions and overrides

MultiTurnEnv: new overridable render_completion and add_trajectory_step; rollout marked final; env_response is abstract; completion rendered at end of rollout.
experimental/RLMEnv: implements render_completion to ignore sub-LLM steps when computing completion.

Core refactors and guardrails

Environment: internal helpers renamed private (_format_dataset, _format_completion_dataset, _get_eval_inputs); several public methods marked final (get_dataset, get_eval_dataset, init_state, is_completed, run_rollout, run_group); removed internal completion rendering; minor doc/docstring updates.
EnvGroup: aligns to private dataset formatters (_format_dataset, _format_completion_dataset); rollout marked final; docstrings clarified.

Misc

ToolRubric and data_utils: minor typing/cast cleanups without behavior changes.

^{Written by Cursor Bugbot for commit 4546247. This will update automatically on new commits. Configure here.}

…jectory_step

verifiers/envs/multiturn_env.py

Restructure final/private methods, expose render_completion + add_tra…

f8065b2

…jectory_step

willccbb marked this pull request as ready for review January 4, 2026 06:32

willccbb requested review from mikasenghaas and snimu January 4, 2026 06:32

ty fix

10f718f

snimu reviewed Jan 4, 2026

View reviewed changes

verifiers/envs/multiturn_env.py Show resolved Hide resolved

willccbb added 3 commits January 4, 2026 09:57

ty fixes

b499860

Merge branch 'main' into will/abstract-render-completion

f881161

RLM render_completion

4546247

snimu approved these changes Jan 4, 2026

View reviewed changes

willccbb merged commit 0cba34b into main Jan 4, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Expose render_completion + add_trajectory_step, private/final guardrails#679

Expose render_completion + add_trajectory_step, private/final guardrails#679
willccbb merged 5 commits intomainfrom
will/abstract-render-completion

willccbb commented Jan 4, 2026 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

willccbb commented Jan 4, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Testing

Checklist

Additional Notes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

willccbb commented Jan 4, 2026 •

edited by cursor bot

Loading