Skip to content

Comments

Bump verifiers#1468

Merged
mikasenghaas merged 9 commits intomainfrom
bump-vf-e4d8bf0
Dec 22, 2025
Merged

Bump verifiers#1468
mikasenghaas merged 9 commits intomainfrom
bump-vf-e4d8bf0

Conversation

@mikasenghaas
Copy link
Member

@mikasenghaas mikasenghaas commented Dec 22, 2025

Bump verifiers to latest main, notable changes:

  • #657: Improvements to math rubric
  • #658: Fix import for PythonEnv

Also removes prime-sandboxes from pyproject, as its an environment dependency.

Also adds configs that plays around with updated INTELLECT-3 training environments from #48 in research-environments.


Note

Adds env-mix RL configs (incl. math-python sandbox) and bumps verifiers to cdbc417 while removing prime-sandboxes from project deps.

  • Configs:
    • Add configs/env_mix/env_mix.toml and configs/env_mix/README.md for mixed-environment RL.
    • Orchestrates math, code, science, logic, and math-python (with sandbox, python tool, and resource limits).
    • Sets training/inference params (GPU IDs, batch size, rollouts, WandB, Qwen/Qwen3-4B-Instruct-2507, tool-call parser, max model len).
  • Dependencies:
    • Update verifiers source rev to cdbc417 in pyproject.toml.
    • Remove prime-sandboxes from top-level project dependencies.

Written by Cursor Bugbot for commit 7b07785. This will update automatically on new commits. Configure here.

@mikasenghaas mikasenghaas changed the title Bump vf Bump verifiers Dec 22, 2025
@samsja samsja marked this pull request as ready for review December 22, 2025 16:54
Copy link
Member

@samsja samsja left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@mikasenghaas mikasenghaas merged commit 1bb597d into main Dec 22, 2025
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants