Skip to content

Conversation

@bradhilton
Copy link
Collaborator

No description provided.

- Introduced conditional epsilon values based on PPO setting.
- Default epsilon values adjusted for improved flexibility in loss calculations.
- Cleaned up logic for handling epsilon and epsilon_high parameters.
@bradhilton bradhilton merged commit 2daa845 into main Jan 2, 2026
2 checks passed
@bradhilton bradhilton deleted the feat/default-to-cispo branch January 2, 2026 19:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants