[Refactor] Update the example usage for the @osmosis_rubric#11
Merged
…smosis-git-sync-example into brian/reward_rubric
… module. Updated method names for clarity, streamlined extra info construction, and enhanced dataset loading logic.
The failing test is expected, because we refactored the
BaiqingL approved these changes on Oct 31, 2025.
This pull request modernizes the rubric scoring workflow and improves usability, maintainability, and dataset support for the support conversation evaluation system. The most important changes include a refactor of the rubric scorer script to use a schema-driven config and dataset loader, updates to the workflow and documentation to support new usage patterns, and the introduction of a sample dataset for batch evaluation.
Rubric evaluation system refactor:
- Refactored `reward_rubric.py` to use schema-driven config loading, dataset records, and a simplified entrypoint (`score_support_conversation`). The script now loads YAML config and JSONL data, supports batch evaluation, and handles provider/model selection via config or environment.
- Updated `reward_rubric_config.yaml` to use a versioned schema with a `rubrics[]` array, separating rubric details and supporting multiple rubrics and default values.

Dataset and example improvements:
- Updated `reward_rubric_example.json` to use a flat structure (`solution_str`, `original_input`, `ground_truth`) instead of a message array, matching the new dataset format.
- Added `sample_data.jsonl` as a JSONL dataset for batch rubric evaluation and CLI preview, with multiple conversation records.

Workflow and script updates:
- Updated the GitHub Actions workflow (`reward_rubric.yml`) to call the new shell script (`run_reward_rubric.sh`) and to trigger on changes to the example and scorer script, not just the config.
- Updated `run_reward_rubric.sh` to invoke the scorer as a Python module and accept CLI arguments for alternate data files.

Documentation enhancements:
- Updated `README.md` to explain the new config schema, script usage, dataset format, and CLI options for previewing and evaluating rubrics.
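As a rough sketch of how a versioned config with a `rubrics[]` array and shared defaults might be consumed: the `version`, `defaults`, and per-rubric field names below are illustrative assumptions, not taken from the actual `reward_rubric_config.yaml`.

```python
# Hypothetical sketch of consuming a versioned, rubrics[]-based config.
# Only the rubrics[] array is from the PR; other keys are assumptions.
SUPPORTED_SCHEMA_VERSION = 1

def load_rubrics(config: dict) -> list[dict]:
    """Validate the schema version and merge shared defaults into each rubric."""
    if config.get("version") != SUPPORTED_SCHEMA_VERSION:
        raise ValueError(f"unsupported config version: {config.get('version')!r}")
    defaults = config.get("defaults", {})
    # Per-rubric keys override the shared defaults.
    return [{**defaults, **rubric} for rubric in config["rubrics"]]

config = {
    "version": 1,
    "defaults": {"model": "gpt-4o-mini", "max_score": 1.0},  # assumed keys
    "rubrics": [
        {"name": "helpfulness", "prompt": "Was the reply helpful?"},
        {"name": "accuracy", "prompt": "Was the reply accurate?", "max_score": 2.0},
    ],
}
rubrics = load_rubrics(config)
```

A defaults-plus-override merge like this is one common way a `rubrics[]` schema keeps per-rubric entries short while still allowing each rubric to pin its own model or score range.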
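The flat record fields (`solution_str`, `original_input`, `ground_truth`) come from the PR; the loader itself is a minimal sketch of how a JSONL file such as `sample_data.jsonl` could be read for batch evaluation.

```python
import io
import json

def load_records(fp) -> list[dict]:
    """Sketch of a JSONL loader: one flat record per line, blank lines skipped."""
    records = []
    for line in fp:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        # The flat fields the PR's example file uses.
        missing = {"solution_str", "original_input", "ground_truth"} - record.keys()
        if missing:
            raise ValueError(f"record missing fields: {sorted(missing)}")
        records.append(record)
    return records

# A file-like stand-in for a real sample_data.jsonl (contents invented).
sample = io.StringIO(
    '{"solution_str": "Try resetting the router.", '
    '"original_input": "My internet is down.", '
    '"ground_truth": "Suggest a router reset."}\n'
)
records = load_records(sample)
```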
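A module-style entrypoint that accepts an alternate data file might look like the following sketch; the flag names (`--data`, `--preview`) are assumptions for illustration, not the script's real CLI.

```python
import argparse
import json
from pathlib import Path

def build_parser() -> argparse.ArgumentParser:
    # Flag names here are illustrative assumptions, not the real CLI surface.
    parser = argparse.ArgumentParser(prog="python -m reward_rubric")
    parser.add_argument("--data", type=Path, default=Path("sample_data.jsonl"),
                        help="alternate JSONL dataset to evaluate")
    parser.add_argument("--preview", action="store_true",
                        help="print records instead of scoring them")
    return parser

def main(argv=None) -> int:
    args = build_parser().parse_args(argv)
    lines = args.data.read_text().splitlines()
    records = [json.loads(line) for line in lines if line.strip()]
    for record in records:
        if args.preview:
            print(record["original_input"])
        # else: score the record, e.g. via score_support_conversation(record)
    return 0
```

Running the scorer with `python -m` (as `run_reward_rubric.sh` now does) keeps package-relative imports working, and routing the data path through a CLI argument is what lets the workflow swap in files other than the default dataset.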