[BREAKING][misc] feat: reorganize Local/Remote rollout docs and switch dataset support to parquet/jsonl/csv by JoyboyBrian · Pull Request #62 · Osmosis-AI/osmosis-sdk-python

JoyboyBrian · 2026-02-17T23:08:39Z

What

Reorganized documentation into a clearer IA:
- split rollout docs into docs/local-rollout/* and docs/remote-rollout/*
- moved shared topics to top-level docs (docs/datasets.md, docs/test-mode.md, docs/eval-mode.md)
- added new reference pages (docs/configuration.md, docs/troubleshooting.md)
Rewrote README.md for a two-mode onboarding flow (Local Rollout vs Remote Rollout), with badges/logo and links to dedicated example repos.
Updated supporting docs and examples (docs/cli.md, examples/README.md, SECURITY.md) for consistency and clarity.
Updated dataset handling in eval/test workflows:
- added CSV support in DatasetReader
- removed .json dataset support
- standardized supported formats to .parquet (recommended), .jsonl, .csv
Synced CLI help text for dataset flags in eval/test commands.
Expanded dataset unit tests with CSV coverage and migrated JSON-based tests to JSONL.
Added explicit Python classifiers (3.10, 3.11, 3.12, 3.13) in pyproject.toml.

Why

The previous docs structure mixed concepts across local and remote workflows, making onboarding and navigation harder.
Users need a mode-first path from install → setup → test/eval.
CSV support lowers adoption friction for common tabular workflows, while standardizing formats simplifies validation and maintenance.
Aligning code behavior, CLI help, and docs reduces confusion and support overhead.

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Refactor (code change that neither fixes a bug nor adds a feature)
Documentation update
CI/build configuration change

How to Test

uv run pytest tests/unit/rollout/eval/common/test_dataset.py
uv run pytest
uv run ruff check .
uv run ruff format --check .
uv run pyright osmosis_ai/

Manual sanity checks:

Verify .csv datasets work with osmosis test and osmosis eval.
Verify .json datasets now fail with DatasetParseError.
Verify key docs links from README.md and docs/README.md.

Checklist

PR title follows [module] type: description format
Appropriate labels added (e.g. enhancement, bug, breaking)
ruff check . and ruff format --check . pass
pyright osmosis_ai/ passes
pytest passes (new tests added if applicable)
Public API changes are documented
No secrets or credentials included

Summary by cubic

Reorganized docs into Local vs Remote rollout guides with a simpler README and clearer onboarding. Standardized datasets to Parquet, JSONL, and CSV (dropped .json), updated CLI/help/tests, and improved agent loop docs; the base loop now emits structured tool messages.

New Features
- Split docs into local-rollout/* and remote-rollout/*; added CLI, configuration, datasets, test-mode, eval-mode, and troubleshooting; removed redundant sections.
- Simplified README with logo/badges, two-mode onboarding, and links to example repos.
- Standardized datasets: .parquet (recommended), .jsonl, .csv; strict CSV header validation with clear errors; updated CLI help; added CSV tests and migrated JSON tests to JSONL.
- Clarified Remote Rollout agent loop docs and improved docstrings; base loop parses tool args and emits structured "tool" messages with latency tracking.
- Added Python 3.10–3.13 classifiers; simplified CODEOWNERS.
Migration
- Replace any .json datasets with .jsonl, .parquet, or .csv.
- Ensure CSV files include required headers and matching columns.
- Update links to the new docs structure; verify with osmosis test and osmosis eval.

^{Written for commit 7123f53. Summary will update on new commits.}

- Consolidated dataset format information across multiple documents, emphasizing support for Parquet, JSONL, and CSV formats. - Updated CLI help messages to reflect the recommended dataset file formats. - Removed redundant sections from the README and examples to streamline content. - Added new tests for CSV file handling in the DatasetReader class.

- Enhanced the README to clearly outline the two training modes: Local Rollout and Remote Rollout, including their respective workflows and example repositories. - Updated CLI documentation to reflect new testing and evaluation commands, consolidating agent testing and evaluation sections for better usability. - Removed outdated references and redundant sections to streamline content across documentation. - Added links to example repositories for both training modes to facilitate user onboarding.

- Added support for Python versions 3.10 to 3.13 in pyproject.toml. - Enhanced README with additional badges for platform, PyPI version, Python compatibility, license, and documentation links for better visibility.

- Expanded the README to include a detailed Quick Start guide for Local Rollout, outlining installation steps and example commands. - Updated prerequisites section in the README to clarify requirements for using the SDK. - Revised CLI documentation to specify host and port options for the `osmosis serve` command. - Added notes regarding breaking changes and command splits in eval and test modes. - Improved clarity in various documentation sections, including agent loop and testing utilities.

osmosis_ai/rollout/eval/common/dataset.py

cubic-dev-ai

3 issues found across 29 files

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="osmosis_ai/rollout/eval/common/dataset.py">

<violation number="1" location="osmosis_ai/rollout/eval/common/dataset.py:172">
P2: CSV rows with more columns than the header can produce a None key in DictReader, which will crash in _validate_row (k.lower()) with an AttributeError instead of a dataset parse/validation error. Guard against None keys or raise a DatasetParseError when extra columns are present.</violation>
</file>

<file name="docs/remote-rollout/overview.md">

<violation number="1" location="docs/remote-rollout/overview.md:62">
P3: The example now defines three methods (`get_tools`, `execute_tool`, `run`) but the surrounding text still says to implement two methods. Update the narrative to match the new example so readers aren’t misled.</violation>
</file>

<file name="docs/remote-rollout/deployment.md">

<violation number="1" location="docs/remote-rollout/deployment.md:66">
P2: The Compose healthcheck uses `curl`, but the provided Dockerfile examples are based on `python:3.11-slim` and do not install curl. In the documented setup this healthcheck will fail, so use a dependency-free check (e.g., Python) or note the need to install curl.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

osmosis_ai/rollout/eval/common/dataset.py

docs/remote-rollout/deployment.md

docs/remote-rollout/overview.md

…reamline content and avoid redundancy.

…taset parsing - Clarified the required methods for implementing the agent loop in the remote rollout documentation. - Improved error handling in the DatasetReader class to check for mismatched columns in CSV files, raising a DatasetParseError for better debugging.

osmosis_ai/rollout/eval/common/dataset.py

cubic-dev-ai

2 issues found across 10 files (changes from recent commits).

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="docs/configuration.md">

<violation number="1" location="docs/configuration.md:45">
P3: This line implies the settings classes always auto-load environment variables and `.env` files, but the code only does that when `pydantic-settings` is installed; otherwise they fall back to `BaseModel` without env loading. Please qualify this statement so it matches actual behavior.</violation>
</file>

<file name="docs/troubleshooting.md">

<violation number="1" location="docs/troubleshooting.md:416">
P3: The new `--log` flag description conflicts with the unchanged `{debug_dir}` path example below, so the doc now references two different directory names. Update the path placeholder to match `--log` (e.g., `{log_dir}`) or adjust the wording so they’re consistent.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

docs/configuration.md

docs/troubleshooting.md

- Removed redundant sections and streamlined content across various documentation files, including the README and local rollout guides. - Updated the README to better outline the Local and Remote Rollout modes, including quick start links and relevant documentation. - Clarified the usage of reward functions and rubric evaluators, ensuring consistency in examples and explanations. - Improved error handling documentation and removed outdated references to enhance user experience.

cubic-dev-ai

3 issues found across 33 files

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="docs/test-mode.md">

<violation number="1" location="docs/test-mode.md:31">
P2: The programmatic example uses `async with`/`await` at top level, which is invalid Python outside an `async def`. Wrap this in an async function and call it with `asyncio.run(...)` so users can run the snippet.</violation>
</file>

<file name="docs/local-rollout/reward-functions.md">

<violation number="1" location="docs/local-rollout/reward-functions.md:7">
P2: This overstates the `@osmosis_reward` requirements. The decorator only enforces the classic signature when the first parameter is `solution_str`; other signatures are accepted without validation, so saying all functions must match this exact signature is misleading.</violation>

<violation number="2" location="docs/local-rollout/reward-functions.md:23">
P3: `**kwargs` is not required by `@osmosis_reward`; the decorator only drops `data_source` when the function can’t accept it. This line misleads users into adding `**kwargs` unnecessarily.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

docs/test-mode.md

docs/local-rollout/reward-functions.md

JoyboyBrian added 6 commits February 17, 2026 12:59

Add logo to README for improved visual appeal

507f0de

Update pyproject.toml and README for Python version support and badges

c8ef3ef

- Added support for Python versions 3.10 to 3.13 in pyproject.toml. - Enhanced README with additional badges for platform, PyPI version, Python compatibility, license, and documentation links for better visibility.

simplify README

a8b820d

JoyboyBrian requested review from BaiqingL, JakeTrock, allenlinsh, artem-osmosis, mathewjhan and pandyamarut as code owners February 17, 2026 23:08

github-actions bot approved these changes Feb 17, 2026

View reviewed changes

github-code-quality bot found potential problems Feb 17, 2026

View reviewed changes

osmosis_ai/rollout/eval/common/dataset.py Fixed Show fixed Hide fixed

JoyboyBrian removed request for BaiqingL, JakeTrock, allenlinsh, artem-osmosis, mathewjhan and pandyamarut February 17, 2026 23:10

cubic-dev-ai bot reviewed Feb 17, 2026

View reviewed changes

osmosis_ai/rollout/eval/common/dataset.py Outdated Show resolved Hide resolved

docs/remote-rollout/deployment.md Outdated Show resolved Hide resolved

docs/remote-rollout/overview.md Outdated Show resolved Hide resolved

JoyboyBrian added 2 commits February 17, 2026 15:19

Remove deployment documentation from various sections and files to st…

fbf4139

…reamline content and avoid redundancy.

github-code-quality bot found potential problems Feb 17, 2026

View reviewed changes

osmosis_ai/rollout/eval/common/dataset.py Fixed Show fixed Hide fixed

simplify docs

c78730e

cubic-dev-ai bot reviewed Feb 18, 2026

View reviewed changes

docs/configuration.md Outdated Show resolved Hide resolved

docs/troubleshooting.md Outdated Show resolved Hide resolved

JoyboyBrian added 2 commits February 17, 2026 16:10

simplify codeowners

b22c8f3

error handling

93c594e

JoyboyBrian marked this pull request as draft February 18, 2026 00:13

JoyboyBrian added 3 commits February 17, 2026 16:31

warning

95c35df

improve docstring

7123f53

JoyboyBrian added documentation Improvements or additions to documentation breaking Breaking changes labels Feb 18, 2026

JoyboyBrian marked this pull request as ready for review February 18, 2026 01:08

JoyboyBrian merged commit 120ce99 into main Feb 18, 2026
16 checks passed

cubic-dev-ai bot reviewed Feb 18, 2026

View reviewed changes

docs/test-mode.md Show resolved Hide resolved

docs/local-rollout/reward-functions.md Show resolved Hide resolved

docs/local-rollout/reward-functions.md Show resolved Hide resolved

JoyboyBrian deleted the brian/docs branch February 20, 2026 18:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[BREAKING][misc] feat: reorganize Local/Remote rollout docs and switch dataset support to parquet/jsonl/csv#62

[BREAKING][misc] feat: reorganize Local/Remote rollout docs and switch dataset support to parquet/jsonl/csv#62
JoyboyBrian merged 14 commits intomainfrom
brian/docs

JoyboyBrian commented Feb 17, 2026 •

edited by cubic-dev-ai bot

Loading

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

JoyboyBrian commented Feb 17, 2026 • edited by cubic-dev-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

Type of Change

How to Test

Checklist

Summary by cubic

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

JoyboyBrian commented Feb 17, 2026 •

edited by cubic-dev-ai bot

Loading