Feat: Implementations now pull schemas #7

flrping · 2026-02-10T18:20:54Z

For the sake of less cohesion, schemas and the implementations have split repositories. With this change, implementations must pull schemas from sources as zip files. Version bump marked as patch, as this needs proper testing and improvements.

Summary by cubic

Implementations now load JSON schemas from remote ZIP sources configured in the app, replacing bundled schemas. CI builds publish packages using implementation versions only, and workflows now watch implementation paths.

New Features
- Java: SchemaLoader downloads and streams ZIPs from URLs in application.yml with better logging, error handling, and resource cleanup.
- Rust: SchemaLoader loads from sources.yaml using ureq + zip + serde_yaml; removed rust-embed.
- Tests and configs added for remote loading.
Migration
- Remove local schemas; set source URLs in java/src/main/resources/application.yml or rust/resources/sources.yaml.
- CI: triggers no longer include schemas/**; versions read from pom/Cargo.toml; full history fetched (fetch-depth: 0).
- Versions bumped: Java 1.0.1, Rust 0.0.5.

^{Written for commit b105e30. Summary will update on new commits.}

Summary by CodeRabbit

New Features
- Remote schema loading from external ZIP sources; dynamic changelog generation for releases
Refactor
- Unified versioning and simplified CI release flow for Java and Rust
- Schema handling moved from bundled resources to remote-sourced catalogs
Chores
- Java version bumped to 1.0.1; Rust bumped to 0.0.5; added external sources configuration
- Removed bundled schema files from the repository
Tests
- Added basic SchemaLoader unit tests

coderabbitai · 2026-02-10T18:21:13Z

Walkthrough

Schema management moved from embedded/local files to configurable remote ZIP sources; Java and Rust loaders now fetch and cache schemas at startup. CI workflows switched to implementation-driven versioning and dynamic changelog generation. Many local JSON schema files and schemas/VERSION were removed.

Changes

Cohort / File(s)	Summary
CI / Workflows `.github/workflows/ci-java.yml`, `.github/workflows/ci-rust.yml`	Removed schema-path triggers and VERSION reads; switched to single implementation-driven version output; added changelog generation and `fetch-depth: 0`; updated tagging, release naming, and artifact paths to use new version value.
Java Build & Config `java/pom.xml`, `java/src/main/resources/application.yml`	Bumped Java package version `1.0.0→1.0.1`; removed resource copy of schemas/VERSION; added surefire plugin and `jackson-dataformat-yaml`; added application.yml with logging and remote sources list.
Java SchemaLoader & Tests `java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java`, `java/src/test/java/net/hydrius/pacts/core/SchemaLoaderTest.java`, `java/src/test/.../PactsServiceTest.java`, `java/src/test/.../PactsConfig.java`, `java/src/test/resources/application.yml`	Introduced remote ZIP schema loading into in-memory cache (`loadRemoteSchemas()`); constructor and `loadSchema(category,name)` signature/behavior changed (returns null on miss in Java); added logging, timeouts, ZIP-entry safety; tests added/updated and some test setup/config methods now declare `throws IOException`; test resources include sources URL.
Rust Build & Resources `rust/Cargo.toml`, `rust/build.rs`, `rust/resources/sources.yaml`	Bumped crate `0.0.4→0.0.5`; replaced `rust-embed` with `zip` and added `serde_yaml`, `ureq`; `build.rs` now copies `resources/sources.yaml` to OUT_DIR and requires its presence; added `sources.yaml` listing remote ZIP URL.
Rust SchemaLoader `rust/src/core/schema_loader.rs`	Refactored to fetch and parse remote ZIP(s) at construction, populate in-memory cache, and make `load_schema` cache-only (panics on miss); added sources parsing and ZIP processing helpers; struct stores `schema_root`, `domain`, `version`.
Removed Embedded Schemas `schemas/VERSION`, `schemas/bees/v1/...` (many JSON files)	Deleted `schemas/VERSION` and numerous `bees/v1` JSON schema files across `definition`, `inventory`, `invitation`, `player`, `profile`, `server` directories; schemas are now expected from remote ZIP sources.

Sequence Diagram

sequenceDiagram
    participant App as Application
    participant Loader as SchemaLoader
    participant Remote as Remote Source (GitHub ZIP)
    participant Cache as Schema Cache
    participant FS as Filesystem (Java only)

    App->>Loader: new(schemaRoot, domain, version)
    Loader->>Loader: loadSourcesConfig()
    Loader->>Remote: fetch ZIP(s)
    Remote-->>Loader: ZIP stream(s)
    Loader->>Loader: parse ZIP entries (in-memory)
    Loader->>Cache: populate with JSON schemas
    Loader-->>App: ready

    App->>Loader: loadSchema(category, name)
    alt Cache hit
        Loader->>Cache: lookup
        Cache-->>Loader: schema
        Loader-->>App: schema
    else Cache miss (Java)
        Loader->>FS: attempt filesystem read
        alt File found
            FS-->>Loader: schema
            Loader-->>App: schema
        else
            Loader-->>App: null
        end
    else Cache miss (Rust)
        Loader->>Loader: panic / fail fast
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~70 minutes

Possibly related PRs

Schema versioning #4: Modifies the same CI workflows, schema/version handling, and SchemaLoader implementations; changes overlap and replace prior local-schema/version approach.

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and specifically describes the main change: implementations now pull schemas from remote sources instead of bundling them locally.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch ci/pull-schemas

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 11

🤖 Fix all issues with AI agents

In @.github/workflows/ci-java.yml:
- Around line 64-66: The mv commands reference a nonexistent step output
(steps.version.outputs.impl_version) causing empty substitutions; update those
expressions to use the correct step id (steps.impl.outputs.impl_version) or
simply remove the entire rename block since the combined-version build was
removed and the mv operations are no-ops (renaming files to the same name).
Locate the three mv lines (and duplicates later around lines referenced in the
review) that move target/pacts-${{ steps.impl.outputs.impl_version }}*.jar and
either replace steps.version with steps.impl in all occurrences or delete the mv
block entirely to avoid failing the workflow.

In @.github/workflows/ci-rust.yml:
- Around line 56-71: The workflow uses the wrong step id: references to
steps.version.outputs.impl_version should be steps.impl.outputs.impl_version;
update all occurrences of steps.version.outputs.impl_version (used in tag_name,
name, body, and files for libpacts artifact) to steps.impl.outputs.impl_version
so the GitHub Action reads the implementation version from the actual step id
`impl`.

In `@java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java`:
- Around line 106-141: The hand-rolled YAML parsing in loadRemoteSchemas
incorrectly treats every line starting with "- " as a source; replace it with a
proper YAML parse using SnakeYAML or Jackson's YAML module to load
application.yml into a JsonNode/Map and then safely extract the "sources" array
(use objectMapper.convertValue or similar to turn the parsed YAML into
JsonNode), updating the ObjectNode settings only with the actual sources node;
ensure you reference loadRemoteSchemas, settings, objectMapper, and the
"sources" key when locating the change and add appropriate null-checks and
exception logging for malformed or missing "sources".
- Around line 150-152: The HttpURLConnection in SchemaLoader's constructor
(where URL url = new URL(source); HttpURLConnection connection =
(HttpURLConnection) url.openConnection(); connection.setRequestMethod("GET");)
lacks connect/read timeouts and can block indefinitely; set reasonable timeouts
(e.g., connection.setConnectTimeout(...) and connection.setReadTimeout(...))
before connecting (or before any read) to prevent startup hangs—prefer using
sensible defaults or pulling values from config if available and ensure these
calls occur on the same HttpURLConnection instance created in the constructor.
- Around line 148-214: The ZIP/HTTP streams and connection leak on exceptions in
the schema-loading loop; wrap the URL/connection/input streams in
try-with-resources so InputStream `in` and `ZipInputStream zipIn` are closed
automatically (e.g., create the connection, then use try (InputStream in =
connection.getInputStream(); ZipInputStream zipIn = new ZipInputStream(in)) {
... } ), move the per-entry processing inside that block, and ensure the
HttpURLConnection `connection` is disconnected in a finally block (or call
connection.disconnect() after the try-with-resources) so `connection`, `in`, and
`zipIn` are always cleaned up even on ZipException/IOException in the code that
reads entries and populates `cache`.

In `@java/src/test/java/net/hydrius/pacts/core/SchemaLoaderTest.java`:
- Around line 11-19: The test currently invokes SchemaLoader(...) which calls
loadRemoteSchemas() and performs real network I/O on every `@BeforeEach` run under
`@TestInstance`(PER_CLASS); to fix, refactor so tests avoid network by either (A)
changing the test to use `@BeforeAll` instead of `@BeforeEach` if you intend
PER_CLASS, or (B) instantiate SchemaLoader with a test-friendly
constructor/parameter or dependency injection (e.g., a Supplier/Provider or
boolean skipRemote flag) so the constructor does not call loadRemoteSchemas(),
or (C) create a test subclass or mock of SchemaLoader that overrides
loadRemoteSchemas() to no-op; update SchemaLoader's constructors (and the test
setup method) accordingly and ensure tests call the non-networking constructor
or override method instead of triggering real downloads.

In `@java/src/test/resources/application.yml`:
- Around line 9-10: Tests currently depend on a live GitHub URL in
application.yml which causes flaky/failed tests offline or when remote is
unavailable; update tests to be hermetic by either (a) bundling a small test
schema into test resources and point the test config to a local file URL instead
of "https://..." or (b) mock/stub the network fetch in SchemaLoader (e.g., stub
the method that fetches the ZIP or inject a test SchemaLoader implementation) so
tests use the packaged test schema; change the test application.yml value and/or
add a test-only SchemaLoader mock and ensure test setup uses that mock.

In `@rust/build.rs`:
- Around line 14-19: The code currently walks three parent() levels from OUT_DIR
to compute target_dir which is fragile; stop traversing parents and use OUT_DIR
directly (the out_dir variable from env::var("OUT_DIR")). Replace any use of the
computed target_dir with out_dir and write build artifacts into OUT_DIR, and
update runtime usage to load those artifacts via include_str! or
env!("OUT_DIR")/concat! as appropriate so the runtime reads the file from the
canonical build-script output location instead of relying on Cargo's internal
layout.

In `@rust/Cargo.toml`:
- Line 15: Update the zip dependency in Cargo.toml from version "0.6" to "7.4.0"
and then audit usages of the crate (look for ZipWriter, ZipArchive, ZipOptions
and any functions that construct/read archives) to accommodate breaking changes
in 7.x: change any removed or renamed method calls, adjust error types and
Result handling, and update feature names if necessary (ensure the equivalent
compression feature to "deflate" is enabled); run cargo build/tests and fix API
mismatches accordingly.

In `@rust/src/core/schema_loader.rs`:
- Around line 119-167: The load_sources_config function currently opens a
hardcoded "rust/resources/sources.yaml" and parses it line-by-line; instead,
embed or properly locate the YAML and use a real YAML parser: replace the manual
file-open + line parsing in load_sources_config with one of two approaches — (A)
embed the config at compile time via include_str!("resources/sources.yaml") (or
include_bytes!) and parse that string using serde_yaml::from_str to deserialize
into Vec<String>, or (B) locate the runtime copy from build.rs by reading the
path from OUT_DIR (or a well-known config path) and then parse with
serde_yaml::from_reader; remove the fragile ad-hoc trimming logic and ensure
errors propagate (return Err) instead of silently falling back to the hardcoded
URL so callers can fail fast if configuration is invalid.
- Around line 86-110: In load_remote_schemas, currently a successful return from
load_schemas_from_zip_url sets sources_loaded = true even if no schemas were
actually added; change the logic so after Ok(_) you verify the schema cache is
non-empty (inspect whatever in-memory store is used by the loader, e.g.,
self.schemas or self.cache) and only set sources_loaded = true and break when at
least one .json schema was loaded; if the cache remains empty, treat that Ok as
a failure (log and continue) so partial/empty loads do not silently mark success
for load_schemas_from_zip_url and load_remote_schemas.

🧹 Nitpick comments (8)

rust/resources/sources.yaml (1)

1-2: Consider pinning to a specific tag or commit instead of main.

Fetching from refs/heads/main means any push to the Schemas repo immediately affects all consumers. A tagged release or commit SHA would give you reproducible builds and explicit upgrade control.

java/src/main/resources/application.yml (1)

5-6: Same pinning concern as the Rust sources.yaml.

Using refs/heads/main makes builds non-reproducible. Consider pinning to a tagged release. Additionally, sources as a top-level Spring config key is quite generic — a namespaced key like pacts.sources would reduce the risk of collisions with other Spring configuration.

java/src/test/java/net/hydrius/pacts/impl/PactsServiceTest.java (1)

23-27: Test setup performs live network I/O on every test method.

@BeforeEach combined with @TestInstance(PER_CLASS) means setup() runs before each @Test. If the SchemaLoader constructor fetches remote schemas, this issues a network call per test. Consider using @BeforeAll (already PER_CLASS lifecycle) to initialize once, or caching/mocking the loader.

rust/Cargo.toml (1)

17-19: Commented-out dependency left in the manifest.

If tempfile isn't needed now, remove it entirely rather than leaving commented-out code. It can be re-added when needed — version control preserves the history.
rust/build.rs (1)
8-8: Unused variable project_root.

project_root is computed on line 8 but never referenced. This is dead code.
Proposed fix
     let manifest_dir = env::var("CARGO_MANIFEST_DIR").expect("CARGO_MANIFEST_DIR not set");
-    let project_root = Path::new(&manifest_dir).parent().expect("Failed to get project root");
rust/src/core/schema_loader.rs (1)

176-188: Shelling out to curl is fragile and non-portable.

Command::new("curl") requires curl to be installed on the host, is not available on all platforms (notably vanilla Windows), and bypasses Rust's HTTP ecosystem (connection pooling, TLS configuration, proper error types). Consider using a lightweight HTTP client crate like ureq (blocking, minimal dependencies) or reqwest (blocking feature).

Additionally, if the curl process is killed or the system is under memory pressure, output() captures all of stdout into memory with no size bound — a malicious or misconfigured URL could return an arbitrarily large response.

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java (1)

106-106: loadRemoteSchemas() is public — consider restricting visibility.

This method mutates internal cache state and is called automatically from the constructor. Exposing it publicly allows callers to re-trigger remote loading at any time, potentially introducing concurrency issues or inconsistent state. If external re-loading is not a requirement, reduce visibility to private or package-private.

java/pom.xml (1)

71-75: maven-surefire-plugin 3.0.0-M7 is a pre-release milestone from June 2022.

Consider upgrading to a stable release. The latest stable version is 3.5.4 (September 2025).

.github/workflows/ci-java.yml

.github/workflows/ci-rust.yml

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java

coderabbitai · 2026-02-10T18:25:48Z

java/src/test/resources/application.yml

+sources:
+  - "https://github.com/Project-Hydrius/Schemas/archive/refs/heads/main.zip"


⚠️ Potential issue | 🟠 Major

Tests depend on a live network call to GitHub.

If SchemaLoader fetches from this URL during test setup, tests will fail when offline, rate-limited, or if the remote repo is unavailable. Consider providing a test-specific mock/stub or bundling a small test schema locally for test isolation.

🤖 Prompt for AI Agents

In `@java/src/test/resources/application.yml` around lines 9 - 10, Tests currently depend on a live GitHub URL in application.yml which causes flaky/failed tests offline or when remote is unavailable; update tests to be hermetic by either (a) bundling a small test schema into test resources and point the test config to a local file URL instead of "https://..." or (b) mock/stub the network fetch in SchemaLoader (e.g., stub the method that fetches the ZIP or inject a test SchemaLoader implementation) so tests use the packaged test schema; change the test application.yml value and/or add a test-only SchemaLoader mock and ensure test setup uses that mock.

rust/build.rs

rust/Cargo.toml

rust/src/core/schema_loader.rs

cubic-dev-ai

16 issues found across 35 files

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java">

<violation number="1" location="java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java:63">
P1: Constructor unconditionally calls `loadRemoteSchemas()`, which throws `IOException` if remote sources are unavailable. This makes `SchemaLoader` impossible to instantiate for local/file-system-only usage and breaks resilience when the remote server is unreachable. Consider making remote loading optional (e.g., only attempt if `application.yml` has sources configured, and log a warning instead of throwing).</violation>

<violation number="2" location="java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java:92">
P2: `loadSchema` now silently returns `null` instead of throwing `IOException` on failure. Callers that relied on the exception contract may now get unexpected `NullPointerException`s. Consider throwing an exception for missing schemas, or at minimum documenting the null return clearly so callers add null checks.</violation>

<violation number="3" location="java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java:126">
P1: Naive YAML parsing: every line starting with `"- "` in the entire file is treated as a source URL, not just those under the `sources:` key. This will break if the YAML file has any other list-typed values. Consider using a proper YAML parser (e.g., SnakeYAML, which is already common in Java projects) or at least track whether the current line is within the `sources:` block.</violation>

<violation number="4" location="java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java:151">
P1: No connect/read timeouts configured on the HTTP connection. This can cause the application to hang indefinitely during startup if the remote server is slow or unresponsive. Add `setConnectTimeout()` and `setReadTimeout()` calls.</violation>

<violation number="5" location="java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java:153">
P1: Resource leak: `HttpURLConnection`, `InputStream`, and `ZipInputStream` are not closed when an exception is caught. Use try-with-resources to ensure all streams and connections are properly closed on both success and failure paths. Additionally, `connection.disconnect()` is never called.</violation>
</file>

<file name="java/src/main/resources/application.yml">

<violation number="1" location="java/src/main/resources/application.yml:6">
P2: Avoid pulling schemas from a mutable branch head. Pin the schema source to a specific tag or commit to keep builds reproducible and prevent unexpected schema changes.</violation>
</file>

<file name=".github/workflows/ci-java.yml">

<violation number="1" location=".github/workflows/ci-java.yml:64">
P1: `steps.version` no longer exists after removing the "Create Combined Version" step, so all references to `steps.version.outputs.impl_version` will resolve to empty and break artifact renames and release metadata. Use `steps.impl.outputs.impl_version` directly (or reintroduce a `version` step).</violation>
</file>

<file name="rust/resources/sources.yaml">

<violation number="1" location="rust/resources/sources.yaml:2">
P2: Pin schema sources to a tag or commit SHA instead of the moving `main` branch to keep builds reproducible and avoid unexpected schema changes.</violation>
</file>

<file name="java/src/test/resources/application.yml">

<violation number="1" location="java/src/test/resources/application.yml:10">
P2: Pin the schema source to an immutable tag or commit so test runs are reproducible and don’t change when the upstream main branch moves.</violation>
</file>

<file name="rust/src/core/schema_loader.rs">

<violation number="1" location="rust/src/core/schema_loader.rs:27">
P2: The doc comment for `schema_root` says "kept for API compatibility (not used)", but the constructor still panics if it's empty. This is contradictory — if the parameter is unused, the validation should either be removed, or the doc should be updated to explain why a non-empty value is still required.</violation>

<violation number="2" location="rust/src/core/schema_loader.rs:121">
P1: Hardcoded relative path `rust/resources/sources.yaml` depends on the working directory being the repo root. In production or CI environments, the binary will typically be run from a different directory, causing the config to silently not be found and falling back to the hardcoded default URL. Consider using an environment variable, embedding the config at compile time, or resolving relative to the binary's location.</violation>

<violation number="3" location="rust/src/core/schema_loader.rs:128">
P2: Hand-rolled YAML parsing is fragile and will break on many valid YAML constructs. Use the `serde_yaml` crate with a proper deserialization struct instead. For example:
```rust
#[derive(Deserialize)]
struct SourcesConfig {
    sources: Vec<String>,
}
let config: SourcesConfig = serde_yaml::from_reader(file)?;

This handles quoting, comments, edge cases, and is far more maintainable.

P2: JSON files with fewer than 5 path components in the ZIP are silently skipped. If the ZIP structure changes, schemas will be missing from cache with no diagnostic information. Add an `eprintln!` warning (or proper log) for skipped files so that debugging failed schema lookups is possible. P3: The PR description says the version bump is minor, but the Cargo.toml change is a patch bump (0.0.4 → 0.0.5). Please confirm the intended release level and align the version or update the PR description to avoid ambiguity.

(Based on your team's feedback about handling version/description mismatches.) [FEEDBACK_USED]

P3: The PR description says this is a minor version bump, but the pom.xml change is a patch bump (1.0.1). Please confirm intent and update either the version or the PR description to avoid release confusion.

(Based on your team's feedback about handling PR description/version mismatches without assuming which is correct.) [FEEDBACK_USED]

P1: `steps.version` was removed, but the workflow still references `steps.version.outputs.impl_version`, which will be undefined and break artifact naming/release. Use the `impl` step output instead. ```

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java

.github/workflows/ci-java.yml

rust/src/core/schema_loader.rs

.github/workflows/ci-rust.yml

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java

rust/src/core/schema_loader.rs

cubic-dev-ai · 2026-02-10T18:31:25Z

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java

+            logger.warning("Failed to load schema from file system: " + e.getMessage());
        }
+
+        return null;


P2: loadSchema now silently returns null instead of throwing IOException on failure. Callers that relied on the exception contract may now get unexpected NullPointerExceptions. Consider throwing an exception for missing schemas, or at minimum documenting the null return clearly so callers add null checks.

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java, line 92: <comment>`loadSchema` now silently returns `null` instead of throwing `IOException` on failure. Callers that relied on the exception contract may now get unexpected `NullPointerException`s. Consider throwing an exception for missing schemas, or at minimum documenting the null return clearly so callers add null checks.</comment> <file context> @@ -44,91 +59,164 @@ public SchemaLoader(String schemaRoot, String domain, String version) throws Ill + logger.warning("Failed to load schema from file system: " + e.getMessage()); } + + return null; } </file context>

cubic-dev-ai · 2026-02-10T18:31:25Z

rust/Cargo.toml

 [package]
 name = "pacts"
-version = "0.0.4"
+version = "0.0.5"


P3: The PR description says the version bump is minor, but the Cargo.toml change is a patch bump (0.0.4 → 0.0.5). Please confirm the intended release level and align the version or update the PR description to avoid ambiguity.

(Based on your team's feedback about handling version/description mismatches.)

View Feedback

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At rust/Cargo.toml, line 3: <comment>The PR description says the version bump is minor, but the Cargo.toml change is a patch bump (0.0.4 → 0.0.5). Please confirm the intended release level and align the version or update the PR description to avoid ambiguity. (Based on your team's feedback about handling version/description mismatches.) </comment> <file context> @@ -1,8 +1,7 @@ [package] name = "pacts" -version = "0.0.4" +version = "0.0.5" edition = "2021" -build = "build.rs" </file context>

cubic-dev-ai · 2026-02-10T18:31:25Z

java/pom.xml

    <groupId>net.hydrius</groupId>
    <artifactId>pacts</artifactId>
-    <version>1.0.0</version>
+    <version>1.0.1</version>


P3: The PR description says this is a minor version bump, but the pom.xml change is a patch bump (1.0.1). Please confirm intent and update either the version or the PR description to avoid release confusion.

(Based on your team's feedback about handling PR description/version mismatches without assuming which is correct.)

View Feedback

Prompt for AI agents

Check if this issue is valid — if so, understand the root cause and fix it. At java/pom.xml, line 8: <comment>The PR description says this is a minor version bump, but the pom.xml change is a patch bump (1.0.1). Please confirm intent and update either the version or the PR description to avoid release confusion. (Based on your team's feedback about handling PR description/version mismatches without assuming which is correct.) </comment> <file context> @@ -5,7 +5,7 @@ <groupId>net.hydrius</groupId> <artifactId>pacts</artifactId> - <version>1.0.0</version> + <version>1.0.1</version> <packaging>jar</packaging> </file context>

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

rust/src/core/schema_loader.rs (1)
230-232: ⚠️ Potential issue | 🟡 Minor

replace("v", "") removes all occurrences of v, not just the prefix.

For a version like "v1" this works, but any version string containing v elsewhere (e.g., hypothetically "v12dev3") would be mangled. Use trim_start_matches for correctness.
🐛 Proposed fix
     pub fn get_parsed_version(&self) -> i32 {
-        self.version.replace("v", "").parse().unwrap_or(1)
+        self.version.trim_start_matches('v').parse().unwrap_or(1)
     }

🤖 Fix all issues with AI agents

In `@java/pom.xml`:
- Around line 71-75: Update the Maven Surefire plugin declaration to use the
latest stable release instead of the pre-release milestone; locate the plugin
block for org.apache.maven.plugins:maven-surefire-plugin (artifactId
maven-surefire-plugin) and change its <version> from 3.0.0-M7 to 3.5.4, then run
a quick mvn test to verify compatibility and adjust any configuration if new
defaults changed.

In `@rust/src/core/schema_loader.rs`:
- Around line 140-152: The load_schemas_from_zip_url function currently shells
out to curl; replace that with a Rust HTTP client (use ureq for blocking
simplicity or reqwest blocking) by adding the dependency and using e.g.
ureq::get(url).call() (or reqwest::blocking::get) to fetch the bytes, check the
HTTP status and return an error on non-success, then use the response
bytes/reader in place of Command::output() output handling; update error
messages to include the HTTP status/body when available and remove the curl
Command usage. Ensure you preserve existing behavior (redirects/timeouts by
configuring the client) and adjust imports and Result error conversions
accordingly.

🧹 Nitpick comments (11)

.github/workflows/ci-rust.yml (1)

50-62: Changelog generation looks solid, minor edge-case note.

The heredoc-with-random-delimiter approach for multiline output is good practice for GitHub Actions. The tag-existence check and fallback to last 20 commits is reasonable.

One thing to be aware of: if the tag rust-v<version> already exists (e.g., a re-run without bumping the version), the changelog will be empty ($TAG..HEAD with no new commits), and the release step may fail or create a duplicate. Consider guarding against an empty $LOG or adding an if: condition on the release step to skip when there are no new changes.
rust/src/core/schema_loader.rs (4)
66-78: load_schema panics on cache miss — consider returning Result instead.

Panicking at runtime for a missing schema is harsh, especially since the caller (Validator) may want to handle a missing schema gracefully. Returning Result<Value> (or Option<Value>) would be more idiomatic Rust and give callers the choice of how to handle the error.

That said, if this is an intentional "fail-fast" design decision (schemas must always be pre-loaded), this is acceptable — but it should be documented as a contract at the call sites too.

186-189: Minor: avoid an unnecessary clone by reordering the log and insert.

insert consumes the key, so the clone exists only to keep cache_key alive for the eprintln!. Swapping the order eliminates the allocation.
♻️ Proposed fix
-                    let cache_key = format!("{}/{}/{}/{}", entry_domain, entry_version, entry_category, schema_name);
-                    self.schema_cache.insert(cache_key.clone(), schema);
-                    eprintln!("Loaded schema into cache: {}", cache_key);
+                    let cache_key = format!("{}/{}/{}/{}", entry_domain, entry_version, entry_category, schema_name);
+                    eprintln!("Loaded schema into cache: {}", cache_key);
+                    self.schema_cache.insert(cache_key, schema);
160-192: ZIP entry path parsing is brittle — silently skips entries that don't match the expected depth.

Lines 177–190 require at least 5 path components and index from the end. If the ZIP structure changes (e.g., no top-level directory, or additional nesting), entries are silently ignored. Consider logging a warning for skipped .json entries so mismatches are observable.
♻️ Suggested improvement
                 if path_parts.len() >= 5 {
                     // ... existing logic ...
+                } else {
+                    eprintln!("Skipping JSON entry with unexpected path structure: {}", entry_path);
                 }
37-53: Constructor panics on both validation and load failure — document or soften.

new() can panic in two places: empty field validation (Line 39) and .expect() on remote load (Line 50). Since service.rs calls SchemaLoader::new(...) directly, any network failure at startup will crash the process with a bare panic message. Consider returning Result<Self> to let the caller decide on the error strategy (retry, fallback, graceful shutdown).

If panicking is the intended contract, this is fine — just noting it propagates to Service::new as well.
java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java (6)
33-34: HashMap cache is not thread-safe for concurrent reads/writes.

If SchemaLoader is used as a singleton (e.g., a Spring bean), loadSchema can be called from multiple request threads concurrently, each potentially reading and writing to the cache. A plain HashMap under concurrent access can lead to corrupted state or infinite loops (on resize). Consider using ConcurrentHashMap.
♻️ Suggested fix
-import java.util.HashMap;
-import java.util.Map;
+import java.util.Map;
+import java.util.concurrent.ConcurrentHashMap;
-    private final Map<String, JsonNode> cache;
+    private final Map<String, JsonNode> cache;
-        this.cache = new HashMap<>();
+        this.cache = new ConcurrentHashMap<>();
Also applies to: 60-60

79-98: Replace containsKey + get with a single get + null-check.

The containsKey followed by get is two lookups and becomes a TOCTOU issue if the map is replaced with ConcurrentHashMap. Use a single get:
♻️ Suggested fix
     public JsonNode loadSchema(String category, String name) {
         String cacheKey = domain + "/" + version + "/" + category + "/" + name;
 
-        if (cache.containsKey(cacheKey)) {
-            return cache.get(cacheKey);
+        JsonNode cached = cache.get(cacheKey);
+        if (cached != null) {
+            return cached;
         }
112-141: Unnecessary intermediate ObjectNode wrapping — use sourcesNode directly.

The code parses sourcesNode from YAML (line 124), wraps it into a new ObjectNode called settings (lines 128–129), then immediately unwraps it on line 139 with settings.get("sources"). This round-trip adds complexity without benefit.
♻️ Suggested simplification
     private void loadRemoteSchemas() throws IOException {
-        JsonNode settings = null;
+        JsonNode sourcesArray = null;
         try (InputStream stream = getClass().getClassLoader().getResourceAsStream("application.yml")) {
             if (stream == null) {
                 logger.warning("Application Settings not found");
             } else {
                 ObjectMapper yamlMapper = new ObjectMapper(new YAMLFactory());
                 JsonNode parsed = yamlMapper.readTree(stream);
 
                 if (parsed == null || !parsed.has("sources")) {
                     logger.warning("application.yml is missing 'sources' key");
                 } else {
                     JsonNode sourcesNode = parsed.get("sources");
                     if (!sourcesNode.isArray()) {
                         logger.warning("'sources' key in application.yml is not an array");
                     } else {
-                        settings = objectMapper.createObjectNode();
-                        ((ObjectNode) settings).set("sources", objectMapper.convertValue(sourcesNode, JsonNode.class));
+                        sourcesArray = sourcesNode;
                     }
                 }
             }
         } catch (Exception e) {
             logger.warning(() -> "Failed to parse application settings: " + e.getMessage());
         }
 
         boolean sourcesLoaded = false;
-        if (settings != null) {
-            JsonNode sourcesNode = settings.get("sources");
-            for (JsonNode node : sourcesNode) {
+        if (sourcesArray != null) {
+            for (JsonNode node : sourcesArray) {
156-188: Silently discarding ZIP entries that don't match the expected path structure.

When pathParts.length < 3 (line 178), the entry is silently skipped — no log, no warning. If the remote ZIP layout changes or has unexpected nesting, schemas will silently fail to load, making debugging difficult.
♻️ Add a debug/warning log for skipped entries
                                 String[] pathParts = categoryName.split("/");
                                 if (pathParts.length >= 3) {
                                     // ... existing cache logic ...
+                                } else {
+                                    logger.warning(() -> "Skipping ZIP entry with unexpected path structure: " + entryPath);
                                 }
157-167: Consider adding a cap on total decompressed bytes across all ZIP entries.

The per-entry limit (MAX_ENTRY_SIZE = 10 MB) is good, but a malicious or oversized ZIP could still contain thousands of entries just under the limit. A total decompressed size cap (e.g., 100 MB) would provide more robust protection against zip bombs.

145-145: new URL(String) is deprecated since Java 20.

While the project targets Java 17, it's worth noting for forward compatibility. When upgrading, switch to URI.create(source).toURL().

java/pom.xml

rust/src/core/schema_loader.rs

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

rust/src/core/schema_loader.rs (1)
232-234: ⚠️ Potential issue | 🟡 Minor

replace("v", "") strips all occurrences of v, not just the prefix.

For a version string like "v12" this works, but replace is overly broad — e.g., a hypothetical "preview1" would become "preiew1". Use strip_prefix for correctness:
🐛 Proposed fix
     pub fn get_parsed_version(&self) -> i32 {
-        self.version.replace("v", "").parse().unwrap_or(1)
+        self.version
+            .strip_prefix('v')
+            .unwrap_or(&self.version)
+            .parse()
+            .unwrap_or(1)
     }

🤖 Fix all issues with AI agents

In `@java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java`:
- Around line 25-29: The class Javadoc for SchemaLoader incorrectly states
remote ZIP loading is "available through a separate method" even though
loadRemoteSchemas() is private and invoked by the constructor; either change the
Javadoc to accurately say remote loading is handled internally by the private
loadRemoteSchemas() method and is auto-invoked by the constructor, or if the
intention is to expose remote loading to callers, change loadRemoteSchemas() to
public (and update its Javadoc) so the comment matches the API; update
references to SchemaLoader and loadRemoteSchemas() accordingly.

In `@rust/src/core/schema_loader.rs`:
- Around line 162-194: The loop currently uses `?` on `entry.read_to_string(&mut
content)?` and `serde_json::from_str(&content)?`, which aborts the whole
function on a single malformed entry; change per-entry error handling in the
`for` loop inside `load_schemas_from_zip_url` (the block that iterates `for i in
0..zip.len()` and uses `zip.by_index(i)`) to catch and log errors instead of
propagating them: replace the `?` calls with match/if-let that on Err logs the
error (including `entry.name()` or the index) and continues to the next entry,
so `self.schema_cache.insert(...)` is only reached for successfully
read-and-parsed JSON files and the loop completes even if some entries fail
(keep the existing cache-key creation using `entry_domain`, `entry_version`,
`entry_category`, `schema_name`).

🧹 Nitpick comments (6)

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java (4)
40-42: Consider adding a total ZIP size limit alongside the per-entry limit.

MAX_ENTRY_SIZE guards individual entries, but a ZIP with many entries each just under 10 MB could still exhaust heap. A cumulative byte counter across all entries with a reasonable cap (e.g., 50–100 MB) would provide stronger protection against ZIP bombs.

Also applies to: 157-167

112-135: Simplify: use sourcesNode directly instead of wrapping in an intermediate ObjectNode.

The settings ObjectNode (line 128–129) wraps sourcesNode only to unwrap it again at line 139. The objectMapper.convertValue(sourcesNode, JsonNode.class) call is also a no-op (JsonNode → JsonNode). You can simplify by storing sourcesNode directly in a local variable and iterating over it.
♻️ Proposed simplification
-        JsonNode settings = null;
+        JsonNode sourcesArray = null;
         try (InputStream stream = getClass().getClassLoader().getResourceAsStream("application.yml")) {
             if (stream == null) {
                 logger.warning("Application Settings not found");
             } else {
                 ObjectMapper yamlMapper = new ObjectMapper(new YAMLFactory());
                 JsonNode parsed = yamlMapper.readTree(stream);
 
                 if (parsed == null || !parsed.has("sources")) {
                     logger.warning("application.yml is missing 'sources' key");
                 } else {
                     JsonNode sourcesNode = parsed.get("sources");
                     if (!sourcesNode.isArray()) {
                         logger.warning("'sources' key in application.yml is not an array");
                     } else {
-                        settings = objectMapper.createObjectNode();
-                        ((ObjectNode) settings).set("sources", objectMapper.convertValue(sourcesNode, JsonNode.class));
+                        sourcesArray = sourcesNode;
                     }
                 }
             }
         } catch (Exception e) {
             logger.warning(() -> "Failed to parse application settings: " + e.getMessage());
         }
 
         boolean sourcesLoaded = false;
-        if (settings != null) {
-            JsonNode sourcesNode = settings.get("sources");
-            for (JsonNode node : sourcesNode) {
+        if (sourcesArray != null) {
+            for (JsonNode node : sourcesArray) {
Also applies to: 137-141

30-34: HashMap cache is not thread-safe if loadSchema is called from multiple threads.

If SchemaLoader instances are shared across threads (common in Spring-managed beans), concurrent reads/writes on HashMap can cause data races. Consider using ConcurrentHashMap or documenting that the class is not thread-safe.
♻️ Proposed fix
-import java.util.HashMap;
-import java.util.Map;
+import java.util.Map;
+import java.util.concurrent.ConcurrentHashMap;
 ...
-    private final Map<String, JsonNode> cache;
+    private final Map<String, JsonNode> cache;
 ...
-        this.cache = new HashMap<>();
+        this.cache = new ConcurrentHashMap<>();
Also applies to: 79-99

53-70: Constructor performs blocking network I/O.

Calling loadRemoteSchemas() from the constructor makes object creation dependent on network availability and latency (up to 15s connect + 30s read per source). In a Spring context, this blocks bean initialization. Consider deferring remote loading to an explicit init() method or using @PostConstruct / lazy initialization so consumers can control when the network call happens.
rust/src/core/schema_loader.rs (2)
154-155: No upper bound on response body size — a malicious or misconfigured source could exhaust memory.

read_to_end will consume the entire HTTP response without limit. Consider capping the read (e.g., via take()) or checking Content-Length before reading:
🛡️ Example: cap download size
+        const MAX_ZIP_SIZE: u64 = 50 * 1024 * 1024; // 50 MB
+
         let mut bytes = Vec::new();
-        response.body_mut().as_reader().read_to_end(&mut bytes)?;
+        response.body_mut().as_reader().take(MAX_ZIP_SIZE).read_to_end(&mut bytes)?;
+
+        if bytes.len() as u64 >= MAX_ZIP_SIZE {
+            return Err(anyhow::anyhow!("ZIP from {} exceeds maximum allowed size", url));
+        }
36-52: Constructor performs network I/O and panics on failure — this is difficult to test.

new() eagerly fetches remote ZIPs and panics (via .expect()) if loading fails. This makes unit testing and offline usage impossible. Consider a two-phase initialization (e.g., a fallible SchemaLoader::load() → Result<Self>) or accepting pre-loaded schemas for testing. This is a design trade-off and acceptable if the current approach meets your needs, but worth keeping in mind.

coderabbitai · 2026-02-11T07:30:01Z

java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java

 * SchemaLoader class that loads schemas that are bundled with Pacts.
+ * 
+ * This class supports loading schemas from the file system or embedded resources.
+ * Remote ZIP loading functionality is available through a separate method.
 */


⚠️ Potential issue | 🟡 Minor

Javadoc says "available through a separate method" but loadRemoteSchemas() is private.

The class-level doc (line 28) states remote loading is "available through a separate method," implying a public API, but the method is private and auto-invoked by the constructor. Consider updating the doc to reflect the actual behavior.

🤖 Prompt for AI Agents

In `@java/src/main/java/net/hydrius/pacts/core/SchemaLoader.java` around lines 25 - 29, The class Javadoc for SchemaLoader incorrectly states remote ZIP loading is "available through a separate method" even though loadRemoteSchemas() is private and invoked by the constructor; either change the Javadoc to accurately say remote loading is handled internally by the private loadRemoteSchemas() method and is auto-invoked by the constructor, or if the intention is to expose remote loading to callers, change loadRemoteSchemas() to public (and update its Javadoc) so the comment matches the API; update references to SchemaLoader and loadRemoteSchemas() accordingly.

coderabbitai · 2026-02-11T07:30:01Z

rust/src/core/schema_loader.rs

+        for i in 0..zip.len() {
+            let mut entry = zip.by_index(i)?;
+
+            // Process only JSON files
+            if !entry.is_dir() && entry.name().ends_with(".json") {
+                // Read the entry content
+                let mut content = String::new();
+                entry.read_to_string(&mut content)?;
+
+                // Parse the JSON schema
+                let schema: Value = serde_json::from_str(&content)?;
+
+                // Extract path information to create cache key
+                let entry_name = entry.name().to_string();
+                let path_parts: Vec<&str> = entry_name.split('/').collect();
+
+                // We expect paths like: Schemas-main/bees/v1/inventory/item.json
+                if path_parts.len() >= 5 {
+                    let entry_domain = path_parts[path_parts.len() - 4];
+                    let entry_version = path_parts[path_parts.len() - 3];
+                    let entry_category = path_parts[path_parts.len() - 2];
+                    let file_name = path_parts[path_parts.len() - 1];
+
+                    let schema_name = file_name.trim_end_matches(".json");
+
+                    let cache_key = format!("{}/{}/{}/{}", entry_domain, entry_version, entry_category, schema_name);
+                    self.schema_cache.insert(cache_key.clone(), schema);
+                    eprintln!("Loaded schema into cache: {}", cache_key);
+                } else {
+                    eprintln!("Skipping JSON file with unexpected path structure (< 5 components): {}", entry_name);
+                }
+            }
+        }


⚠️ Potential issue | 🟠 Major

A single malformed JSON entry aborts processing of the remaining ZIP entries and leaves the cache partially populated.

The ? on lines 169 and 172 propagates errors out of the for loop, so if entry N of M fails parsing, entries already inserted into schema_cache remain but entries N+1..M are never processed. load_schemas_from_zip_url then returns Err, and load_remote_schemas falls through to the next source — but the cache is already partially filled from the failed source. This creates inconsistent state.

Consider catching per-entry errors and logging them rather than aborting the entire ZIP:

♻️ Suggested change: skip malformed entries

for i in 0..zip.len() { let mut entry = zip.by_index(i)?; // Process only JSON files if !entry.is_dir() && entry.name().ends_with(".json") { // Read the entry content let mut content = String::new(); - entry.read_to_string(&mut content)?; - - // Parse the JSON schema - let schema: Value = serde_json::from_str(&content)?; + if let Err(e) = entry.read_to_string(&mut content) { + eprintln!("Failed to read ZIP entry {}: {}", entry.name(), e); + continue; + } + + // Parse the JSON schema + let schema: Value = match serde_json::from_str(&content) { + Ok(v) => v, + Err(e) => { + eprintln!("Failed to parse JSON in {}: {}", entry.name(), e); + continue; + } + };

🤖 Prompt for AI Agents

In `@rust/src/core/schema_loader.rs` around lines 162 - 194, The loop currently uses `?` on `entry.read_to_string(&mut content)?` and `serde_json::from_str(&content)?`, which aborts the whole function on a single malformed entry; change per-entry error handling in the `for` loop inside `load_schemas_from_zip_url` (the block that iterates `for i in 0..zip.len()` and uses `zip.by_index(i)`) to catch and log errors instead of propagating them: replace the `?` calls with match/if-let that on Err logs the error (including `entry.name()` or the index) and continues to the next entry, so `self.schema_cache.insert(...)` is only reached for successfully read-and-parsed JSON files and the loop completes even if some entries fail (keep the existing cache-key creation using `entry_domain`, `entry_version`, `entry_category`, `schema_name`).

flrping added 7 commits January 29, 2026 22:47

refactor: remove all schemas

a12922a

ci: update workflows

8e36906

refactor: start to change java schema loader to pull from sources.

5de5df9

refactor: constructor complexity

6067b6a

refactor: rust implementation now pulls from sources

8645c07

fix: close scanner

d6a6925

deps: version bump

f0f3a66

flrping self-assigned this Feb 10, 2026

coderabbitai bot reviewed Feb 10, 2026

View reviewed changes

cubic-dev-ai bot reviewed Feb 10, 2026

View reviewed changes

flrping added 3 commits February 11, 2026 01:45

ci: fixed workflows

9027e0b

refactor: better loading for java impl

b90e4f1

feat: better loading for rust impl

ff2d110

coderabbitai bot reviewed Feb 11, 2026

View reviewed changes

java/pom.xml Show resolved Hide resolved

rust/src/core/schema_loader.rs Outdated Show resolved Hide resolved

refactor: apply suggestions, cleanup

b105e30

coderabbitai bot reviewed Feb 11, 2026

View reviewed changes

		sources:
		- "https://github.com/Project-Hydrius/Schemas/archive/refs/heads/main.zip"

Feat: Implementations now pull schemas #7

Are you sure you want to change the base?

Feat: Implementations now pull schemas #7

Conversation

flrping commented Feb 10, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by cubic

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Possibly related PRs

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cubic-dev-ai bot Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cubic-dev-ai bot Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

flrping commented Feb 10, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 10, 2026 •

edited

Loading

cubic-dev-ai bot Feb 10, 2026 •

edited

Loading

cubic-dev-ai bot Feb 10, 2026 •

edited

Loading

cubic-dev-ai bot Feb 10, 2026 •

edited

Loading