t1328+t1329: Matterbridge agent cleanup + cross-review simplification + framework consolidation by marcusquinn · Pull Request #2396 · marcusquinn/aidevops

marcusquinn · 2026-02-26T19:50:03Z

Summary

t1328: Simplify matterbridge agent — remove overly-detailed SimpleX bridge sections from matterbridge.md and matterbridge-helper.sh (the simplex-bridge subcommand and ~240 lines of Docker Compose orchestration). SimpleX bridging is now a concise reference pointing to the upstream adapter.
t1329: Simplify /cross-review slash command — remove --task-type option, streamline scoring criteria table, clean up documentation.
Framework consolidation: Large-scale cleanup removing premature/unused code and reorganizing scripts.

Changes by Category

t1328 — Matterbridge Agent (core deliverable)

Simplified matterbridge.md: removed verbose SimpleX bridge architecture diagrams, Docker Compose setup, chat ID instructions, privacy gradient section. Replaced with concise adapter reference.
Simplified matterbridge-helper.sh: removed simplex-bridge subcommand and all associated functions (~240 lines).
Removed config templates: matterbridge-simplex-compose.yml, matterbridge-simplex.toml.example.

t1329 — Cross-Review Pipeline

Simplified cross-review.md: removed --task-type option, streamlined scoring criteria, removed verbose output section.
Updated compare-models-helper.sh accordingly.

Script Reorganization

Moved supervisor scripts from supervisor-archived/ to supervisor/ (active use).
Unarchived utility scripts from archived/ to root scripts/ dir.
Deleted unused/premature code: circuit-breaker-helper.sh, prompt-guard-helper.sh, opus-review-helper.sh, session-miner-pulse.sh, session-miner/, simplex-bot/ (entire TypeScript project), simplex-helper.sh, local-model-helper.sh.
Deleted premature agents: opsec.md, simplex.md, local-models.md, huggingface.md, ui-verification.md.
Deleted unused configs: model-routing-table.json, rate-limits.json.txt.
Deleted tests for removed code: test-circuit-breaker.sh, test-matterbridge-helper.sh, test-simplex-integration.sh.

TODO.md State Reset

Reverted several tasks from [x] completed back to [ ] open (t1305, t1306, t1314, t251, etc.).
Removed completed subtask entries for t1327.x and t1338.x series.
Removed t1335-t1338 task entries and associated brief files.

Framework Changes

build.txt: Removed structural thinking, scientific reasoning, and error prevention sections.
AGENTS.md, onboarding.md, self-improving-agents.md, full-loop.md, runners.md: Various updates.
issue-sync-helper.sh, budget-tracker-helper.sh, fallback-chain-helper.sh, observability-helper.sh, full-loop-helper.sh, mail-helper.sh: Significant rewrites/expansions.
Added test-dual-hosted-sync.sh.
VERSION: 2.133.12 → 2.125.1.

Review Notes

Broken forward references: matterbridge.md references tools/security/opsec.md and services/communications/simplex.md in its Related section — both deleted in this branch. These are forward references to files that will be recreated by future tasks (t1327.6, t1327.2).
VERSION downgrade: 2.133.12 → 2.125.1 — intentional rollback or needs correction?
Net deletion: -15,844 lines across 177 files. Significant codebase reduction.

Stats

177 files changed, 9,465 insertions, 25,309 deletions

Closes #2254

Summary by CodeRabbit

New Features
- Optional judge-based scoring for cross-review via new --score and --judge options.
Bug Fixes
- Improved robustness: better error handling, parsing, logging, timeout handling, and clearer per-model failure messages.
Documentation
- Simplified cross-review docs and examples; condensed Matterbridge SimpleX guidance and clarified macOS install notes.
Removals
- Removed legacy task-type usage and the SimpleX/Docker bridge command and trimmed redundant sections.

coderabbitai · 2026-02-26T19:50:26Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

Walkthrough

This PR removes SimpleX/simplex-bridge from the Matterbridge helper and docs, consolidates judge-based scoring into a new _cross_review_judge_score flow with --score/--judge options in .agents/scripts/compare-models-helper.sh, and simplifies cross-review docs and Matterbridge documentation and error handling.

Changes

Cohort / File(s)	Summary
Cross-review docs `.agents/scripts/commands/cross-review.md`	Streamlined descriptions and examples: removed `--task-type` and `--prompt` option rows, simplified scoring criteria table, removed Output section, and reorganized text to reduce redundancy.
Judge scoring & cross-review CLI `.agents/scripts/compare-models-helper.sh`	Added `_cross_review_judge_score()` to build judge prompt, collect per-model outputs, invoke judge runner, parse JSON and normalize scores; added `--score`/`--judge` flags to `cmd_cross_review()` and removed `_resolve_cross_review_auth()` and `_judge_cross_review()`; improved model-id sanitization, per-model error logging, JSON validation, diff handling, and help text.
Matterbridge helper script `.agents/scripts/matterbridge-helper.sh`	Removed `simplex-bridge` command and related public vars/functions; hardened version/OS detection, validation, stop/restart logic, logs parsing, and update flow; cleaned help/dispatch to omit simplex-bridge.
Matterbridge documentation `.agents/services/communications/matterbridge.md`	Removed detailed SimpleX adapter walkthroughs and diagrams; replaced with concise adapter note and streamlined install/run examples; condensed privacy/security narrative and cross-referenced opsec agent; cleaned section layout.

Sequence Diagram(s)

sequenceDiagram
    autonumber
    rect rgba(46, 117, 182, 0.5)
    participant User
    end
    rect rgba(76, 175, 80, 0.5)
    participant CLI
    end
    rect rgba(255, 193, 7, 0.5)
    participant Dispatcher
    end
    rect rgba(156, 39, 176, 0.5)
    participant Models
    end
    rect rgba(233, 30, 99, 0.5)
    participant Judge
    end
    rect rgba(33, 150, 243, 0.5)
    participant FS
    end

    User->>CLI: run cmd_cross_review [--score --judge ...]
    CLI->>Dispatcher: dispatch prompts to models
    Dispatcher->>Models: send parallel prompts
    Models-->>Dispatcher: per-model outputs & logs
    Dispatcher->>FS: write per-model outputs
    alt scoring requested
      Dispatcher->>Judge: build judge prompt + model outputs
      Judge-->>Dispatcher: JSON verdict (scores, winner, reasoning)
      Dispatcher->>FS: write judge scores & mapped results
      Dispatcher->>User: display scores and diffs
    else no scoring
      Dispatcher->>User: display diffs and per-model logs
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related PRs

feat: Claude-Flow inspired features - model routing, semantic memory, pattern tracking (t102) #341 — Integrates pattern-tracker-helper and memory pattern types referenced by the new judge scoring/storage wiring in compare-models-helper.sh.
t1328: Matterbridge agent — subagent doc and helper script for multi-platform chat bridging #2387 — Makes overlapping edits to matterbridge helper and docs; likely reverts/removes earlier SimpleX bridge code touched here.
feat: add /compare-models and /compare-models-free slash commands (t168) #660 — Prior compare-models-helper.sh changes touching judge scoring and cross-review logic consolidated by this PR.

Suggested labels

needs-review

Poem

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Out of Scope Changes check	⚠️ Warning	The PR includes substantial out-of-scope changes beyond t1328/t1329: script reorganization (supervisor-archived→supervisor, archived utilities unarchived), multiple helper script rewrites, TODO.md reverts, VERSION changes, and ~15,844 net line deletions across 177 files unrelated to stated objectives.	Isolate t1328+t1329 changes into focused commits; defer script reorganization, helper script expansions, and VERSION changes to separate PRs with corresponding issue tracking and acceptance criteria.
Docstring Coverage	⚠️ Warning	Docstring coverage is 35.29% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main changes: Matterbridge cleanup (t1328), cross-review simplification (t1329), and framework consolidation are all present and prominent in the changeset.
Linked Issues check	✅ Passed	The PR successfully delivers all key coding objectives from issue `#2254`: matterbridge.md created with setup/config/deployment coverage, matterbridge-helper.sh implemented with required subcommands, security cross-references to opsec agent included, and AGENTS.md/subagent-index updates completed.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feature/t1328-t1329-matterbridge-cross-review

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist · 2026-02-26T19:50:27Z

Summary of Changes

Hello @marcusquinn, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request focuses on a significant refactoring and cleanup effort across the codebase, aiming to simplify existing agents, consolidate framework components, and reduce technical debt. Key changes include streamlining the Matterbridge agent by removing outdated SimpleX bridging details and enhancing the /cross-review command with new scoring capabilities. The PR also involves extensive deletion of unused code, agents, and configurations, alongside reorganization of scripts, to improve maintainability and clarity.

Highlights

Matterbridge Agent Refinement: Simplified the Matterbridge agent by removing verbose SimpleX bridge sections, Docker Compose orchestration, and associated configuration templates, replacing them with a concise reference to the upstream adapter.
Cross-Review Command Streamlining: Simplified the /cross-review slash command by removing the --task-type option, streamlining scoring criteria documentation, and updating the underlying helper script to support new scoring functionality.
Extensive Framework Consolidation: Performed a large-scale cleanup, reorganizing supervisor and utility scripts, and deleting numerous unused or premature code files, agents, configurations, and tests, resulting in a significant reduction in codebase size.
TODO List Synchronization: Reset the state of several tasks in TODO.md from completed to open and removed entries for completed subtasks, reflecting current project status.
Core Framework Updates: Updated various documentation files and significantly rewrote or expanded several helper scripts, including issue-sync-helper.sh and full-loop-helper.sh, and added a new test script.

Changelog

.agents/AGENTS.md
- Added "Communications" category entry with Matterbridge, SimpleX, and Matrix bot references.
.agents/scripts/commands/cross-review.md
- Added new documentation for the /cross-review command, detailing its purpose, instructions, options (including --score and --judge), model tiers, scoring criteria, and examples.
.agents/scripts/compare-models-helper.sh
- Updated cmd_cross_review function to support new --score and --judge options for automated model response scoring.
- Implemented _cross_review_judge_score function to handle the judge scoring pipeline, including prompt construction, dispatching to a judge model, parsing JSON output, and recording scores.
- Expanded help documentation for cmd_cross_review to include examples for scoring.
.agents/scripts/matterbridge-helper.sh
- Added a new shell script to manage Matterbridge, providing commands for setup, validation, starting/stopping (daemon or foreground), checking status, viewing logs, and updating the binary.
.agents/services/communications/matterbridge.md
- Added comprehensive documentation for the Matterbridge multi-platform chat bridge, covering quick reference, natively supported platforms, 3rd party adapters (including SimpleX), installation methods, configuration details, running instructions (CLI, Docker, Systemd), REST API usage, troubleshooting, and security considerations.
.agents/subagent-index.toon
- Updated the description for the services/communications/ entry to include "multi-platform chat bridging" and added matterbridge and simplex as keywords.

Activity

The pull request introduces broken forward references in matterbridge.md to tools/security/opsec.md and services/communications/simplex.md, which are noted as files to be recreated by future tasks.
The project version was intentionally downgraded from 2.133.12 to 2.125.1.
A significant codebase reduction occurred, with a net deletion of 15,844 lines across 177 files.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (2)

.agents/scripts/compare-models-helper.sh (2)

1036-1052: Multiple Python invocations for JSON parsing is inefficient.

Each score field extraction spawns a separate Python process. For a one-time judge evaluation this is acceptable, but it adds ~6 subprocess calls per model.

♻️ Consider single jq call or Python batch extraction

+	# Extract all scores for a model in one Python call
 	for model_tier in "${models_with_output[@]}"; do
-		local corr comp qual clar adhr
-		corr=$(echo "$judge_json" | python3 -c "import sys,json; d=json.load(sys.stdin); print(d.get('scores',{}).get('${model_tier}',{}).get('correctness',0))" 2>/dev/null || echo "0")
-		comp=$(echo "$judge_json" | python3 -c "import sys,json; d=json.load(sys.stdin); print(d.get('scores',{}).get('${model_tier}',{}).get('completeness',0))" 2>/dev/null || echo "0")
-		qual=$(echo "$judge_json" | python3 -c "import sys,json; d=json.load(sys.stdin); print(d.get('scores',{}).get('${model_tier}',{}).get('quality',0))" 2>/dev/null || echo "0")
-		clar=$(echo "$judge_json" | python3 -c "import sys,json; d=json.load(sys.stdin); print(d.get('scores',{}).get('${model_tier}',{}).get('clarity',0))" 2>/dev/null || echo "0")
-		adhr=$(echo "$judge_json" | python3 -c "import sys,json; d=json.load(sys.stdin); print(d.get('scores',{}).get('${model_tier}',{}).get('adherence',0))" 2>/dev/null || echo "0")
+		local scores_line
+		scores_line=$(echo "$judge_json" | python3 -c "
+import sys, json
+d = json.load(sys.stdin)
+s = d.get('scores', {}).get('${model_tier}', {})
+print(s.get('correctness', 0), s.get('completeness', 0), s.get('quality', 0), s.get('clarity', 0), s.get('adherence', 0))
+" 2>/dev/null || echo "0 0 0 0 0")
+		read -r corr comp qual clar adhr <<< "$scores_line"

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.agents/scripts/compare-models-helper.sh around lines 1036 - 1052, Replace
the six separate Python subprocess calls inside the for-loop that read from
judge_json (the corr, comp, qual, clar, adhr assignments) with a single JSON
extraction per model to reduce subprocess overhead; for example, within the loop
over models_with_output use one jq invocation or a single python -c that reads
judge_json once and prints all five scores for the current model_tier, then
parse that single output into corr, comp, qual, clar, adhr before appending to
score_args (keep references to judge_json, model_tier, and score_args intact).

997-1011: JSON extraction with fallback is defensive but could be simplified.

The dual approach (grep for single-line, Python for multiline) handles various judge output formats. However, the inline Python script has the filename hardcoded via string interpolation which could break with special characters in paths.

♻️ Suggested improvement for path handling

 	if [[ -z "$judge_json" ]]; then
 		# Try multiline JSON extraction
-		judge_json=$(python3 -c "
-import sys, json, re
-text = open('${judge_output_file}').read()
-m = re.search(r'\{.*\}', text, re.DOTALL)
-if m:
-    try:
-        obj = json.loads(m.group())
-        print(json.dumps(obj))
-    except Exception:
-        pass
-" 2>/dev/null || true)
+		judge_json=$(python3 -c "
+import sys, json, re
+text = sys.stdin.read()
+m = re.search(r'\{.*\}', text, re.DOTALL)
+if m:
+    try:
+        obj = json.loads(m.group())
+        print(json.dumps(obj))
+    except Exception:
+        pass
+" < "$judge_output_file" 2>/dev/null || true)
 	fi

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.agents/scripts/compare-models-helper.sh around lines 997 - 1011, The inline
Python fallback embeds the filename via shell interpolation into the python3 -c
string, which can break for paths with quotes or special chars; instead update
the fallback to invoke python3 with the filename as an argument (use sys.argv in
the Python snippet) or feed the file via stdin (here refer to the variable
judge_output_file and the python3 -c call currently producing judge_json), so
the Python code reads the file path from argv[1] (or stdin) and you call python3
with "$judge_output_file" as an argument to safely handle arbitrary paths.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.agents/scripts/matterbridge-helper.sh:
- Around line 248-270: The cmd_logs function's case block doesn't advance
positional parameters so the --tail value isn't captured; update argument
parsing in cmd_logs (where follow, tail_lines and LOG_FILE are used) to properly
consume args — either convert the current case into a while loop that checks
"$1" and shifts, or on matching --tail perform tail_lines="${2:-50}" and then
shift 2 (and shift 1 when matching --follow/-f), ensuring subsequent logic reads
the correct value; keep the rest of the function (file existence check and tail
invocation) unchanged.
- Around line 152-157: Replace the incorrect version-check invocation with an
actual invocation of matterbridge using the target config so the process parses
the file: run timeout 5 "$BINARY_PATH" -conf "$config_path" (not -version),
capture stdout/stderr and check its exit status; if the timeout/command exits
non-zero log a failure via the existing log function including the captured
error output, otherwise log "Binary OK"; use the existing BINARY_PATH,
config_path, timeout and log symbols to locate and update the check.
- Around line 50-54: The macOS branch in detect_os_arch currently hardcodes
darwin-amd64; update the case handling so when os is darwin it uses the arch
variable to produce darwin-arm64 for arm64 Apple Silicon and darwin-amd64 for
x86_64 (e.g. use the existing $arch value in the darwin branch instead of a
fixed string). Modify the case "$os" in ... esac block (referencing
detect_os_arch, $os and $arch) to emit darwin-${arch} for darwin hosts (or
explicitly map arm64->arm64 and amd64->amd64) while keeping the linux and
default branches unchanged.

---

Nitpick comments:
In @.agents/scripts/compare-models-helper.sh:
- Around line 1036-1052: Replace the six separate Python subprocess calls inside
the for-loop that read from judge_json (the corr, comp, qual, clar, adhr
assignments) with a single JSON extraction per model to reduce subprocess
overhead; for example, within the loop over models_with_output use one jq
invocation or a single python -c that reads judge_json once and prints all five
scores for the current model_tier, then parse that single output into corr,
comp, qual, clar, adhr before appending to score_args (keep references to
judge_json, model_tier, and score_args intact).
- Around line 997-1011: The inline Python fallback embeds the filename via shell
interpolation into the python3 -c string, which can break for paths with quotes
or special chars; instead update the fallback to invoke python3 with the
filename as an argument (use sys.argv in the Python snippet) or feed the file
via stdin (here refer to the variable judge_output_file and the python3 -c call
currently producing judge_json), so the Python code reads the file path from
argv[1] (or stdin) and you call python3 with "$judge_output_file" as an argument
to safely handle arbitrary paths.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between fcc9544 and 0d151bf.

📒 Files selected for processing (6)

.agents/AGENTS.md
.agents/scripts/commands/cross-review.md
.agents/scripts/compare-models-helper.sh
.agents/scripts/matterbridge-helper.sh
.agents/services/communications/matterbridge.md
.agents/subagent-index.toon

.agents/scripts/matterbridge-helper.sh

gemini-code-assist

Code Review

This pull request focuses on simplifying the Matterbridge agent and cross-review slash command, along with framework consolidation by removing unused code and reorganizing scripts. The changes include simplifying documentation, removing the simplex-bridge subcommand from matterbridge-helper.sh, streamlining scoring criteria for the /cross-review command, and reorganizing/removing various scripts and configurations. Additionally, a new matterbridge-helper.sh script and matterbridge.md documentation file have been added to manage and document the Matterbridge multi-platform chat bridge. The review focuses on identifying potential issues and improvement opportunities in the added and modified code, particularly in the new matterbridge-helper.sh script and the updated compare-models-helper.sh script.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/compare-models-helper.sh


+	# Judge scoring pipeline (t1329)
+	# When --score is set, dispatch all outputs to a judge model for structured scoring.
+	if [[ "$score_flag" == "true" ]]; then
+		_cross_review_judge_score \
+			"$prompt" "$models_str" "$output_dir" "$judge_model" "${model_names[@]}"
+	fi


The _cross_review_judge_score function is called only when $score_flag is true. It would be clearer to move the function definition before its usage to improve readability.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/compare-models-helper.sh

+	"$runner_helper" create "$judge_runner" \
+		--model "$judge_model" \
+		--description "Cross-review judge" \
+		--workdir "$(pwd)" 2>/dev/null || true


The use of 2>/dev/null || true after the create command suppresses any errors that might occur during runner creation. This could mask issues such as insufficient permissions or invalid model configurations. It's better to allow errors to be displayed for debugging purposes.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/compare-models-helper.sh

+	"$runner_helper" run "$judge_runner" "$judge_prompt" \
+		--model "$judge_model" \
+		--timeout "120" \
+		--format text 2>/dev/null >"$judge_output_file" || true


Similar to the create command, suppressing errors with 2>/dev/null >"$judge_output_file" || true after the run command can hide important information about why the judge model failed to execute or produce output. Consider removing the suppression to allow errors to be displayed.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/compare-models-helper.sh

+		--timeout "120" \
+		--format text 2>/dev/null >"$judge_output_file" || true
+
+	"$runner_helper" destroy "$judge_runner" --force 2>/dev/null || true


The destroy command also has error suppression. If the runner fails to destroy, it might indicate a resource leak or other issue that should be investigated. Removing 2>/dev/null || true would allow these errors to surface.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/compare-models-helper.sh

+
+	# Extract JSON from judge output (strip any surrounding text)
+	local judge_json
+	judge_json=$(grep -o '{.*}' "$judge_output_file" 2>/dev/null | head -1 || true)


The grep command uses 2>/dev/null to suppress errors. If grep fails to find the JSON, it might be due to an issue with the output format or file content. Removing the suppression would help diagnose these cases.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/matterbridge-helper.sh

+	# Basic TOML syntax check via matterbridge dry-run
+	# matterbridge exits non-zero if config is invalid
+	log "Validating config: $config_path"
+	if timeout 5 "$BINARY_PATH" -conf "$config_path" -version >/dev/null 2>&1; then


The cmd_validate function uses >/dev/null 2>&1 to suppress all output from the matterbridge -version command. This prevents the user from seeing any error messages or version information. It's better to redirect only stdout (>/dev/null) to allow error messages to be displayed.

References

In shell scripts, avoid blanket suppression of errors with '2>/dev/null'. For 'declare -f', redirecting only stdout ('>/dev/null') is sufficient as it does not output to stderr, allowing potential syntax errors to remain visible.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/matterbridge-helper.sh

+
+	if [ "$daemon_mode" = true ]; then
+		log "Starting in daemon mode..."
+		nohup "$BINARY_PATH" -conf "$CONFIG_PATH" >>"$LOG_FILE" 2>&1 &


The cmd_start function uses 2>&1 to redirect stderr to stdout when starting matterbridge in daemon mode. This can make it difficult to diagnose startup issues. Consider redirecting stderr to a separate log file or allowing it to be displayed on the console.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/matterbridge-helper.sh

+	local pid
+	pid="$(cat "$PID_FILE")"
+	log "Stopping (PID: $pid)..."
+	kill "$pid" 2>/dev/null || true


The cmd_stop function uses 2>/dev/null to suppress errors from kill. While this prevents error output when the process doesn't exist, it also hides other potential issues (e.g., permission problems). Consider removing the suppression and handling specific error codes if needed.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/matterbridge-helper.sh

+
+	if is_running; then
+		log "Force killing..."
+		kill -9 "$pid" 2>/dev/null || true


The cmd_stop function uses 2>/dev/null to suppress errors from kill -9. While this prevents error output when the process doesn't exist, it also hides other potential issues (e.g., permission problems). Consider removing the suppression and handling specific error codes if needed.

References

Avoid using 2>/dev/null for blanket suppression of command errors in shell scripts to ensure that authentication, syntax, or system issues remain visible for debugging.

gemini-code-assist · 2026-02-26T19:54:59Z

.agents/scripts/matterbridge-helper.sh

+
+cmd_update() {
+	local current_version new_version
+	current_version="$("$BINARY_PATH" -version 2>&1 | grep -oE '[0-9]+\.[0-9]+\.[0-9]+' | head -1 || echo "unknown")"


The cmd_update function uses 2>&1 to redirect stderr to stdout when getting the current version. This can prevent error messages from being displayed. Consider redirecting only stdout (>/dev/null) to allow error messages to be displayed.

References

In shell scripts, avoid blanket suppression of errors with '2>/dev/null'. For 'declare -f', redirecting only stdout ('>/dev/null') is sufficient as it does not output to stderr, allowing potential syntax errors to remain visible.

t1328: Add Matterbridge multi-platform chat bridge subagent and helper script - .agents/services/communications/matterbridge.md — install, config, gateways, Docker, REST API, SimpleX adapter, security cross-reference to opsec agent - .agents/scripts/matterbridge-helper.sh — setup/start/stop/status/logs/update subcommands, ShellCheck clean - subagent-index.toon + AGENTS.md domain index updated with communications row t1329: Wire cross-review output into automated judge scoring pipeline - compare-models-helper.sh: add --score/--judge flags to cmd_cross_review() - _cross_review_judge_score(): dispatches all outputs to judge model (default: opus), parses structured JSON scores, records via cmd_score, feeds pattern tracker - .agents/scripts/commands/cross-review.md — /cross-review slash command doc Closes GH#2254, GH#2262

github-actions · 2026-02-26T19:58:39Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 90 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 19:58:35 UTC 2026: Code review monitoring started
Thu Feb 26 19:58:36 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 90

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 90
VULNERABILITIES: 0

Generated on: Thu Feb 26 19:58:38 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

… detection, stdin JSON - matterbridge-helper.sh: convert cmd_logs to while/shift loop for proper --tail value capture - matterbridge-helper.sh: separate binary check (-version) from config validation (-conf), detect TOML parse errors - matterbridge-helper.sh: detect Apple Silicon (arm64) for native darwin binary instead of hardcoding amd64 - compare-models-helper.sh: use stdin (< file) instead of open() for safe path handling in Python JSON extraction - compare-models-helper.sh: batch 5 Python score extractions into single subprocess per model

github-actions · 2026-02-26T20:05:39Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 90 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 20:05:35 UTC 2026: Code review monitoring started
Thu Feb 26 20:05:35 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 90

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 90
VULNERABILITIES: 0

Generated on: Thu Feb 26 20:05:37 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…ror handling - compare-models-helper.sh: redirect runner stderr to judge-errors.log instead of /dev/null so judge pipeline failures are debuggable - matterbridge-helper.sh: explicit curl failure handling in get_latest_version with warning log instead of silent fallback - matterbridge-helper.sh: allow stderr through on binary version check so errors are visible during validation

github-actions · 2026-02-26T20:24:36Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 20:24:31 UTC 2026: Code review monitoring started
Thu Feb 26 20:24:32 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Thu Feb 26 20:24:34 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (3)

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.agents/scripts/compare-models-helper.sh:
- Line 874: The pipeline using diff --unified=3 "$file_a" "$file_b" 2>/dev/null
| head -100 || echo ... fails under set -euo pipefail because diff exiting with
status 1 (files differ) causes the whole pipeline to be treated as failure and
triggers the fallback message; fix by capturing diff output and its exit status
separately: run diff --unified=3 "$file_a" "$file_b" 2>/dev/null into a variable
(e.g. diff_output) while saving the exit code (e.g. diff_status=$?), then if
diff_output is non-empty print it via head -100, else if diff_status equals 1
print the diff_output (or re-run diff) to show differences, otherwise print the
fallback "(files are identical or diff unavailable)". Use the existing diff
command text (diff --unified=3 "$file_a" "$file_b") and PIPESTATUS handling
instead of relying on the simple pipeline with ||.
- Around line 1021-1062: The score persistence uses untrusted judge JSON fields
(task_type, winner, numeric scores) passed into cmd_score via score_args;
restrict task_type to a fixed allowlist (e.g., map/array of permitted types) and
default to "general" if not allowed, ensure winner is one of the models in
models_with_output before adding --winner, and sanitize/clamp numeric values
read from scores_line (corr, comp, qual, clar, adhr) to an integer range 1–10
(or 0–10 if 0 is allowed) before appending them to score_args; perform these
checks where task_type, winner, and scores_line are parsed (referencing
variables task_type, winner, scores_line, corr/comp/qual/clar/adhr, score_args,
and cmd_score) so only validated values are passed to cmd_score.

In @.agents/scripts/matterbridge-helper.sh:
- Around line 176-183: The script currently invokes timeout when running
"$BINARY_PATH" and stores output in parse_output, but doesn't check whether the
timeout command exists or handle its special exit code; update the validation in
the block using parse_output/$BINARY_PATH/$config_path to first probe for a
usable timeout binary (try timeout then gtimeout), fail explicitly with a clear
error if neither is installed, then run the binary through the discovered
timeout command and capture its exit status; if the timeout command exits with
124 treat that as a hung process and fail accordingly, and only when the command
ran but produced stdout/stderr inspect parse_output for "toml|parse|syntax" to
report a config parse error. Ensure error messages distinguish “timeout not
found”, “process timed out (exit 124)”, and real config parse errors.
- Around line 285-288: The --tail handler currently sets tail_lines="${2:-50}"
but always runs shift 2 which fails if no value was provided; modify the --tail
case so it checks whether a following argument exists and is not another option
before consuming it—i.e., if a non-empty non-dash-prefixed $2 is present then
assign tail_lines="$2" and shift 2, otherwise set tail_lines=50 and only shift 1
(or no extra shift if you prefer to leave args intact). Update the --tail block
to perform that conditional test around shift to avoid "shift count out of
range" while preserving current behavior when a value is supplied.

---

Nitpick comments:
In @.agents/scripts/compare-models-helper.sh:
- Around line 919-937: The judge prompt currently concatenates full model
outputs into judge_prompt (built using original_prompt, model_names,
result_file, output_dir) which can exceed shell/LLM token limits; change the
logic to enforce a maximum prompt size (configurable via MAX_JUDGE_CHARS or
MAX_JUDGE_TOKENS), and while iterating model_names/read result_file only append
up to the remaining allowed length (truncate long response_text per model and
append a clear "[TRUNCATED]" marker), stop adding more models when the budget is
exhausted, and ensure models_with_output and final prompt respect the limit
before dispatching to the judge.
- Around line 659-890: The ShellCheck suggestion SC2001 flags use of external
sed/tr constructs for simple string replacement; inside cmd_cross_review replace
those calls that normalize model_tier (currently using echo/tr/sed) with Bash
parameter expansion (e.g., use model_tier=${model_tier// /} to remove spaces or
${var//search/replace} for other replacements) so you avoid unnecessary
subprocesses; update any similar occurrences in this function (references:
variable model_tier and the code block that sets/normalizes it, and places that
parse models_str into model_array) to use the parameter expansion form.

In @.agents/scripts/matterbridge-helper.sh:
- Around line 33-43: The current fallback hardcodes "1.26.0" when parsing fails
(variables: LATEST_RELEASE_URL, curl_output, version) which can produce 404s if
upstream advances; modify the function to use a rotating/secondary fallback:
after the initial curl and parse attempt, query the GitHub Releases API (or list
releases via LATEST_RELEASE_URL without redirect) to pick the most recent tag
that actually has the expected asset, validate the asset URL (HEAD request)
before returning it, and only fall back to the hardcoded "1.26.0" if no
validated release is found; keep existing log calls (log) to record failures and
which fallback was used.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0d151bf and 0582db1.

📒 Files selected for processing (4)

.agents/scripts/commands/cross-review.md
.agents/scripts/compare-models-helper.sh
.agents/scripts/matterbridge-helper.sh
.agents/services/communications/matterbridge.md

🚧 Files skipped from review as they are similar to previous changes (2)

.agents/services/communications/matterbridge.md
.agents/scripts/commands/cross-review.md

.agents/scripts/compare-models-helper.sh

.agents/scripts/matterbridge-helper.sh

…diff pipefail, score sanitization - Fix --tail arg parsing in cmd_logs: check $2 exists and is not a flag before shift 2, preventing 'shift count out of range' error - Fix binary validation in cmd_validate: use -x check instead of -version flag, probe for timeout/gtimeout, handle exit code 124 (timed out) separately from config parse errors - Fix diff pipeline exit code under set -euo pipefail: capture diff output and status separately instead of relying on || fallback - Sanitize judge JSON score fields: restrict task_type to allowlist, validate winner against models_with_output, clamp numeric scores to 0-10 integer range before passing to cmd_score

marcusquinn · 2026-02-26T20:39:16Z

@coderabbitai review

coderabbitai · 2026-02-26T20:39:26Z

✅ Actions performed

Review triggered.

github-actions · 2026-02-26T20:39:41Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 20:39:37 UTC 2026: Code review monitoring started
Thu Feb 26 20:39:37 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Thu Feb 26 20:39:40 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…n (SC2001)

github-actions · 2026-02-26T20:47:55Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 20:47:51 UTC 2026: Code review monitoring started
Thu Feb 26 20:47:52 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Thu Feb 26 20:47:54 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

marcusquinn · 2026-02-26T20:55:00Z

@coderabbitai review

coderabbitai · 2026-02-26T20:55:06Z

✅ Actions performed

Review triggered.

- Replace 2>/dev/null with per-model error logs in cross-review dispatch pipeline for debuggability (addresses CodeRabbit error suppression feedback) - Consolidate winner/task_type/reasoning extraction into single Python call - Truncate reasoning to 500 chars and strip control characters - Point failure messages to per-model error log files

github-actions · 2026-02-26T21:04:52Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 21:04:48 UTC 2026: Code review monitoring started
Thu Feb 26 21:04:49 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Thu Feb 26 21:04:51 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…sset name - Escape task_type, evaluator, winner, and model_id in cmd_score SQL INSERTs to prevent SQL injection from untrusted judge JSON fields (CodeRabbit critical) - Fix detect_os_arch to output darwin-64bit (not darwin-amd64) matching actual matterbridge release asset naming convention - Replace echo|tr with bash parameter expansion for model_tier normalization

github-actions · 2026-02-26T21:30:33Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Thu Feb 26 21:30:29 UTC 2026: Code review monitoring started
Thu Feb 26 21:30:29 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Thu Feb 26 21:30:32 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…bit, not darwin-amd64)

marcusquinn · 2026-02-27T00:47:43Z

Review Feedback Resolution

All actionable review feedback from both CodeRabbit and Gemini Code Assist has been addressed across commits d239c55f..5c7d2b57.

CodeRabbit (Round 1 — `0d151bf`)

Issue	Status	Commit
macOS ARM64 not handled in `detect_os_arch`	Fixed	`02dcb75a`
Config validation runs `-version` not actual parse	Fixed	`d239c55f`
`--tail` arg parsing doesn't capture value	Fixed	`d239c55f`

CodeRabbit (Round 2 — `0582db1`)

Issue	Status	Commit
`diff` pipefail under `set -euo pipefail`	Fixed — captures exit status separately	`d239c55f`
Score persistence uses untrusted judge JSON	Fixed — task_type allowlist, winner validation, `_clamp_score` 0-10	`d239c55f`
timeout command existence/exit code 124	Fixed — `timeout_cmd` detection + gtimeout fallback + 124 handling	`d239c55f`
`--tail` shift safety	Fixed — conditional `${2:-}` check before shift 2	`d239c55f`

CodeRabbit Nitpicks (addressed)

Issue	Status	Commit
Multiple Python invocations for JSON parsing	Fixed — single `scores_line` extraction	`a841d196`
Inline Python with hardcoded filename	Fixed — reads via stdin	`02dcb75a`
Bound judge prompt size	Fixed — `max_chars_per_model=20000` with truncation marker	`a841d196`
SC2001 sed/tr → parameter expansion	Fixed	`a841d196`

Gemini Code Assist

Issue	Status	Commit
Error suppression in runner create/run/destroy	Fixed — errors redirected to per-model log files	`2760c2cc`
Other `2>/dev/null` suggestions	Evaluated — standard daemon/PID management patterns, no change needed	N/A

Additional fix

Issue	Status	Commit
Docs reference `darwin-amd64` but actual asset is `darwin-64bit`	Fixed — corrected asset names, added Apple Silicon example	`5c7d2b57`
SQL injection via unsanitized string values in `cmd_score`	Fixed — all string values escaped with `//\'/\'\'`	`abfe08da`

github-actions · 2026-02-27T00:48:01Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 00:47:56 UTC 2026: Code review monitoring started
Fri Feb 27 00:47:57 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Fri Feb 27 00:47:59 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…of /dev/null Address Gemini code review feedback: - compare-models-helper.sh: redirect Python/grep stderr to judge_err_log instead of /dev/null so parsing failures are diagnosable - matterbridge-helper.sh: capture kill errors with explicit warnings, redirect version-check stderr to log file, include curl exit code in version fetch warnings, rename local 'timeout' var to avoid shadowing the timeout command

github-actions · 2026-02-27T01:08:12Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 01:08:08 UTC 2026: Code review monitoring started
Fri Feb 27 01:08:08 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Fri Feb 27 01:08:11 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…patterns

github-actions · 2026-02-27T01:27:36Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 88 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 01:27:32 UTC 2026: Code review monitoring started
Fri Feb 27 01:27:32 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 88

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 88
VULNERABILITIES: 0

Generated on: Fri Feb 27 01:27:35 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

.agents/scripts/compare-models-helper.sh (2)

775-790: ⚠️ Potential issue | 🟠 Major

Sanitize model identifiers before reusing them as filenames/log keys.

At Line 775, only spaces are stripped. A crafted model value (e.g. containing / or ..) can escape output_dir when building ${output_dir}/${model_tier}*.{txt,json,log}.

🔧 Suggested hardening

 	for model_tier in "${model_array[@]}"; do
 		model_tier="${model_tier// /}"
 		[[ -z "$model_tier" ]] && continue
+		if [[ ! "$model_tier" =~ ^[A-Za-z0-9._-]+$ ]]; then
+			print_warning "Skipping invalid model identifier: $model_tier"
+			continue
+		fi
 
 		local runner_name="cross-review-${model_tier}-$$"

As per coding guidelines .agents/scripts/*.sh: Automation scripts should prioritize reliability and robustness.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.agents/scripts/compare-models-helper.sh around lines 775 - 790, The
model_tier value is used directly in filenames/keys (runner_name, model_err_log)
and only spaces are stripped, allowing path traversal or slashes; before reusing
model_tier build a sanitized safe_model_tier (e.g., strip leading/trailing path
components, reject or replace any chars not in [A-Za-z0-9._-] with an
underscore, and fallback to a fixed token if empty) and use that safe variable
when constructing runner_name, adding to runner_names/model_names, and when
creating model_err_log; also keep resolved_model for display but never use it
for filesystem paths. Ensure the sanitization happens immediately after
resolving/triming model_tier (references: model_tier, safe_model_tier,
runner_name, runner_names, model_names, resolve_model_tier, resolved_model,
model_err_log, output_dir).

790-825: ⚠️ Potential issue | 🟠 Major

Per-model failures are swallowed, so completion status is misleading.

At Line 794/Line 800/Line 805/Line 809, || true forces the background subshell to exit successfully; wait at Line 819 then reports done even when create/run/jq failed.

🔧 Suggested fix with accurate exit codes

 		(
+			local model_failed=0
 			"$runner_helper" create "$runner_name" \
 				--model "$model_tier" \
 				--description "Cross-review: $model_tier" \
-				--workdir "$workdir" 2>>"$model_err_log" || true
+				--workdir "$workdir" 2>>"$model_err_log" || model_failed=1
 
 			local result_file="${output_dir}/${model_tier}.txt"
 			"$runner_helper" run "$runner_name" "$prompt" \
 				--model "$model_tier" \
 				--timeout "$review_timeout" \
-				--format json 2>>"$model_err_log" >"${output_dir}/${model_tier}.json" || true
+				--format json 2>>"$model_err_log" >"${output_dir}/${model_tier}.json" || model_failed=1
 
 			# Extract text response from JSON
 			if [[ -f "${output_dir}/${model_tier}.json" ]]; then
 				jq -r '.parts[]? | select(.type == "text") | .text' \
-					"${output_dir}/${model_tier}.json" 2>>"$model_err_log" >"$result_file" || true
+					"${output_dir}/${model_tier}.json" 2>>"$model_err_log" >"$result_file" || model_failed=1
 			fi
 
 			# Clean up runner
 			"$runner_helper" destroy "$runner_name" --force 2>>"$model_err_log" || true
+			[[ -s "$result_file" ]] || model_failed=1
+			exit "$model_failed"
 		) &

As per coding guidelines .agents/scripts/*.sh: Automation scripts should use proper exit codes and robust error recovery.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.agents/scripts/compare-models-helper.sh around lines 790 - 825, The
background subshell currently swallows errors with multiple "|| true", causing
wait() to always report success; update the subshell that calls the runner
helper (the block invoking "$runner_helper" create, "$runner_helper" run, the jq
extraction, and "$runner_helper" destroy) to stop using "|| true" and instead
propagate failures by either enabling "set -e" at the top of the subshell or by
checking each command's exit code and calling "exit 1" on failure (ensure stderr
still redirects into "${model_err_log}"). Keep the subshell running in
background and appending its PID to pids[], but make sure the jq extraction only
runs if the JSON exists and exits non-zero on jq errors so the wait loop can
detect failures for each model_name in the pids/wait section.

🧹 Nitpick comments (1)

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.agents/scripts/compare-models-helper.sh:
- Around line 1867-1874: The INSERT currently interpolates numeric fields
(m_cor, m_com, m_qua, m_cla, m_adh, m_ove, m_lat, m_tok) without validation,
allowing malicious cmd_score inputs to inject SQL; before calling sqlite3 in the
loop that iterates model_entries, validate/coerce each numeric variable (m_cor,
m_com, m_qua, m_cla, m_adh, m_ove, m_lat, m_tok) to a safe numeric form (e.g.,
test with a regex like ^-?[0-9]+(\.[0-9]+)?$ and convert via printf or bash
arithmetic) and set a safe default (NULL or 0) on failure, then use those
sanitized variables in the INSERT statement that writes to comparison_scores
(keep safe_id/safe_str/safe_wea/safe_res handling); alternatively, switch to a
parameterized insertion approach using sqlite3 prepared statements or a here-doc
to bind values instead of raw interpolation.

In @.agents/services/communications/matterbridge.md:
- Around line 79-86: The macOS (Intel) download block currently uses curl to
write the binary to /usr/local/bin/matterbridge but omits making it executable;
update the macOS (Intel) section (the curl -L ... -o /usr/local/bin/matterbridge
line) to follow the Apple Silicon/Linux sections by adding a chmod +x
/usr/local/bin/matterbridge after the curl command so the downloaded binary is
executable.

---

Outside diff comments:
In @.agents/scripts/compare-models-helper.sh:
- Around line 775-790: The model_tier value is used directly in filenames/keys
(runner_name, model_err_log) and only spaces are stripped, allowing path
traversal or slashes; before reusing model_tier build a sanitized
safe_model_tier (e.g., strip leading/trailing path components, reject or replace
any chars not in [A-Za-z0-9._-] with an underscore, and fallback to a fixed
token if empty) and use that safe variable when constructing runner_name, adding
to runner_names/model_names, and when creating model_err_log; also keep
resolved_model for display but never use it for filesystem paths. Ensure the
sanitization happens immediately after resolving/triming model_tier (references:
model_tier, safe_model_tier, runner_name, runner_names, model_names,
resolve_model_tier, resolved_model, model_err_log, output_dir).
- Around line 790-825: The background subshell currently swallows errors with
multiple "|| true", causing wait() to always report success; update the subshell
that calls the runner helper (the block invoking "$runner_helper" create,
"$runner_helper" run, the jq extraction, and "$runner_helper" destroy) to stop
using "|| true" and instead propagate failures by either enabling "set -e" at
the top of the subshell or by checking each command's exit code and calling
"exit 1" on failure (ensure stderr still redirects into "${model_err_log}").
Keep the subshell running in background and appending its PID to pids[], but
make sure the jq extraction only runs if the JSON exists and exits non-zero on
jq errors so the wait loop can detect failures for each model_name in the
pids/wait section.

---

Nitpick comments:
In @.agents/scripts/matterbridge-helper.sh:
- Around line 340-350: In cmd_update, check that the binary referenced by
BINARY_PATH exists and is executable before running "$BINARY_PATH -version" (and
before creating the version_err_file); if missing, log a clear error via log and
return a non-zero status instead of letting set -euo pipefail abort. Update the
cmd_update function to perform this existence check (using -x or -f tests) prior
to the current_version assignment and reference LOG_FILE/version_err_file
handling so the early-return/cleanup path is handled cleanly.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0582db1 and 26afa19.

📒 Files selected for processing (3)

.agents/scripts/compare-models-helper.sh
.agents/scripts/matterbridge-helper.sh
.agents/services/communications/matterbridge.md

.agents/scripts/compare-models-helper.sh

.agents/services/communications/matterbridge.md

…agation, docs - Sanitize model identifiers in cross-review to prevent path traversal - Propagate per-model failures in subshells instead of swallowing with || true - Validate numeric SQL fields in cmd_score before INSERT (prevent injection) - Clamp score values to 0-10 range in cmd_score - Add chmod +x for macOS Intel binary download in matterbridge.md - Add binary existence check in cmd_update before running -version

github-actions · 2026-02-27T01:43:49Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 89 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 01:43:45 UTC 2026: Code review monitoring started
Fri Feb 27 01:43:46 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 89

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 89
VULNERABILITIES: 0

Generated on: Fri Feb 27 01:43:48 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

alex-solovyev · 2026-02-27T01:59:33Z

@coderabbitai review

All review findings have been addressed in commits d239c55 through a656c53:

Round 1 (CodeRabbit):

macOS ARM64 detection in detect_os_arch ✅
Config validation (actual config parse, not just -version) ✅
--tail argument parsing with proper shift ✅

Round 2 (CodeRabbit):

Score persistence: task_type allowlist, winner validation, score clamping ✅
Handle missing timeout command (timeout/gtimeout detection) ✅
--tail shift safety (conditional shift based on arg count) ✅
diff pipefail handling (capture exit status separately) ✅
Model identifier sanitization (reject non-alphanumeric) ✅
Per-model error propagation (model_failed flag, exit code) ✅

Round 3 (CodeRabbit):

Numeric SQL field validation in cmd_score (regex + clamp) ✅
chmod +x for macOS Intel download in docs ✅
cmd_update binary existence check ✅

Quality verification:

ShellCheck: zero violations on both scripts
All error output redirected to per-model/judge log files (not /dev/null)

coderabbitai · 2026-02-27T01:59:55Z

@alex-solovyev: Thank you for the comprehensive resolution summary! Your detailed tracking across all three review rounds demonstrates excellent attention to addressing feedback systematically.

I can see you've addressed:

Platform compatibility (macOS ARM64, timeout command detection)
Security hardening (SQL injection prevention, input validation, score clamping)
Robustness (pipefail handling, error propagation, shift safety)
Quality standards (ShellCheck clean, proper error logging)

I'll trigger a fresh review of the current state of the PR.

✅ Actions performed

Review triggered.

…ility

github-actions · 2026-02-27T02:06:09Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 89 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 02:06:04 UTC 2026: Code review monitoring started
Fri Feb 27 02:06:05 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 89

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 89
VULNERABILITIES: 0

Generated on: Fri Feb 27 02:06:08 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

marcusquinn · 2026-02-27T02:11:22Z

@coderabbitai review

coderabbitai · 2026-02-27T02:11:27Z

✅ Actions performed

Review triggered.

alex-solovyev · 2026-02-27T02:27:50Z

All CodeRabbit and Gemini Code Assist review findings have been addressed in the existing commits on this branch:

CodeRabbit critical/major (all marked 'Addressed'):

cmd_logs --tail arg parsing: Fixed with while-loop + conditional shift (handles missing value gracefully)
Config validation binary invocation: Proper timeout detection (timeout/gtimeout), exit code 124 handling, TOML parse error detection
detect_os_arch macOS arm64: Now correctly returns darwin-arm64 for Apple Silicon
diff pipefail fix: Captures diff output/status separately to avoid false 'identical' message
Judge-sourced SQL field validation: task_type allowlist, winner validation against models_with_output
Score clamping: _clamp_score helper (0-10 range)
Numeric SQL field validation in cmd_score: regex validation + range clamping
Python subprocess efficiency: Single call for winner/task_type/reasoning
matterbridge.md: Added missing chmod +x for macOS Intel

Gemini Code Assist medium-priority (all already fixed in code):

All 2>/dev/null suppression patterns replaced with proper error logging to per-model/judge error log files
get_latest_version now captures curl output and checks exit status
cmd_stop captures kill errors instead of suppressing
cmd_update redirects version stderr to error file

ShellCheck passes clean on both scripts. Requesting re-review.

@coderabbitai review

coderabbitai · 2026-02-27T02:28:15Z

@alex-solovyev: Excellent work on the comprehensive resolution tracking! Your systematic approach to addressing all review findings across multiple rounds is exemplary.

I can see you've thoroughly addressed:

Security & Robustness:

SQL injection prevention (field validation, escaping, allowlists)
Input validation and sanitization (score clamping, model identifier checks)
Error propagation and proper exit codes

Platform Compatibility:

macOS ARM64 detection
Timeout command detection (timeout/gtimeout fallback)
Correct binary asset naming

Code Quality:

Pipefail handling with separate status capture
Proper argument parsing with safety checks
Error logging to appropriate files (not /dev/null)
ShellCheck clean

I'll trigger a fresh review of the current state.

✅ Actions performed

Review triggered.

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (1)

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.agents/scripts/compare-models-helper.sh:
- Around line 845-857: _clamp_score currently strips all non-digits causing
decimals like "8.5" to become "85" and be clamped to 10; update the _clamp_score
function to preserve a single decimal point (e.g., allow [0-9.] but collapse
multiple dots), validate the resulting value is a proper number, and then
perform numeric comparison/clamping using a numeric-aware tool (awk/bc/shell
arithmetic) so values between 0 and 10 (including decimals) are correctly
returned; apply the same change to the duplicate logic referenced around lines
881-886 so both places use the same decimal-preserving validation and clamping
behavior.
- Around line 742-743: The script uses the unvalidated variable judge_model when
building filenames and invoking runners (e.g., local
judge_output_file="${output_dir}/judge-${judge_model}.json"), which allows
unsafe characters (like '/') to break paths; add a validation/sanitization step
(e.g., a function validate_or_sanitize_judge_model that rejects or transforms
inputs containing slashes, path traversal, nulls, or other unsafe chars into a
safe token) run immediately after parsing --judge, call it before any use of
judge_model, and exit non‑zero with a clear error if validation fails; apply the
same validation/usage fix to all other places you interpolate judge_model or
pass it to runner calls (the blocks around the current uses at lines cited: 742,
749-756, and 967-973).

---

Nitpick comments:
In @.agents/scripts/matterbridge-helper.sh:
- Around line 340-349: In cmd_update, the temporary error file stored in
version_err_file uses LOG_FILE (which lives under DATA_DIR) but DATA_DIR may not
exist; before creating version_err_file invoke a check/ensure that DATA_DIR (or
the parent directory of LOG_FILE) exists (mkdir -p) so writing
"$version_err_file" won't fail; update the cmd_update function to create the
directory if missing (use the same DATA_DIR/LOG_FILE variables so behavior is
consistent) before calling "$BINARY_PATH" -version.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 26afa19 and b757969.

📒 Files selected for processing (3)

.agents/scripts/compare-models-helper.sh
.agents/scripts/matterbridge-helper.sh
.agents/services/communications/matterbridge.md

.agents/scripts/compare-models-helper.sh

…el validation, ensure_dirs - Fix _clamp_score to handle decimals correctly (8.5 was parsed as 85, now properly rounds to 9 via printf '%.0f') - Validate judge_model identifier at parse time and before use in filenames to prevent path traversal (reject chars outside [A-Za-z0-9._-]) - Add ensure_dirs call in cmd_update to create DATA_DIR before writing version error temp file

alex-solovyev · 2026-02-27T02:41:20Z

@coderabbitai review

coderabbitai · 2026-02-27T02:41:30Z

✅ Actions performed

Review triggered.

github-actions · 2026-02-27T02:41:48Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 53 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 02:41:43 UTC 2026: Code review monitoring started
Fri Feb 27 02:41:44 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 53

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 53
VULNERABILITIES: 0

Generated on: Fri Feb 27 02:41:46 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

…R before temp write Address remaining CodeRabbit review feedback: - Sanitize judge_model identifier before using as filename/runner name (prevents path traversal via crafted --judge values) - Fix _clamp_score decimal handling: '8.5' was becoming '85' -> 10; now truncates at decimal point first ('8.5' -> '8') - Ensure DATA_DIR exists in cmd_update before writing version_err_file (prevents failure when user runs update without prior setup)

github-actions · 2026-02-27T02:56:06Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 53 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 02:56:01 UTC 2026: Code review monitoring started
Fri Feb 27 02:56:02 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 53

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 53
VULNERABILITIES: 0

Generated on: Fri Feb 27 02:56:04 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

.agents/scripts/compare-models-helper.sh (2)

58-67: ⚠️ Potential issue | 🟡 Minor

SQL injection risk in pattern tracker queries.

The $task_type and $tier variables are interpolated directly into SQLite queries without escaping. While tier is often validated upstream against PATTERN_VALID_MODELS, task_type can contain arbitrary user input with SQL metacharacters (', %).

🔧 Suggested fix: escape single quotes

 _get_tier_pattern_counts() {
 	local tier="$1"
 	local task_type="${2:-}"
 	[[ -f "$PATTERN_DB" ]] || {
 		echo "0|0"
 		return 0
 	}

+	# Escape single quotes for SQL safety
+	local safe_tier="${tier//\'/\'\'}"
+	local safe_task_type="${task_type//\'/\'\'}"
+
 	local filter=""
 	if [[ -n "$task_type" ]]; then
-		filter="AND (tags LIKE '%${task_type}%' OR content LIKE '%task:${task_type}%')"
+		filter="AND (tags LIKE '%${safe_task_type}%' OR content LIKE '%task:${safe_task_type}%')"
 	fi

-	local model_filter="AND (tags LIKE '%model:${tier}%' OR content LIKE '%model:${tier}%')"
+	local model_filter="AND (tags LIKE '%model:${safe_tier}%' OR content LIKE '%model:${safe_tier}%')"

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.agents/scripts/compare-models-helper.sh around lines 58 - 67, Sanitize
variables interpolated into the sqlite3 queries to prevent SQL injection: before
building model_filter and filter, create escaped versions of task_type and tier
by replacing any single quote ' with two single quotes '' (e.g.,
escaped_task_type and escaped_tier) and use those escaped variables in the query
construction (replace references to $task_type and $tier in filter and
model_filter with the escaped names); keep using PATTERN_DB and the same SELECTs
for successes and failures but ensure all direct interpolations use the escaped
variables.

2039-2047: ⚠️ Potential issue | 🟡 Minor

SQL injection risk in results filtering.

The model_filter and type_filter variables from command-line arguments are interpolated directly into SQL WHERE clauses without escaping. A malicious --model value could inject arbitrary SQL.

🔧 Suggested fix: escape filter values

 	local where_clause=""
+	local safe_model_filter="${model_filter//\'/\'\'}"
+	local safe_type_filter="${type_filter//\'/\'\'}"
 	if [[ -n "$model_filter" ]]; then
-		where_clause="WHERE cs.model_id LIKE '%${model_filter}%'"
+		where_clause="WHERE cs.model_id LIKE '%${safe_model_filter}%'"
 	fi
 	if [[ -n "$type_filter" ]]; then
 		if [[ -n "$where_clause" ]]; then
-			where_clause="$where_clause AND c.task_type = '$type_filter'"
+			where_clause="$where_clause AND c.task_type = '${safe_type_filter}'"
 		else
-			where_clause="WHERE c.task_type = '$type_filter'"
+			where_clause="WHERE c.task_type = '${safe_type_filter}'"
 		fi
 	fi

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.agents/scripts/compare-models-helper.sh around lines 2039 - 2047, The WHERE
clause currently interpolates raw model_filter and type_filter into SQL,
creating an injection risk; sanitize these inputs before building where_clause
by escaping single quotes (e.g., replace ' with '' ) and use the sanitized
variables (sanitized_model_filter, sanitized_type_filter) when composing
where_clause for the cs.model_id LIKE and c.task_type comparisons, or better yet
switch to a parameterized query if the script invokes psql/psql variables;
update the code that sets where_clause (the blocks referencing model_filter,
type_filter, and where_clause) to use the sanitized variables instead of the raw
ones.

🧹 Nitpick comments (3)

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Outside diff comments:
In @.agents/scripts/compare-models-helper.sh:
- Around line 58-67: Sanitize variables interpolated into the sqlite3 queries to
prevent SQL injection: before building model_filter and filter, create escaped
versions of task_type and tier by replacing any single quote ' with two single
quotes '' (e.g., escaped_task_type and escaped_tier) and use those escaped
variables in the query construction (replace references to $task_type and $tier
in filter and model_filter with the escaped names); keep using PATTERN_DB and
the same SELECTs for successes and failures but ensure all direct interpolations
use the escaped variables.
- Around line 2039-2047: The WHERE clause currently interpolates raw
model_filter and type_filter into SQL, creating an injection risk; sanitize
these inputs before building where_clause by escaping single quotes (e.g.,
replace ' with '' ) and use the sanitized variables (sanitized_model_filter,
sanitized_type_filter) when composing where_clause for the cs.model_id LIKE and
c.task_type comparisons, or better yet switch to a parameterized query if the
script invokes psql/psql variables; update the code that sets where_clause (the
blocks referencing model_filter, type_filter, and where_clause) to use the
sanitized variables instead of the raw ones.

---

Nitpick comments:
In @.agents/scripts/compare-models-helper.sh:
- Line 779: The grep using a greedy regex "{.*}" when setting judge_json can
capture across multiple JSON-like segments and produce malformed output; update
the extraction to use a non-greedy/posix-safe pattern or remove this grep step
and rely on the existing Python-based extraction instead. Specifically, change
the logic around the judge_json assignment that reads from "$judge_output_file"
(and writes to "$judge_err_log") so it either uses a non-greedy/lowest-match
regex (e.g., match the smallest {...} per line) or simply call the Python
extractor directly to populate judge_json, keeping the variable name judge_json
and preserving error redirection to judge_err_log.
- Around line 746-750: Remove the redundant judge_model validation block shown
(the second check that uses print_warning and returns 0) and rely on the
existing validation already performed earlier for judge_model; keep only the
original validation that logs via print_error and returns 1 on invalid input so
behavior is consistent. Specifically, delete the duplicated if [[ !
"$judge_model" =~ ^[A-Za-z0-9._-]+$ ]] { ... } snippet and ensure the
remaining/earlier check for judge_model (the first validation near the top of
the function) is the single source-of-truth for sanitization and error handling.

In @.agents/scripts/matterbridge-helper.sh:
- Around line 345-348: There are two consecutive calls to ensure_dirs
(duplicated at the start of the block) — remove the redundant second call and
keep a single invocation of ensure_dirs before writing temp files so
DATA_DIR/LOG_FILE are guaranteed to exist; update or retain the comment ("Ensure
DATA_DIR exists before writing temp files (LOG_FILE lives under DATA_DIR)") to
sit immediately above the single ensure_dirs call to preserve intent.

ℹ️ Review info

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b757969 and 465cee9.

📒 Files selected for processing (2)

.agents/scripts/compare-models-helper.sh
.agents/scripts/matterbridge-helper.sh

- Remove duplicate ensure_dirs call in matterbridge-helper.sh cmd_update - Escape model_filter and type_filter in cmd_results to prevent SQL injection - Validate --limit is numeric before interpolating into SQL LIMIT clause

github-actions · 2026-02-27T02:59:32Z

🔍 Code Quality Report

�[0;35m[MONITOR]�[0m Code Review Monitoring Report

�[0;34m[INFO]�[0m Latest Quality Status:
SonarCloud: 0 bugs, 0 vulnerabilities, 53 code smells

�[0;34m[INFO]�[0m Recent monitoring activity:
Fri Feb 27 02:59:28 UTC 2026: Code review monitoring started
Fri Feb 27 02:59:29 UTC 2026: SonarCloud - Bugs: 0, Vulnerabilities: 0, Code Smells: 53

📈 Current Quality Metrics

BUGS: 0
CODE SMELLS: 53
VULNERABILITIES: 0

Generated on: Fri Feb 27 02:59:31 UTC 2026

Generated by AI DevOps Framework Code Review Monitoring

sonarqubecloud · 2026-02-27T03:03:58Z

Quality Gate passed

Issues
4 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

coderabbitai bot requested changes Feb 26, 2026

View reviewed changes

.agents/scripts/matterbridge-helper.sh Show resolved Hide resolved

.agents/scripts/matterbridge-helper.sh Outdated Show resolved Hide resolved

.agents/scripts/matterbridge-helper.sh Show resolved Hide resolved

gemini-code-assist bot reviewed Feb 26, 2026

View reviewed changes

marcusquinn force-pushed the feature/t1328-t1329-matterbridge-cross-review branch from 0d151bf to 3214437 Compare February 26, 2026 19:58

coderabbitai bot requested changes Feb 26, 2026

View reviewed changes

.agents/scripts/compare-models-helper.sh Outdated Show resolved Hide resolved

.agents/scripts/compare-models-helper.sh Outdated Show resolved Hide resolved

.agents/scripts/matterbridge-helper.sh Show resolved Hide resolved

.agents/scripts/matterbridge-helper.sh Show resolved Hide resolved

fix: bound judge payload size and replace sed with parameter expansio…

a841d19

…n (SC2001)

fix: correct macOS binary asset names in matterbridge docs (darwin-64…

5c7d2b5

…bit, not darwin-amd64)

fix: resolve SC2119/SC2120 — pass explicit empty arg to get_all_tier_…

26afa19

…patterns

coderabbitai bot requested changes Feb 27, 2026

View reviewed changes

.agents/scripts/compare-models-helper.sh Outdated Show resolved Hide resolved

.agents/services/communications/matterbridge.md Show resolved Hide resolved

refactor: move _cross_review_judge_score before its caller for readab…

b757969

…ility

coderabbitai bot requested changes Feb 27, 2026

View reviewed changes

.agents/scripts/compare-models-helper.sh Show resolved Hide resolved

.agents/scripts/compare-models-helper.sh Show resolved Hide resolved

coderabbitai bot reviewed Feb 27, 2026

View reviewed changes

coderabbitai bot approved these changes Feb 27, 2026

View reviewed changes

marcusquinn merged commit cb405fa into main Feb 27, 2026
12 of 13 checks passed

github-actions bot mentioned this pull request Feb 27, 2026

t1328: Matterbridge agent for multi-platform chat bridging #2254

Closed

6 tasks

Conversation

marcusquinn commented Feb 26, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes by Category

t1328 — Matterbridge Agent (core deliverable)

t1329 — Cross-Review Pipeline

Script Reorganization

TODO.md State Reset

Framework Changes

Review Notes

Stats

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested labels

Poem

❌ Failed checks (2 warnings)

Uh oh!

gemini-code-assist bot commented Feb 26, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Feb 26, 2026

🔍 Code Quality Report

📈 Current Quality Metrics

Uh oh!

github-actions bot commented Feb 26, 2026

🔍 Code Quality Report

📈 Current Quality Metrics

Uh oh!

github-actions bot commented Feb 26, 2026

marcusquinn commented Feb 26, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 26, 2026 •

edited

Loading

CodeRabbit (Round 1 — `0d151bf`)

CodeRabbit (Round 2 — `0582db1`)