fix: CLI crash + auto-generate YAML config templates from schema by viraatc · Pull Request #237 · mlcommons/endpoints

viraatc · 2026-04-01T21:56:51Z

Summary

Bug fix: CLI crash on --load-pattern + --target-qps (IndexError) — LoadPattern.type used alias= instead of name= on cyclopts.Parameter
Template generation: regenerate_templates.py auto-generates minimal + full YAML templates from Pydantic schema defaults
Pre-commit hook: auto-regenerates templates on schema/config changes; CI validates via --check mode

Templates

Two variants per mode (offline, online, concurrency):

_template.yaml — minimal: only required fields + <PLACEHOLDER eg: example> values
_template_full.yaml — all fields with schema defaults, inline # options: comments auto-discovered from Enum/Literal annotations, 2 dataset examples (perf + accuracy with parser columns), multiple endpoints

Init command

inference-endpoint init offline always generates the full template. Available types: offline, online, concurrency.

Tests

Hypothesis fuzz: 4000 random CLI flag combinations across offline/online/concurrency
Unit: all 8 templates (including eval/submission) validate via from_yaml_file (auto-discovered with glob)
Integration (e2e): every generated template is loaded, <> placeholders stripped to real values, and run end-to-end against the echo server — verifying templates produce functional configs
Init command: generates template + tested

Docs updated

AGENTS.md, CLI_QUICK_REFERENCE.md, DEVELOPMENT.md — all stale template/init references fixed

Test plan

python scripts/regenerate_templates.py --check passes
pytest tests/unit/ — 484 passed
pytest tests/integration/commands/test_benchmark_command.py::TestTemplateIntegration — 6 templates run e2e
Pre-commit hooks pass (ruff, mypy, prettier, template regeneration)

🤖 Generated with Claude Code

github-actions · 2026-04-01T21:57:02Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Copilot

Pull request overview

Fixes a cyclopts CLI parsing crash triggered when --load-pattern is combined with load-pattern subfields like --target-qps / --concurrency in the online benchmark command.

Changes:

Annotates LoadPattern to adjust how cyclopts maps nested parameters (@cyclopts.Parameter(name="*")).
Updates the CLI parameter definition for LoadPattern.type to avoid the prior name collision.

Comments suppressed due to low confidence (1)

src/inference_endpoint/config/schema.py:360

This change is a regression fix for a CLI crash when combining --load-pattern with nested load-pattern fields (e.g. --target-qps). There’s existing automated test coverage for config validation in tests/unit/commands/test_benchmark.py, but no test currently exercises cyclopts parsing for this flag combination.

Add a regression test that parses benchmark online ... --load-pattern poisson --target-qps 100 (or directly parses OnlineBenchmarkConfig via cyclopts) and asserts it no longer raises and that config.settings.load_pattern.type/target_qps are set as expected.

@cyclopts.Parameter(name="*")
class LoadPattern(BaseModel):
    """Load pattern configuration.

    Different patterns use target_qps differently:
    - max_throughput: target_qps used for calculating total queries (offline, optional with default)
    - poisson: target_qps sets scheduler rate (online, required - validated)
    - concurrency: issue at fixed target_concurrency (online, required - validated)
    """

    model_config = ConfigDict(extra="forbid", frozen=True)

    type: Annotated[
        LoadPatternType,
        cyclopts.Parameter(name="--load-pattern", help="Load pattern type"),
    ] = LoadPatternType.MAX_THROUGHPUT
    target_qps: Annotated[
        float | None, cyclopts.Parameter(alias="--target-qps", help="Target QPS")
    ] = Field(None, gt=0)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/inference_endpoint/config/schema.py

gemini-code-assist

Code Review

This pull request modifies the LoadPattern class in the configuration schema by applying a class-level cyclopts.Parameter decorator and updating the type field's parameter definition to use the name argument instead of alias. I have no feedback to provide.

arekay-nv

Can we also add a test for this - seems like a change that shouldn't have gone in.

tests/integration/commands/test_cli.py

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

.github/workflows/test.yml

tests/integration/commands/test_cli.py

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

.github/workflows/test.yml

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

.pre-commit-config.yaml

viraatc

duplicate

tests/integration/commands/test_cli.py

src/inference_endpoint/config/templates/offline_template.yaml

.github/workflows/test.yml

.pre-commit-config.yaml

viraatc · 2026-04-03T11:51:28Z

Review Council — Multi-AI Code Review

Reviewed by: Claude (Codex ran but produced investigation output, not structured findings) | Depth: standard

Found 3 issues across 3 files:

1 high (fixed)
1 medium (already fixed)
1 low (deferred)

#	File	Line	Severity	Category	Summary
1	`scripts/regenerate_templates.py`	95	high	error-handling	Pre-commit hook exited 0 on template generation failure — stale files could slip through. Fixed: now tracks failures and `sys.exit(1)`.
2	`.github/workflows/test.yml`	61	medium	security	Unpinned action SHAs in `schema-updated` job. Already fixed in latest push.
3	`tests/integration/commands/test_cli.py`	76	low	testing	`Optional` union types (`float

Also addressed all Copilot review comments (pinned SHAs, quoted pip install, heredoc for inline Python, expanded pre-commit files: regex, added except comment).

viraatc

added new schema-updated CI:

fuzz tests on CLI in CI
template validated against schema default in CI

NOTE: template now includes all supported fields

was pending items from past.
++ @rashid for thoughts?

tests/integration/commands/test_cli.py

viraatc · 2026-04-03T12:16:54Z

src/inference_endpoint/config/templates/offline_template.yaml

+type: offline
 model_params:
-  name: "meta-llama/Llama-3.1-8B-Instruct"
+  name: '<MODEL_NAME eg: meta-llama/Llama-3.1-8B-Instruct>'


Templates auto-generated from schema defaults by scripts/regenerate_templates.py.
Full YAML spec with placeholder overrides (model name, dataset)

Pre-commit validates templates are valid locally.
CI checks if they're up to date — if stale it will suggest to, run python scripts/regenerate_templates.py.

Is this overkill? Should we drop?

This creates more burden to the end user to understand all the flags, and we should keep the template simple. cc: @arekay-nv to review as well

I think this makes reproduction easier. Only the flags with values in <> need to be specified, all others have defaults.
We can even make this part of a pre-commit to have an integration test that takes in these templates, substitutes for an echo server just to make sure that the minimal version runs.

For beginners, they can always use the minimal command line which already has the defaults baked in.

.github/workflows/test.yml

.pre-commit-config.yaml

Copilot

Pull request overview

Copilot reviewed 11 out of 11 changed files in this pull request and generated 6 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/inference_endpoint/config/templates/concurrency_template.yaml

tests/integration/commands/test_cli.py

.github/workflows/test.yml

docs/DEVELOPMENT.md

Falls back to handwritten _template.yaml when _template_full.yaml doesn't exist (eval, submission). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-08T21:30:47Z

scripts/regenerate_templates.py

+def _build_minimal_template(test_type: TestType, overrides: dict) -> dict:
+    """Only the fields a user must provide. 1 dataset example."""
+    name = overrides.pop("name", None) or f"{test_type.value}_benchmark"
+    data: dict[str, object] = {
+        "name": name,
+        "type": test_type.value,
+        "model_params": {"name": _PLACEHOLDER_MODEL},
+        "datasets": [_PERF_DATASET],
+        "endpoint_config": {"endpoints": _PLACEHOLDER_ENDPOINTS_MINIMAL},
+    }
+
+    if overrides:
+        data = _deep_merge(data, overrides)
+
+    return data
+
+
+# ---------------------------------------------------------------------------
+# Main
+# ---------------------------------------------------------------------------
+
+
+def main(check_only: bool = False):
+    """Regenerate templates, or check they're up to date (--check flag)."""
+    comments = _collect_options(BenchmarkConfig) | _EXTRA_COMMENTS
+    stale = False
+
+    for name, (test_type, overrides) in _TEMPLATES.items():
+        model_cls = _MODEL_FOR_TYPE[test_type]
+
+        variants = {
+            f"{name}_template.yaml": _build_minimal_template(test_type, overrides),
+            f"{name}_template_full.yaml": _build_full_template(model_cls, overrides),
+        }


In _build_minimal_template(), overrides is mutated via overrides.pop("name", ...). Since main() passes the same overrides dict to both minimal and full builders, this side effect drops the "name" override for the full template (and also mutates the module-level _TEMPLATES mapping). Avoid mutating overrides (e.g., use overrides.get("name") and merge without popping) and/or pass a copy/deepcopy of overrides into each builder so full/minimal variants are independent.

Copilot · 2026-04-08T21:30:48Z

src/inference_endpoint/config/templates/concurrency_template_full.yaml

+name: online_benchmark
+version: '1.0'
+type: online


This full concurrency template is named "online_benchmark" even though the minimal concurrency template and CLI type imply it should be a concurrency-specific template (e.g., "concurrency_benchmark"). Regenerating after fixing the overrides mutation in scripts/regenerate_templates.py should produce a consistent name here.

Suggested change

name: online_benchmark

version: '1.0'

type: online

name: concurrency_benchmark

version: '1.0'

type: concurrency

Copilot · 2026-04-08T21:30:48Z

tests/integration/commands/test_benchmark_command.py

+
+    # Cap total samples so test finishes in seconds
+    data.setdefault("settings", {})
+    data["settings"].setdefault("runtime", {})
+    data["settings"]["runtime"]["n_samples_to_issue"] = 10
+


_resolve_template() caps n_samples_to_issue, but it doesn't override settings.client defaults. Because HTTPClientConfig resolves num_workers=-1 to an auto value (often 10+) during validation, these template E2E runs can spawn many worker processes and become slow/flaky in CI. Consider forcing small client settings here (e.g., num_workers=1, warmup_connections=0, max_connections=10) and/or disabling enable_cpu_affinity for the generated config used in this test.

Copilot · 2026-04-08T21:30:48Z

docs/DEVELOPMENT.md

+Config templates in `src/inference_endpoint/config/templates/` are auto-generated from schema defaults. When you change `config/schema.py`, regenerate them:
+
+```bash
+python scripts/regenerate_templates.py
+```
+
+The pre-commit hook auto-regenerates templates when `schema.py`, `config.py`, or `regenerate_templates.py` change. CI validates templates are up to date via `--check` mode.
+


Docs mention a pre-commit trigger on config.py, but the actual hook targets src/inference_endpoint/endpoint_client/config.py (and other files). Please update this list so contributors know which changes will regenerate templates.

- Reverted eval/submission from auto-generation (schema doesn't support them yet). Handwritten templates kept, init falls back to them. - regenerate_templates.py auto-detects CI env var → check-only mode - Pre-commit files trigger now includes templates/ dir Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-08T21:39:06Z

scripts/regenerate_templates.py

+def _build_minimal_template(test_type: TestType, overrides: dict) -> dict:
+    """Only the fields a user must provide. 1 dataset example."""
+    name = overrides.pop("name", None) or f"{test_type.value}_benchmark"
+    data: dict[str, object] = {
+        "name": name,
+        "type": test_type.value,
+        "model_params": {"name": _PLACEHOLDER_MODEL},
+        "datasets": [_PERF_DATASET],
+        "endpoint_config": {"endpoints": _PLACEHOLDER_ENDPOINTS_MINIMAL},
+    }
+
+    if overrides:
+        data = _deep_merge(data, overrides)
+
+    return data
+
+
+# ---------------------------------------------------------------------------
+# Main
+# ---------------------------------------------------------------------------
+
+
+def main(check_only: bool = False):
+    """Regenerate templates, or check they're up to date.
+
+    Locally (pre-commit): regenerates files, pre-commit detects the diff.
+    CI: auto-detects ``CI`` env var and switches to check-only mode.
+    Explicit: ``--check`` flag forces check-only mode.
+    """
+    if os.environ.get("CI"):
+        check_only = True
+    comments = _collect_options(BenchmarkConfig) | _EXTRA_COMMENTS
+    stale = False
+
+    for name, (test_type, overrides) in _TEMPLATES.items():
+        model_cls = _MODEL_FOR_TYPE[test_type]
+
+        variants = {
+            f"{name}_template.yaml": _build_minimal_template(test_type, overrides),
+            f"{name}_template_full.yaml": _build_full_template(model_cls, overrides),
+        }


_build_minimal_template mutates the shared overrides dict via overrides.pop("name", ...). In main(), the same overrides instance is then reused to build the full template, so the full template can silently lose overrides (e.g., concurrency template loses its name override and ends up as online_benchmark). Avoid mutating the input overrides: pass a copy per variant (deepcopy if nested) and/or change _build_minimal_template to read overrides.get("name") without popping.

Copilot · 2026-04-08T21:39:07Z

src/inference_endpoint/config/templates/concurrency_template_full.yaml

@@ -0,0 +1,86 @@
+name: online_benchmark


name is online_benchmark here, but this is the concurrency full template. It should match the concurrency variant (e.g. concurrency_benchmark) so users don’t start from a misleading config name.

Suggested change

name: online_benchmark

name: concurrency_benchmark

templates/ → templates/.* so the regex matches actual file paths. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…plates Generates cleaner YAML — all defaults present, no null noise. eval/submission still copy handwritten templates (schema not ready). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-08T22:00:59Z

scripts/regenerate_templates.py

+
+def _build_minimal_template(test_type: TestType, overrides: dict) -> dict:
+    """Only the fields a user must provide. 1 dataset example."""
+    name = overrides.pop("name", None) or f"{test_type.value}_benchmark"


_build_minimal_template() mutates the shared overrides dict via pop("name", ...). Since main() passes the same overrides instance to both the minimal and full variants, the full template can silently lose overrides (e.g., the concurrency full template’s name override). Treat overrides as immutable here (don’t pop), or pass a deep copy into each builder.

Suggested change

name = overrides.pop("name", None) or f"{test_type.value}_benchmark"

name = overrides.get("name", None) or f"{test_type.value}_benchmark"

Copilot · 2026-04-08T22:00:59Z

src/inference_endpoint/commands/init.py

+    output_path = f"{template_type}_template.yaml"
    output_file = Path(output_path)
+


The init command currently always writes <mode>_template.yaml (and for offline/online/concurrency it generates a model_dump(exclude_none=True)), which doesn’t match the repo’s auto-generated “full” templates (*_template_full.yaml with placeholders/options comments) described in the PR/docs. If init is intended to generate the full template, consider copying <mode>_template_full.yaml from TEMPLATES_DIR (or reusing the regeneration logic) instead of emitting this minimal/partial variant.

Walks BenchmarkConfig and all nested models, renders markdown tables with field name, type, default, and description. Pre-commit hook auto-regenerates on schema changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-08T22:17:29Z

scripts/regenerate_templates.py

+    for name, (test_type, overrides) in _TEMPLATES.items():
+        model_cls = _MODEL_FOR_TYPE[test_type]
+
+        variants = {
+            f"{name}_template.yaml": _build_minimal_template(test_type, overrides),
+            f"{name}_template_full.yaml": _build_full_template(model_cls, overrides),
+        }


overrides from _TEMPLATES is passed into both _build_minimal_template(...) and _build_full_template(...) but is the same dict object. Because the minimal builder mutates it, the full builder can produce incorrect output. Consider copying overrides (or otherwise ensuring immutability) per variant before calling the builders.

Copilot · 2026-04-08T22:17:29Z

src/inference_endpoint/main.py

+    """Generate config template.
+
+    Args:
+        template: Template type (offline, online, concurrency).
+    """


The init command docstring claims valid template types are only (offline, online, concurrency), but execute_init() also accepts eval and submission. Either include eval/submission in the docstring (and CLI help) or remove them from the supported types to avoid misleading users.

Copilot · 2026-04-08T22:17:29Z

src/inference_endpoint/commands/init.py

+    """Generate YAML config template.
+
+    For offline/online/concurrency: generates via model_dump(exclude_none=True).
+    For eval/submission: copies handwritten template files.

-    if template_type not in TEMPLATE_FILES:
+    Args:
+        template_type: One of "offline", "online", "concurrency", "eval", "submission".
+    """
+    if template_type not in VALID_TYPES:
        raise InputValidationError(
            f"Unknown template type: {template_type}. "
-            f"Available: {', '.join(TEMPLATE_FILES.keys())}"
+            f"Available: {', '.join(sorted(VALID_TYPES))}"
        )

-    template_file = TEMPLATES_DIR / TEMPLATE_FILES[template_type]
+    output_path = f"{template_type}_template.yaml"
    output_file = Path(output_path)
+
    if output_file.exists():
        logger.warning(f"File exists: {output_path} (will be overwritten)")

    try:
-        if not template_file.exists():
-            logger.info("Generating from BenchmarkConfig.create_default_config...")
-            config = BenchmarkConfig.create_default_config(
-                TEMPLATE_TYPE_MAP[template_type]
-            )
-            config_dict = config.model_dump(exclude_none=True)
-            with open(output_path, "w") as f:
-                yaml.dump(config_dict, f, default_flow_style=False, sort_keys=False)
-            logger.info(f"Generated: {output_path}")
-        else:
+        # TODO(vir):
+        # generate these automatically when support is added
+        # for now just copy over hand-written templates
+        if template_type in _HANDWRITTEN:
+            template_file = TEMPLATES_DIR / _HANDWRITTEN[template_type]
+            if not template_file.exists():
+                raise SetupError(f"Template file not found: {template_file}")
            shutil.copy(template_file, output_path)
-            logger.info(f"Created from template: {output_path}")
-
-    except NotImplementedError as e:
-        logger.error(str(e))
-        if template_file.exists():
-            shutil.copy(template_file, output_path)
-            logger.info(f"Created from template: {output_path}")
        else:
-            raise SetupError(f"Template file not found: {template_file}") from e
+            config = BenchmarkConfig.create_default_config(_TYPE_MAP[template_type])
+            if template_type == "concurrency":
+                config = config.with_updates(
+                    name="concurrency_benchmark",
+                    settings=OnlineSettings(
+                        load_pattern=LoadPattern(
+                            type=LoadPatternType.CONCURRENCY,
+                            target_concurrency=32,
+                        ),
+                    ),
+                )
+            data = config.model_dump(mode="json", exclude_none=True)
+            with open(output_path, "w") as f:
+                yaml.dump(data, f, default_flow_style=False, sort_keys=False)
+        logger.info(f"Created: {output_path}")


execute_init() currently writes <type>_template.yaml by dumping BenchmarkConfig.create_default_config(...) with exclude_none=True, which omits optional fields that default to None. This does not match the PR description/docs that say init “always generates the full template” / “all fields”, nor does it match the committed _template_full.yaml variants (and placeholder format differs: <MODEL_NAME> vs <MODEL_NAME eg: ...>). Consider either generating the full variant (e.g., exclude_none=False and/or writing <type>_template_full.yaml) or updating the docs/PR summary to reflect the actual behavior.

- Pin actions/checkout and setup-python to SHAs in CI - Quote pip install extras to avoid shell globbing - Add comment to empty except in fuzz test - Fix overrides.pop() → .get() to prevent dict mutation across minimal/full template builders (concurrency_template_full.yaml was getting wrong name) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Full templates now show CLI help text as inline comments for every field that has a description (from Field(description=) or cyclopts.Parameter(help=)) - Minimal templates include settings.runtime (min_duration_ms, max_duration_ms, n_samples_to_issue) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 21 out of 21 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-09T00:33:39Z

src/inference_endpoint/main.py

+    """Generate config template.
+
+    Args:
+        template: Template type (offline, online, concurrency).


The init command docstring lists only (offline, online, concurrency), but execute_init() accepts eval and submission too (and the CLI error message advertises them). Please update the docstring to reflect the full set of supported template types, or remove support if it’s intentionally hidden.

Suggested change

template: Template type (offline, online, concurrency).

template: Template type (offline, online, concurrency, eval, submission).

Copilot · 2026-04-09T00:33:40Z

tests/unit/commands/test_benchmark.py

    @pytest.mark.parametrize(
        "template",
-        [
-            "concurrency_template.yaml",
-            "eval_template.yaml",
-            "offline_template.yaml",
-            "online_template.yaml",
-            "submission_template.yaml",
-        ],
+        sorted(
+            p.name
+            for p in (
+                Path(__file__).parent.parent.parent.parent
+                / "src"
+                / "inference_endpoint"
+                / "config"
+                / "templates"
+            ).glob("*_template*.yaml")
+        ),


This parametrization becomes a no-op if the glob matches zero files (pytest will collect 0 cases and the template validation won’t run). To avoid silently skipping template validation, consider asserting the discovered list is non-empty (and/or using the existing TEMPLATE_DIR.glob(...) to avoid duplicating the templates path).

Copilot · 2026-04-09T00:33:40Z

docs/CLI_QUICK_REFERENCE.md

-# Generate config templates
-inference-endpoint init offline        # or: online, eval, submission
-inference-endpoint init submission
+# Generate config template (all fields with defaults)


This line claims init generates a template with “all fields with defaults”, but the implementation currently dumps with exclude_none=True, which omits fields whose defaults are None. Either adjust the wording (e.g., “common defaults”) or update init to emit the full template including null defaults.

Suggested change

# Generate config template (all fields with defaults)

# Generate config template (common defaults; fields with null/None defaults may be omitted)

Copilot AI review requested due to automatic review settings April 1, 2026 21:56

viraatc requested a review from a team as a code owner April 1, 2026 21:56

github-actions bot requested review from arekay-nv and nvzhihanj April 1, 2026 21:57

Copilot started reviewing on behalf of viraatc April 1, 2026 21:57 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

src/inference_endpoint/config/schema.py Show resolved Hide resolved

gemini-code-assist bot reviewed Apr 1, 2026

View reviewed changes

arekay-nv approved these changes Apr 3, 2026

View reviewed changes

github-code-quality bot found potential problems Apr 3, 2026

View reviewed changes

tests/integration/commands/test_cli.py Fixed Show fixed Hide fixed

viraatc force-pushed the feat/viraatc-fix1 branch from 90fe9c8 to 80a79ef Compare April 3, 2026 10:56

Copilot AI review requested due to automatic review settings April 3, 2026 10:56

Copilot started reviewing on behalf of viraatc April 3, 2026 10:56 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

.github/workflows/test.yml Outdated Show resolved Hide resolved

.github/workflows/test.yml Outdated Show resolved Hide resolved

.github/workflows/test.yml Outdated Show resolved Hide resolved

github-code-quality bot found potential problems Apr 3, 2026

View reviewed changes

tests/integration/commands/test_cli.py Fixed Show fixed Hide fixed

Copilot AI review requested due to automatic review settings April 3, 2026 11:05

Copilot started reviewing on behalf of viraatc April 3, 2026 11:06 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

.github/workflows/test.yml Outdated Show resolved Hide resolved

.github/workflows/test.yml Outdated Show resolved Hide resolved

.github/workflows/test.yml Outdated Show resolved Hide resolved

Copilot AI review requested due to automatic review settings April 3, 2026 11:32

Copilot started reviewing on behalf of viraatc April 3, 2026 11:32 View session

Copilot AI reviewed Apr 3, 2026

View reviewed changes

.pre-commit-config.yaml Outdated Show resolved Hide resolved

This comment was marked as duplicate.

Sign in to view

viraatc commented Apr 3, 2026

View reviewed changes

tests/integration/commands/test_cli.py Show resolved Hide resolved

src/inference_endpoint/config/templates/offline_template.yaml Outdated Show resolved Hide resolved

.github/workflows/test.yml Show resolved Hide resolved

.pre-commit-config.yaml Show resolved Hide resolved

Copilot AI review requested due to automatic review settings April 3, 2026 12:14

Copilot started reviewing on behalf of viraatc April 3, 2026 12:15 View session

viraatc force-pushed the feat/viraatc-fix1 branch from 8915750 to ffb87d9 Compare April 3, 2026 12:16

viraatc commented Apr 3, 2026

View reviewed changes

Copilot AI reviewed Apr 3, 2026

View reviewed changes

viraatc force-pushed the feat/viraatc-fix1 branch from ffb87d9 to b781ff7 Compare April 3, 2026 12:21

update

c479056

Copilot AI review requested due to automatic review settings April 8, 2026 21:25

Copilot started reviewing on behalf of viraatc April 8, 2026 21:26 View session

viraatc and others added 4 commits April 8, 2026 14:28

fix: restore eval/submission in init command

82dd0b1

Falls back to handwritten _template.yaml when _template_full.yaml doesn't exist (eval, submission). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: handle eval/submission templates explicitly in init

7711dd4

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: add TODO for auto-generating eval/submission templates

d117d66

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

update

f4e308b

Copilot AI reviewed Apr 8, 2026

View reviewed changes

viraatc and others added 2 commits April 8, 2026 14:32

update

56be519

Copilot AI review requested due to automatic review settings April 8, 2026 21:35

Copilot started reviewing on behalf of viraatc April 8, 2026 21:36 View session

Update

c77cc1b

Copilot AI reviewed Apr 8, 2026

View reviewed changes

viraatc and others added 2 commits April 8, 2026 14:39

fix: pre-commit regex to match template files, not just directory

0766eca

templates/ → templates/.* so the regex matches actual file paths. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chore: init uses model_dump(exclude_none=True) instead of copying tem…

348cc54

…plates Generates cleaner YAML — all defaults present, no null noise. eval/submission still copy handwritten templates (schema not ready). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings April 8, 2026 21:55

Copilot started reviewing on behalf of viraatc April 8, 2026 21:56 View session

Copilot AI reviewed Apr 8, 2026

View reviewed changes

viraatc and others added 2 commits April 8, 2026 15:13

chore: remove unused _ensure_trailing_newline

ef9e868

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings April 8, 2026 22:13

Copilot started reviewing on behalf of viraatc April 8, 2026 22:14 View session

chore: remove CONFIG_REFERENCE.md generator (not ready for merge)

3feeddc

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot AI reviewed Apr 8, 2026

View reviewed changes

viraatc and others added 2 commits April 8, 2026 15:18

Copilot AI review requested due to automatic review settings April 9, 2026 00:28

Copilot started reviewing on behalf of viraatc April 9, 2026 00:28 View session

Copilot AI reviewed Apr 9, 2026

View reviewed changes

	name = overrides.pop("name", None) or f"{test_type.value}_benchmark"
	name = overrides.get("name", None) or f"{test_type.value}_benchmark"

		output_path = f"{template_type}_template.yaml"
		output_file = Path(output_path)

	template: Template type (offline, online, concurrency).
	template: Template type (offline, online, concurrency, eval, submission).

	# Generate config template (all fields with defaults)
	# Generate config template (common defaults; fields with null/None defaults may be omitted)

Conversation

viraatc commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Templates

Init command

Tests

Docs updated

Test plan

Uh oh!

github-actions bot commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

arekay-nv left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

This comment was marked as duplicate.

Uh oh!

viraatc left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

viraatc commented Apr 3, 2026

Review Council — Multi-AI Code Review

Uh oh!

viraatc left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

viraatc Apr 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nvzhihanj Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

arekay-nv Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

viraatc commented Apr 1, 2026 •

edited

Loading

github-actions bot commented Apr 1, 2026 •

edited

Loading

viraatc left a comment •

edited

Loading

viraatc left a comment •

edited

Loading

viraatc Apr 3, 2026 •

edited

Loading