Skip to content

Conversation

konflux-internal-p02[bot]
Copy link

@konflux-internal-p02 konflux-internal-p02 bot commented Jul 16, 2025

This PR contains the following updates:

Package Type Update Change
kubernetes-sigs/gateway-api-inference-extension Kustomization minor v0.3.0 -> v0.4.0

Release Notes

kubernetes-sigs/gateway-api-inference-extension (kubernetes-sigs/gateway-api-inference-extension)

v0.4.0

Compare Source

Overview

We are thrilled to announce the v0.4.0 release—our biggest update yet! This version brings powerful new Endpoint Picker (EPP) scheduler capabilities, performance improvements, and initial Gateway conformance tests.

Major Highlights

  • Modular Endpoint Picker (EPP) Scheduler: A kube-scheduler–style plugin API lets you build custom routing logic,
    filter and score backends, or swap in new picker strategies without touching core code.

  • Prefix-Cache-Aware Routing: Dramatically lower tail latency by routing requests based on cached network prefixes,
    improving response times under load.

  • Richer Metrics: Gain deeper insights with new metrics including:

    • NTPOT (Normalized Time Per Output Token)
    • Scheduler latency
    • Per-pod queue depth
    • Build and version info
  • Optional vLLM Simulator Backend: Spin up a lightweight simulator for local development and testing—no real model
    servers required.

  • Initial Conformance Tests: Validate your controller’s behavior with end-to-end tests covering InferencePool,
    InferenceModel, HTTPRoute, and more.

What's Changed

New Contributors


Configuration

📅 Schedule: Branch creation - "after 5am on wednesday" (UTC), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

To execute skipped test pipelines write comment /ok-to-test.

This PR has been generated by MintMaker (powered by Renovate Bot).

…extension to v0.4.0

Signed-off-by: konflux-internal-p02 <170854209+konflux-internal-p02[bot]@users.noreply.github.com>
@pierDipi pierDipi closed this Oct 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant