chore(deps): update dependency kubernetes-sigs/gateway-api-inference-extension to v0.4.0 #30
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
v0.3.0
->v0.4.0
Release Notes
kubernetes-sigs/gateway-api-inference-extension (kubernetes-sigs/gateway-api-inference-extension)
v0.4.0
Compare Source
Overview
We are thrilled to announce the v0.4.0 release—our biggest update yet! This version brings powerful new Endpoint Picker (EPP) scheduler capabilities, performance improvements, and initial Gateway conformance tests.
Major Highlights
Modular Endpoint Picker (EPP) Scheduler: A kube-scheduler–style plugin API lets you build custom routing logic,
filter and score backends, or swap in new picker strategies without touching core code.
Prefix-Cache-Aware Routing: Dramatically lower tail latency by routing requests based on cached network prefixes,
improving response times under load.
Richer Metrics: Gain deeper insights with new metrics including:
Optional vLLM Simulator Backend: Spin up a lightweight simulator for local development and testing—no real model
servers required.
Initial Conformance Tests: Validate your controller’s behavior with end-to-end tests covering InferencePool,
InferenceModel, HTTPRoute, and more.
What's Changed
002-api-proposal/
to reflectapi/v1alpha2
inferencePool and InferenceModel by @shotarok in https://github.com/kubernetes-sigs/gateway-api-inference-extension/pull/870New Contributors
Configuration
📅 Schedule: Branch creation - "after 5am on wednesday" (UTC), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
To execute skipped test pipelines write comment
/ok-to-test
.This PR has been generated by MintMaker (powered by Renovate Bot).