generated from kubernetes/kubernetes-template-project
-
Notifications
You must be signed in to change notification settings - Fork 180
Open
Labels
triage/acceptedIndicates an issue or PR is ready to be actively worked on.Indicates an issue or PR is ready to be actively worked on.
Milestone
Description
What would you like to be added:
We should provide guidance on upgrading/downgrading EPP versions to minimize downtime. Depending on the tradeoff between risk and speed, I imagine there can be different upgrade strategies:
- Simply do a rolling update of the EPP deployment. With the active-passive HA mode EPP HA deployment #692, a minimal disruption to traffic may be expected when switching the leader.
- Blue-green upgrade. Deploy a new EPP deployment, verify it works, and update the ExtensionRef.
- Create a new InferencePool (still pointing to the same model servers) with new EPP, and configure HTTPRoutes to split traffic between new and old inference pools.
Why is this needed:
Metadata
Metadata
Assignees
Labels
triage/acceptedIndicates an issue or PR is ready to be actively worked on.Indicates an issue or PR is ready to be actively worked on.