Skip to content

EPP upgrade/downgrade guide #693

@liu-cong

Description

@liu-cong

What would you like to be added:

We should provide guidance on upgrading/downgrading EPP versions to minimize downtime. Depending on the tradeoff between risk and speed, I imagine there can be different upgrade strategies:

  1. Simply do a rolling update of the EPP deployment. With the active-passive HA mode EPP HA deployment #692, a minimal disruption to traffic may be expected when switching the leader.
  2. Blue-green upgrade. Deploy a new EPP deployment, verify it works, and update the ExtensionRef.
  3. Create a new InferencePool (still pointing to the same model servers) with new EPP, and configure HTTPRoutes to split traffic between new and old inference pools.

Why is this needed:

Metadata

Metadata

Assignees

No one assigned

    Labels

    triage/acceptedIndicates an issue or PR is ready to be actively worked on.

    Type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions