Skip to content

Improve inter-replica fairness for real-time APIs #1240

Open
@deliahu

Description

@deliahu

Description

Currently requests are assigned to replicas at random. A smarter approach would be to assign based on least recently accessed (i.e. strict ordering), smallest queue size, or something similar.

Notes

Istio's destination rules might be relevant for this, e.g. something like:

apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
  name: api-iris-classifier
spec:
  host: api-iris-classifier.default.svc.cluster.local
  trafficPolicy:
    loadBalancer:
      simple: LEAST_CONN  # or ROUND_ROBIN

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions