choose switchover candidate based on lag and role #1700

FxKu · 2021-11-23T14:12:06Z

Change 1: When attempting a switchover, try it only on a replica that has no replication lag. If synchronous replication is enabled, there will be a replica with the sync_standby role, which should be chosen for the switchover instead.

Change 2: If the switchover is not executed or fails, return an error instead of emitting a warning. Previously we always continued by replacing the master pod anyway, which produced downtime. Now the master keeps running on an outdated pod. The pod will still have the rolling-update flag, so the operator will notice it on the next sync and try another failover. It may be that the failover just took longer, e.g. because taking a CHECKPOINT took longer than the Patroni API waits before responding to the user. In this ideal case only a replica would be recreated. In the worst case, a switchover might be retried on each sync, but that should be fixed by the admin anyway.

fixes #1686
fixes #109
fixes #600

pkg/cluster/pod.go

Jan-M · 2021-12-02T13:37:28Z

Sounds good, let's make it the one with the least lag, not with 0 lag.
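A rough sketch of what "sync_standby first, otherwise least lag" could look like. The `member` type and `pickSwitchoverCandidate` are hypothetical names for this example, and filtering on a `running` state is an assumption of the sketch rather than something stated in this thread.

```go
package main

import (
	"errors"
	"sort"
)

// member is a hypothetical, trimmed-down view of one cluster member.
type member struct {
	Name  string
	Role  string // e.g. "leader", "replica", "sync_standby"
	State string // e.g. "running" (assumed value)
	Lag   int    // replication lag as reported for the member
}

// pickSwitchoverCandidate prefers running sync_standby members. If there
// are none, it falls back to running replicas. Within the chosen group the
// member with the least lag wins.
func pickSwitchoverCandidate(members []member) (string, error) {
	var syncs, replicas []member
	for _, m := range members {
		if m.State != "running" {
			continue
		}
		switch m.Role {
		case "sync_standby":
			syncs = append(syncs, m)
		case "replica":
			replicas = append(replicas, m)
		}
	}
	candidates := syncs
	if len(candidates) == 0 {
		candidates = replicas
	}
	if len(candidates) == 0 {
		return "", errors.New("no switchover candidate found")
	}
	// Stable sort keeps the original order among members with equal lag.
	sort.SliceStable(candidates, func(i, j int) bool {
		return candidates[i].Lag < candidates[j].Lag
	})
	return candidates[0].Name, nil
}
```

Sorting by lag instead of requiring a lag of exactly 0 means a candidate is still found when every replica is slightly behind.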

sdudoladov · 2021-12-10T16:07:50Z

pkg/cluster/pod.go

+		return spec.NamespacedName{Namespace: master.Namespace, Name: candidates[0].Name}, nil
+	}
+
+	return spec.NamespacedName{}, fmt.Errorf("no switchover candidate found")


Why does it return the empty spec.NamespacedName{} and not nil in case of an error?

Go says: "cannot use nil (untyped nil value) as spec.NamespacedName value in return statement" 😃
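The compiler message follows from the return type being a struct value rather than a pointer or interface. A small sketch of the pattern, using a locally defined `namespacedName` stand-in (not the operator's `spec.NamespacedName`) and a hypothetical `firstCandidate` helper:

```go
package main

import "errors"

// namespacedName is a stand-in with the same shape as a namespace/name pair.
type namespacedName struct {
	Namespace string
	Name      string
}

// firstCandidate returns the first name wrapped in a namespacedName.
// Because the return type is a struct value, nil is not assignable to it;
// the zero value is returned together with a non-nil error instead.
func firstCandidate(namespace string, names []string) (namespacedName, error) {
	if len(names) == 0 {
		return namespacedName{}, errors.New("no switchover candidate found")
	}
	return namespacedName{Namespace: namespace, Name: names[0]}, nil
}
```

Callers are expected to check the error first and ignore the zero value when it is non-nil. Returning a pointer would allow nil, at the cost of a nil check at every call site.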

sdudoladov · 2021-12-10T16:15:42Z

pkg/cluster/types.go

+	Replica       PostgresRole = "replica"
+	Leader        PostgresRole = "leader"
+	StandbyLeader PostgresRole = "standby_leader"
+	SyncStandby   PostgresRole = "sync_standby"


I feel there is a comment missing here:

How does the operator treat Master and StandbyLeader on the one hand and Leader on the other differently? Both Master and StandbyLeader are leaders.

SyncStandby is still a Replica.

While it is still possible to figure this out from the code, a short comment explaining why these are not completely independent states would be worthwhile.

master and replica are Spilo roles. The other ones are role names returned by Patroni's cluster endpoint. Added comments.

sdudoladov · 2021-12-10T16:21:57Z

pkg/util/patroni/patroni.go

+	Role     string `json:"role"`
+	State    string `json:"state"`
+	Timeline int    `json:"timeline"`
+	LagInMB  int    `json:"lag"`


Does Patroni always return the lag in MB and not simply in bytes?
For example, patronictl converts bytes to MB.

Renamed to just Lag.

Hm, maybe lag is reserved for something else:

cannot unmarshal string into Go struct field ClusterMember.members.lag of type int

Renamed it back, since I think it's always in MB.
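The unmarshal error quoted above indicates that the `lag` field arrived as a JSON string at least once. One way to tolerate that, sketched here under the assumption that the field is either an integer or some placeholder string, is a custom `UnmarshalJSON`. The `lagValue`, `clusterMember` and `decodeMembers` names are hypothetical and not taken from the PR.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// lagValue holds the lag if it was numeric; Known is false otherwise.
type lagValue struct {
	Value int
	Known bool
}

// UnmarshalJSON accepts either a JSON integer or a JSON string.
// A string is treated as "lag not known" rather than as a decoding error.
func (l *lagValue) UnmarshalJSON(data []byte) error {
	var n int
	if err := json.Unmarshal(data, &n); err == nil {
		l.Value, l.Known = n, true
		return nil
	}
	var s string
	if err := json.Unmarshal(data, &s); err != nil {
		return fmt.Errorf("lag is neither an integer nor a string: %s", data)
	}
	l.Value, l.Known = 0, false
	return nil
}

// clusterMember is a reduced member record for this sketch.
type clusterMember struct {
	Name string   `json:"name"`
	Role string   `json:"role"`
	Lag  lagValue `json:"lag"`
}

// decodeMembers parses a payload of the form {"members": [...]}.
func decodeMembers(payload []byte) ([]clusterMember, error) {
	var out struct {
		Members []clusterMember `json:"members"`
	}
	if err := json.Unmarshal(payload, &out); err != nil {
		return nil, err
	}
	return out.Members, nil
}
```

With this, a member whose lag is not numeric can simply be skipped during candidate selection instead of failing the whole decode. The unit question (MB versus bytes) is independent of this and is not settled by the sketch.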

FxKu · 2021-12-13T11:15:28Z

👍

Jan-M · 2021-12-14T09:33:41Z

👍

choose switchover candidate based on lag and role

92b2e9d

FxKu requested review from CyberDem0n, Jan-M, RafiaSabih, erthalion and sdudoladov as code owners November 23, 2021 14:12

FxKu added this to the 1.8 milestone Nov 23, 2021

FxKu commented Nov 24, 2021

pkg/cluster/pod.go (outdated, resolved)

Update pkg/cluster/pod.go

87d50d2

FxKu added the zalando label Nov 25, 2021

FxKu added 4 commits December 2, 2021 14:44

Merge branch 'master' into switchover-return-err

d784c96

choose switchover candidate based on lowest lag in MB

4f87238

resolve conflict

b47b826

add comments to better explain switchover logic

3418be5

sdudoladov reviewed Dec 10, 2021

FxKu added 3 commits December 10, 2021 17:33

reflect review

eede7cf

Merge branch 'master' into switchover-return-err

1b0d51a

rename back to LagInMB

f29c2f8

FxKu merged commit 07fd4ec into master Dec 14, 2021

This was referenced Feb 1, 2022

DB pod in pending state --> Eventually all 3 DB pods are in pending states ( down ). "Rolling upgrade" #1765

Closed

do not recreate pods if previous Patroni API calls fail #1767

Merged

gdemonet mentioned this pull request Aug 2, 2022

Switchover (during a Node drain) fails randomly in synchronous mode #1983

Closed
