
feat: startup and readiness probes for replicas#6623

Merged
gbartolini merged 15 commits into cloudnative-pg:main from leonardoce:readiness
Mar 10, 2025
Conversation

@leonardoce
Contributor

@leonardoce leonardoce commented Jan 18, 2025

Extend the startup and readiness probes configured through the .spec.probes.startup and .spec.probes.readiness sections by adding two additional parameters:

  • type: Defines the criteria for considering the probe successful. Accepted values include:
    • pg_isready: This setting marks the probe as successful when the pg_isready command exits with a status of 0. This is the default for both primary instances and replicas.
    • query: This setting marks the probe as successful when a basic query is successfully executed locally on the postgres database.
    • streaming: This setting marks the probe as successful when the replica starts streaming from its source and meets the specified lag requirements (details below).
  • lag: Specifies the maximum acceptable replication lag, measured in bytes (expressed using Kubernetes quantities). This parameter is applicable only when type is set to streaming. If the lag parameter is not specified, the replica is considered successfully started/ready as soon as it begins streaming.

Consequently, the liveness probe has been streamlined to verify solely that the instance manager is operational, without monitoring the underlying PostgreSQL instance.
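Putting the two parameters together, a Cluster manifest using them could look like the following sketch. This is illustrative only: the field names (`type`, `lag`) follow the description above, and the storage size and lag threshold are arbitrary values.

```yaml
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
  name: cluster-example
spec:
  instances: 3
  storage:
    size: 1Gi
  probes:
    # A replica is considered started only once it is streaming
    # from its source and within 32Mi of replication lag.
    startup:
      type: streaming
      lag: 32Mi
    # Readiness succeeds when a basic query runs locally
    # on the postgres database.
    readiness:
      type: query
```

With this configuration, a rejoining replica stays out of the ready set while it replays its WAL backlog, instead of being marked ready as soon as `pg_isready` succeeds.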

Closes: #6621

Release Notes

Improved Startup and Readiness Probes for Replicas: Enhanced support for Kubernetes startup and readiness probes in PostgreSQL instances, providing greater control over replicas based on the streaming lag.

@cnpg-bot cnpg-bot added backport-requested ◀️ This pull request should be backported to all supported releases release-1.22 release-1.24 release-1.25 labels Jan 18, 2025
@github-actions
Contributor

❗ By default, the pull request is configured to backport to all release branches.

  • To stop backporting this PR, remove the label backport-requested ◀️ or add the label do not backport
  • To stop backporting this PR to a certain release branch, remove the specific branch label: release-x.y

@leonardoce leonardoce force-pushed the readiness branch 5 times, most recently from 3df3ab1 to e62c4be Compare January 18, 2025 13:12
@leonardoce leonardoce force-pushed the readiness branch 3 times, most recently from ba89383 to c756b27 Compare January 19, 2025 17:58
@gbartolini gbartolini added do not backport This PR must not be backported - it will be in the next minor release and removed backport-requested ◀️ This pull request should be backported to all supported releases release-1.22 release-1.24 release-1.25 labels Jan 21, 2025
@gbartolini gbartolini changed the title feat: startup probe configuration feat: startup probe for replicas Jan 21, 2025
@ardentperf
Contributor

ardentperf commented Jan 22, 2025

IIUC, when replicas re-join they need to catch up on WAL - which has two effects:

  1. clients can connect as soon as the readiness probe succeeds and during catch-up their queries may get results that lag more than usual.
  2. the impetus for this PR: with preferred DataDurability (new feature in 1.25) all writes on the primary will completely hang during catch-up. the primary needs to write WAL synchronously and the latest WAL won't get acknowledged by the replica until it's caught up, leading to the hang on the primary.

+1 using a startup probe seems like a good idea

if someone is choosing preferred dataDurability then i don't think they would ever want the primary to hang. as a user, my hope is that replicas (re)joining a cluster should be as seamless as possible. a primary database hanging even briefly can be impactful on a large workload. I’m not sure we’d ever want a replica to be considered ready as soon as it starts streaming, when dataDurability is preferred. i suspect that users who choose preferred are specifically doing it because they want high availability.

my initial thought is that this should default to a small number of bytes and might not need to be configurable at all, unless as a setting for debugging or troubleshooting.

this would result in a slightly longer delay for replicas becoming ready even with dataDurability=required however it also means we eliminate the period on startup of higher-than-usual lag, which doesn't seem like a bad idea to me.

note: what i've written above is based on my understanding but i have not had time to test or verify it, so there might be mistakes. i haven't looked yet, but it could also be interesting to check how patroni approaches this.
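The combination discussed in this comment can be sketched as follows. This assumes the `.spec.postgresql.synchronous` stanza with its `dataDurability` field (the 1.25 feature mentioned above) together with the probe fields described in this PR; the values are illustrative, not a recommended configuration.

```yaml
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
  name: sync-cluster
spec:
  instances: 3
  storage:
    size: 1Gi
  postgresql:
    synchronous:
      method: any
      number: 1
      # "preferred" relaxes synchronous replication when no replica
      # is available, which is why a replica that becomes "ready"
      # while still catching up can stall writes on the primary.
      dataDurability: preferred
  probes:
    startup:
      # Keep a rejoining replica out of the ready set until it has
      # nearly caught up, avoiding the write stall described above.
      type: streaming
      lag: 1Mi
```

The small lag threshold here follows the suggestion in this comment that the replica should be close to caught up before it is considered ready.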

@gbartolini
Contributor

IIUC, when replicas re-join they need to catch up on WAL - which has two effects:

  1. clients can connect as soon as the readiness probe succeeds and during catch-up their queries may get results that lag more than usual.
  2. the impetus for this PR: with preferred DataDurability (new feature in 1.25) all writes on the primary will completely hang during catch-up. the primary needs to write WAL synchronously and the latest WAL won't get acknowledged by the replica until it's caught up, leading to the hang on the primary.

That's correct.

@leonardoce
Contributor Author

@ardentperf
Contributor

Is this PR in patroni solving a similar problem?

patroni/patroni#1786

@ardentperf
Contributor

ardentperf commented Jan 24, 2025

interesting - from this recent pgcon talk it sounds like actually patroni might even still have the pause

https://youtu.be/CWrFPPG5USA?feature=shared&t=1190

i think in the talk he's focused on a 3rd node rejoining synchronous_standby_names with only 2 nodes required (so it doesn't cause the hang) - so i don't know for sure if there's any "special case" treatment for a synchronous cluster with only 2 nodes.

@leonardoce
Contributor Author

interesting - from this recent pgcon talk it sounds like actually patroni might even still have the pause [...]

Yes. I think there is no solution to this problem, only compromises.
Having the cluster paused reduces the time needed for the replica to get in sync.
If the cluster is not paused, the replica will need more time.
It may not even be able to get in sync again - depending on the workload of the primary.

leonardoce and others added 13 commits March 10, 2025 11:53
Allow the user to configure the behavior of the startup probe, chosen
from the following list:

1. pg_isready
2. streaming, with optional lag limit

Closes: cloudnative-pg#6621

Signed-off-by: Leonardo Cecchi <leonardo.cecchi@enterprisedb.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Signed-off-by: Leonardo Cecchi <leonardo.cecchi@enterprisedb.com>
Signed-off-by: Leonardo Cecchi <leonardo.cecchi@gmail.com>
Signed-off-by: Leonardo Cecchi <leonardo.cecchi@gmail.com>
Signed-off-by: Leonardo Cecchi <leonardo.cecchi@enterprisedb.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Signed-off-by: Leonardo Cecchi <leonardo.cecchi@enterprisedb.com>
Signed-off-by: Armando Ruocco <armando.ruocco@enterprisedb.com>
Signed-off-by: Armando Ruocco <armando.ruocco@enterprisedb.com>
Signed-off-by: Armando Ruocco <armando.ruocco@enterprisedb.com>
Signed-off-by: Niccolò Fei <niccolo.fei@enterprisedb.com>
Signed-off-by: Niccolò Fei <niccolo.fei@enterprisedb.com>
Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>
@mnencia
Member

mnencia commented Mar 10, 2025

/test d=main tl=4

@github-actions
Contributor

@mnencia, here's the link to the E2E on CNPG workflow run: https://github.com/cloudnative-pg/cloudnative-pg/actions/runs/13763633508

Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
@gbartolini gbartolini merged commit 3bafb17 into cloudnative-pg:main Mar 10, 2025
23 checks passed
l00ptr added a commit to l00ptr/cloudnative-pg that referenced this pull request Jul 11, 2025
When upgrading to 1.26 from a previous version, PostgreSQL clusters are
restarted (even with in-place updates enabled) due to a change in the
startup probe definition.

This issue appears to be a side effect of the improvements made to the
startup probe:
cloudnative-pg#6623
l00ptr added a commit to l00ptr/cloudnative-pg that referenced this pull request Jul 11, 2025
When upgrading to 1.26 from a previous version, PostgreSQL clusters are
restarted (even with in-place updates enabled) due to a change in the
startup probe definition.

This issue appears to be a side effect of the improvements made to the
startup probe:
cloudnative-pg#6623

Signed-off-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
sxd pushed a commit to l00ptr/cloudnative-pg that referenced this pull request Jul 14, 2025
When upgrading to 1.26 from a previous version, PostgreSQL clusters are
restarted (even with in-place updates enabled) due to a change in the
startup probe definition.

This issue appears to be a side effect of the improvements made to the
startup probe:
cloudnative-pg#6623

Signed-off-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
gbartolini pushed a commit to l00ptr/cloudnative-pg that referenced this pull request Jul 17, 2025
When upgrading to 1.26 from a previous version, PostgreSQL clusters are
restarted (even with in-place updates enabled) due to a change in the
startup probe definition.

This issue appears to be a side effect of the improvements made to the
startup probe:
cloudnative-pg#6623

Signed-off-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
gbartolini added a commit that referenced this pull request Jul 17, 2025
…8018)

When upgrading from a previous version to 1.26, PostgreSQL clusters will
be restarted even with in-place updates enabled, due to changes in the
Startup probe definition (PR #6623).

Closes #7727

Signed-off-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Co-authored-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
mnencia pushed a commit that referenced this pull request Jul 18, 2025
…8018)

When upgrading from a previous version to 1.26, PostgreSQL clusters will
be restarted even with in-place updates enabled, due to changes in the
Startup probe definition (PR #6623).

Closes #7727

Signed-off-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Co-authored-by: Julian Vanden Broeck <julian.vandenbroeck@dalibo.com>
Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
(cherry picked from commit 48ddea1)

Labels

do not backport: This PR must not be backported - it will be in the next minor release
ok to merge 👌: This PR can be merged

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Feature]: Startup and readiness probes for replicas

8 participants