Add readinessProbe/minReadySeconds to kube-router #4420

twz123 · 2024-05-15T17:07:22Z

Description

This allows for better feedback of kube-router health via the DaemonSet resource. Without those, it's possible to observe a "healthy" DaemonSet, even if it's not. This affects e.g. rolling updates, and, most notably k0s's own integration tests.

See:

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update

How Has This Been Tested?

Manual test
Auto test added

Checklist:

This allows for better feedback of kube-router health via the DaemonSet resource. Without those, it's possible to observe a "healthy" DaemonSet, even if it's not. This affects e.g. rolling updates, and, most notably k0s's own integration tests. Signed-off-by: Tom Wieczorek <twieczorek@mirantis.com>

jnummelin · 2024-05-16T10:35:29Z

pkg/component/controller/kuberouter.go

-            port: 20244
-          initialDelaySeconds: 10
+            port: healthz
+          initialDelaySeconds: 300


Seems bit excessive? What's the reasoning for such a long delay?

In my experience, liveness probes are only helpful in very few cases. Forcefully restarting a container over and over again is usually not helping much and will just increase churn/load on a cluster that is probably already busy with other things that lead to healthz answering with non-2xx responses. That's why I prefer high timeouts here. Henning wrote a blog post about this back in the day.

The basic difference here is that an app usually reports failures that it might be able to recover from by itself via the readiness endpoint. Restarting the app won't help and might even worse the situation in such a case. Unrecoverable errors should make an app terminate itself. This leaves the liveness probe to detect situations in which the app itself is broken due to things like deadlocks, tight endless loops, blocked on system calls that usually don't block for too long.

k0s-bot · 2024-05-22T18:57:21Z

Successfully created backport PR for release-1.30:

[Backport release-1.30] Add readinessProbe/minReadySeconds to kube-router #4471

twz123 added bug Something isn't working component/kube-router labels May 15, 2024

twz123 mentioned this pull request May 15, 2024

Change kine metrics port from 8080 to 2380 #4421

Merged

16 tasks

twz123 added the backport/release-1.30 PR that needs to be backported/cherrypicked to the release-1.30 branch label May 15, 2024

twz123 mentioned this pull request May 15, 2024

kube-router failed to start when installation with default settings #4411

Closed

4 tasks

k0s-bot mentioned this pull request May 16, 2024

[Backport release-1.30] Change kine metrics port from 8080 to 2380 #4423

Merged

twz123 force-pushed the kube-router-readinessprobe branch from 3180ab6 to 65fcf39 Compare May 16, 2024 08:56

twz123 marked this pull request as ready for review May 16, 2024 10:31

twz123 requested a review from a team as a code owner May 16, 2024 10:31

twz123 requested review from makhov and jnummelin May 16, 2024 10:31

jnummelin reviewed May 16, 2024

View reviewed changes

twz123 added backport/release-1.30 PR that needs to be backported/cherrypicked to the release-1.30 branch and removed backport/release-1.30 PR that needs to be backported/cherrypicked to the release-1.30 branch labels May 16, 2024

jnummelin approved these changes May 17, 2024

View reviewed changes

twz123 merged commit 3cc1ed9 into k0sproject:main May 22, 2024
77 checks passed

twz123 deleted the kube-router-readinessprobe branch May 22, 2024 18:56

k0s-bot mentioned this pull request May 22, 2024

[Backport release-1.30] Add readinessProbe/minReadySeconds to kube-router #4471

Merged

twz123 mentioned this pull request Sep 11, 2024

Remove minReadySeconds from kube-router DaemonSet #4957

Merged

16 tasks

k0s-bot mentioned this pull request Sep 16, 2024

[Backport release-1.30] Remove minReadySeconds from kube-router DaemonSet #4977

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add readinessProbe/minReadySeconds to kube-router #4420

Add readinessProbe/minReadySeconds to kube-router #4420

twz123 commented May 15, 2024 •

edited

Loading

jnummelin May 16, 2024

twz123 May 16, 2024

twz123 May 16, 2024

jnummelin May 17, 2024

k0s-bot commented May 22, 2024

Add readinessProbe/minReadySeconds to kube-router #4420

Add readinessProbe/minReadySeconds to kube-router #4420

Conversation

twz123 commented May 15, 2024 • edited Loading

Description

Type of change

How Has This Been Tested?

Checklist:

jnummelin May 16, 2024

Choose a reason for hiding this comment

twz123 May 16, 2024

Choose a reason for hiding this comment

twz123 May 16, 2024

Choose a reason for hiding this comment

jnummelin May 17, 2024

Choose a reason for hiding this comment

k0s-bot commented May 22, 2024

twz123 commented May 15, 2024 •

edited

Loading