KEP-3085: Add KEP for Pod network ready condition #3087
Conversation
I think this would be useful since it can be hard to determine why pods are not running if there are errors during `CreatePodSandbox` - kubernetes/kubernetes#104635
/assign
Some more info here: kubernetes/kubernetes#105984
It sounds reasonable and useful; a few minor nitpicks, though.
/assign @derekwaynecarr
/cc
If we changed this condition to `PodHasNetwork`, would that satisfy your use case?
The attributes implied by a "sandbox" are vague, but saying the pod has an IP address is a clearer condition for future evolution.
In the stop-pod case, it would map cleanly, as the pod no longer has a network.
Thoughts?
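For concreteness, the suggested condition might appear in pod status roughly as follows. This is only a sketch: the condition name follows the suggestion above, and the field values are illustrative, not taken from the KEP.

```yaml
status:
  conditions:
  - type: PodHasNetwork                        # suggested name; illustrative
    status: "True"
    lastProbeTime: null
    lastTransitionTime: "2022-07-05T10:22:04Z" # illustrative timestamp
```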
> operators (especially of multi-tenant clusters) who are responsible for
> configuration and operational aspects of the various components that play a role
> in pod sandbox creation: CSI plugins, CRI runtime and associated runtime
> handlers, CNI plugins, etc. The duration between `lastTransitionTime` field of
I am trying to get clarity on when this condition is set to true, to call out what is or is not missed.
When a kubelet sees a pod that must be started:
1. It creates the pod-level cgroup.
2. It creates the `etc-hosts` file for DNS configuration.
3. It creates the data directories on the local host (emptyDir, etc.).
4. It waits for volumes to attach and mount for the associated pod.
5. It fetches the pull secret associated with the pod (if any), used to pull its container images.
6. It requests the CRI runtime to create the pod sandbox.
7. It requests the CRI to pull a container image if not already present.
8. It creates pod containers according to the pod spec.
9. It starts pod containers according to the pod spec.
If I understand the proposal, this condition would go TRUE once step 6 is satisfied.
This means any SLI/SLO held by a cluster administrator is subject to issues caused by the pod author/deployer:
- A referenced secret or configmap is not available.
- A referenced pull secret is not available.
Would you want to message those conditions differently?
> around pod initialization to their customers who launch workloads on the
> cluster.
>
> Custom pod controllers/operators can use a dedicated condition indicating
I think the only thing you can pro-actively do is delete a pod that is not having its Sandbox go ready, but the underlying reason for why it did not go ready may vary by reason (secret does not exist, configmap does not exist, pull secret is invalid, volumes cannot be attached). Let me know if I am misunderstanding.
Deleting the pod and re-creating it is absolutely necessary if a pod is not coming up - there is no avoiding that. What I was trying to allude to here is: for controllers whose custom resources specify other dependent resources (like VolumeClaimTemplates to generate a PVC that the pod would mount), the extra condition can serve as a signal for whether to also delete the dependent resources (like the PVCs that the failing pod mounts) and re-create them, versus just re-creating the pod that is failing to come up. If the new condition is reporting `true` for failed pods, pod re-creation is all that needs to be attempted. If the new condition is reporting `false`, there is most likely a deeper underlying problem - for example, PVs (bound to the PVCs spun up based on templates) are not attaching/mounting, and the controller may want to recreate those as well to get the pod to come up successfully.
The new condition enables controllers to make finer distinctions about the optimal strategy for retrying when a pod fails to come up.
> controller can leave PVCs intact and only recreate pods if sandbox creation
> completes successfully but the pod's containers fail to become ready.
>
> When a pod's sandbox no longer exists, the `status` of `SandboxReady` condition
When a pod is stopped (it has reached a terminal phase) but is not yet deleted, the sandbox is still present and past container logs can be accessed. In this state, is `SandboxReady` true or false?
As long as the sandbox is present, the new condition should report true.
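So a terminal-but-not-yet-deleted pod whose sandbox is still around might report a status roughly like this (a sketch; the condition name matches this draft of the KEP, which the PR later retitles to `PodHasNetwork`, and the values are illustrative):

```yaml
status:
  phase: Succeeded        # terminal phase; pod not yet deleted
  conditions:
  - type: SandboxReady    # name in this draft; later renamed PodHasNetwork
    status: "True"        # sandbox still present, so the condition stays true
```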
We might also want to define how to represent this condition if using a CNI that implements NetworkPolicy. That might be better left to a separate condition, though, to avoid holding up this enhancement?
Setting up the network concludes the pod "sandbox" creation process coordinated between Kubelet => CRI runtime => CNI plugins. Thus, in the context of this KEP, the new condition marks the completion of that overall process.
If we want more granular network-related conditions (which is not within the scope of this KEP), individual CNI plugins (with an API server client) can mark any necessary conditions on the pod using extended conditions.
Note that until all CNI plugins in the configured chain succeed, the CRI runtime won't return an "in-process"/intermediate outcome to Kubelet. Either all networking configuration and IP allocation succeeded (i.e. networking is fully up for the pod), or network configuration failed and the entire CRI sandbox creation + network configuration has to be retried.
@derekwaynecarr I have updated the KEP with the suggested `PodHasNetwork` condition.
/retitle KEP-3085: Add KEP for Pod network ready condition
Hope that's OK.
@ddebroy consider omitting “draft” from the PR description at this point.
Signed-off-by: Deep Debroy <ddebroy@gmail.com>
PRR looks fine; I will wait until there is SIG approval though, because I am root in this repo.
> becomes especially relevant in multi-tenant clusters where individual tenants
> own the pod specs (including the set of init containers) while the cluster
> administrators are in charge of storage plugins, networking plugins and
> container runtime handlers.
Thanks for pointing out the mismatching interpretations of the `Initialized` condition between pods with init containers and without. But the issue isn't obvious to users, since we hide the pod sandbox creation completely in the implementation. To the end user, it reads more like: all init containers have run successfully, so now it is time to start the app containers. With the newly proposed pod condition `PodHasNetwork`, shouldn't the issue surface to end users now?
With init containers: `PodHasNetwork` -> `Initialized` -> ... -> `Ready`
Without init containers: `Initialized` -> `PodHasNetwork` -> ... -> `Ready`
Can we consolidate the above flows? We could move `Initialized` after `PodHasNetwork` to indicate that Kubelet can now start any app containers.
> But the issue isn't obvious to the users since we hide the pod sandbox creation completely in the implementation. To the end user, it is more like all init containers are successfully run, now it is the time to start the app containers. With the newly proposed PodCondition: PodHasNetwork, shouldn't the issue surface to the end users now?

The inconsistency already surfaces today when CSI/CRI/CNI is unable to mount volumes or launch the pod sandbox upon encountering errors: without init containers, the pod status reports the `Initialized` condition to be true, but Kubelet also reports error events like `FailedCreatePodSandBox`, `FailedMount`, etc. against the pod (which surface as part of `kubectl describe`, etc.).

> Can we consolidate above flows? We can move Initialized after PodHasNetwork to indicate Kubelet now can start any app containers.

We can certainly consider that. I will add that to the KEP post-merge of this PR.
We had a discussion on this in sig-node on July 5th. The suggestion was to carry on with the `PodHasNetwork` condition as described in this KEP, since the ordering of pod condition transitions should not be important. We can address the situation with the `Initialized` condition for pods without init containers (mentioned above) in a separate KEP, as that feels like an independent problem.
> What is out of scope for this KEP? Listing non-goals helps to focus discussion
> and make progress.
> -->
> - Modify the meaning of the existing `Initialized` condition
Please see my above comment.
Added some more comments, but overall the KEP is well written and lgtm. To unblock progress on such a useful feature, I will approve this to meet the deadline.
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: dchen1107, ddebroy, ehashman, johnbelamaric, qiutongs
The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
/unhold |
@ddebroy - Nice feature - I just added two minor comments. If you could adjust them in a follow-up PR, that would be great.
> automations, so be extremely careful here.
> -->
>
> No changes to any default behavior should result from enabling the feature.
I don't fully agree with this - by default, each pod will get a new condition set.
So if someone is doing equality comparisons, they will start seeing changes.
> [existing SLIs/SLOs]: https://git.k8s.io/community/sig-scalability/slos/slos.md#kubernetes-slisslos
> -->
>
> No
It may potentially increase pod-startup-time (which includes reporting the state) if kubelet is starved on QPS limits.
[This is fine - I'm just pointing this out for completeness...]
One thing to note is that the Kubelet Status Manager caches all status updates, queuing and sending them to the API server in an async fashion. So the updates to the pod conditions (the existing ones as well as the new one added in this KEP) do not happen synchronously with the actual pod creation and initialization activities. Given this, do you think the above may still be a concern @wojtek-t ?
Yeah - I knew that. But still - if the conditions are not changing fast enough and individual phases take a bit of time, we may need to make more API calls. And in that case, it may increase pod startup time if kubelet is CPU-starved.
As I mentioned - I don't treat it as a concern or anything blocking - I just wanted it added for completeness.
KEP to surface a `PodHasNetwork` condition in pods to indicate when pod sandbox creation and network configuration of the pod through the CRI runtime is complete.