-
Notifications
You must be signed in to change notification settings - Fork 39.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Flaking Test] [EventedPLEG] Containers Lifecycle should continue running liveness probes for restartable init containers and restart them while in preStop #127312
Comments
@hshiina this seem to be the same problem of #123087 static pod when EventedPLEG is enabled.)
|
As far as I saw the log, containers does not look to have been recreated. If I understand correctly, this test works like:
I'm afraid I'm not sure what is expected to guarantee the liveness probe for the sidecar container ( |
Due to #124297 which was recently merged, another issue (#124704) appeared. Pod workers sometimes get blocked for a few seconds in kubelet like #124297 (comment). This may make something like race condition surface. |
/retitle [Flaking Test] [EventedPLEG] Containers Lifecycle should continue running liveness probes for restartable init containers and restart them while in preStop |
Marking with evented PLEG. Is the issue also hapenning outside the evented PLEG? |
I don't think this happens outside the evented PLEG. If the pod worker works slowly with blocked by #124704, the init container may not get into |
/assign @hshiina This is for alpha feature and NOT release blocking /priority backlog |
It failed in |
Which jobs are flaking?
ci-crio-cgroupv1-evented-pleg
Which tests are flaking?
E2eNode Suite.[It] [sig-node] [NodeConformance] Containers Lifecycle when a pod is terminating because its liveness probe fails should continue running liveness probes for restartable init containers and restart them while in preStop [NodeConformance]
Since when has it been flaking?
8/24
https://storage.googleapis.com/k8s-triage/index.html?date=2024-09-12&job=ci-crio-cgroupv1-evented-pleg&test=%20Containers%20Lifecycle%20when%20a%20pod%20is%20terminating%20because%20its%20liveness%20probe%20fails%20should%20continue%20running%20liveness%20probes%20for%20restartable%20init%20containers%20and%20restart%20them%20while%20in%20preStop%20
Testgrid link
https://testgrid.k8s.io/sig-node-cri-o#ci-crio-cgroupv1-evented-pleg
Reason for failure (if possible)
Anything else we need to know?
No response
Relevant SIG(s)
/sig node
The text was updated successfully, but these errors were encountered: