Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[meta] Pending image downloads of deleted containers are never interrupted, leading to possible DoS #3088

Open
rgaiacs opened this issue Sep 4, 2024 · 0 comments
Assignees

Comments

@rgaiacs
Copy link
Collaborator

rgaiacs commented Sep 4, 2024

Related to #3056

The behaviour observed with GESIS server of many pending pods like in

Screenshot 2024-09-04 at 13-52-52 Pod Activity - Starred - Grafana

that is also observed with OVH server like in

Screenshot 2024-09-04 at 13-54-01 Pod Activity - Starred - Grafana

matches the description of issue reported upstream kubernetes/kubernetes#122905:

When an user deploys a pod, the deployed container(s) may rely on image(s) that are not present on the target node, triggering a download by the container runtime. However, if that pod is deleted (either with or without the --force flag) before the container runtime completes the download of the image, this download will be carried on anyway. Kubernetes tracks no information at all about the download, and there is no way of interrupting it unless the container runtime is forcibly terminated and recreated.

This (useless, as the images will never be used) sequence of downloads will waste the node’s CPU and network bandwidth and block the download queue, preventing the deployment of other containers with images yet to be downloaded. We experimented on a wide variety of settings, including but not limited to private clouds, public clouds and edge devices, with different container runtimes (containerd and CRI-O) and different Kubernetes providers (kubeadm, microk8s, and GKE).

@sgibson91 @manics @minrk @yuvipanda @arnim what do you think of us to drop the timeout that JupyterHub or BinderHub has and the automatic relaunch? This should reduce the number of pending pods.

@rgaiacs rgaiacs self-assigned this Sep 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant