-
Notifications
You must be signed in to change notification settings - Fork 774
Description
GitHub Username
Organization you are requesting membership in
kubernetes
Requirements
- I have reviewed the community membership guidelines
- I have enabled 2FA on my GitHub account
- I have subscribed to the kubernetes dev e-mail list
- I am actively contributing to 1 or more Kubernetes subprojects
- I have two sponsors that meet the sponsor requirements listed in the community membership guidelines
- I have spoken to my sponsors ahead of this application, and they have agreed to sponsor my application
- I have verified that my sponsors are from different member companies
- I have verified that my sponsors are a reviewer or an approver in at least one OWNERS file within one of the Kubernetes GitHub organizations (excluding the contributor-playground)
- SUGGESTED: Ensure your affiliation in gitdm is up to date (gitdm is used by devstats to track affiliation)
- SUGGESTED: Ensure your affiliation in openprofile.dev is up to date (used by LFX Insights to track affiliation, will replace gitdm in the future)
- SUGGESTED: I have taken the Inclusive Open Source Community Orientation course
Sponsor 1
Sponsor 2
I would like to request membership to help chair the new Checkpoint/Restore WG together with Viktoria Spišaková and Adrian Reber. Over the past few years, we have been working on enabling the Checkpoint/Restore functionality in Kubernetes, and after our KubeCon talk in London earlier this year, several people in the community reached out to express interest in contributing to our discussions and asked us to create a working group.
List of contributions to the Kubernetes project
Contributions within the Kubernetes orgs:
One of my direct contributions has been with the functionality to restore containers from a checkpoint. I have implemented the initial prototype of this functionality in Podman, and later we enabled this with CRI-O and containerd. This has been included in Adrian's KEP and blog post:
- Add blog post about forensic container checkpointing website#37412
- https://kubernetes.io/blog/2022/12/05/forensic-container-checkpointing-alpha/
- Forensic Container Checkpointing enhancements#2008
The following is a list with some of my contributions related to this topic:
- https://github.com/containerd/containerd/pulls?q=rst0git
- https://github.com/cri-o/cri-o/pulls?q=rst0git
- https://github.com/containers/podman/pulls?q=rst0git
- https://github.com/checkpoint-restore/criu/pulls?q=rst0git
- https://github.com/opencontainers/runc/pulls?q=rst0git
- https://github.com/containers/crun/pulls?q=rst0git
Conferences talks / papers related to this topic:
- Forensic Analysis of Container Checkpoints, DevConf.cz 2023
- Protecting Sensitive Data in Container Checkpoints, Linux Plumbers Conference 2023
- Enabling Coordinated Checkpointing for Distributed HPC Applications, KubeCon 2024
- End-to-End Encryption for Container Checkpointing in Kubernetes, CloudNativeSecurityCon 2024
- Towards Efficient End-to-End Encryption for Container Checkpointing Systems, Asia-Pacific Workshop on Systems 2024
- Checkpoint Coordination for Distributed Containerized Applications, Linux Plumbers Conference 2024
- Optimizing Resource Utilization for Interactive GPU Workloads with Transparent Container Checkpointing, FOSDEM 2025
- CRIUgpu: Transparent Checkpointing of GPU-Accelerated Workloads, Pre-print published on arXiv 2025
- Efficient Transparent Checkpointing of AI/ML Workloads in Kubernetes, KubeCon 2025
- Kubernetes Scheduling with Checkpoint/Restore: Challenges & Open Problems, JSSPP Workshop @ IPDPS 2025
- Transparent Hot-Swapping of Containerized AI/ML Workloads, High Performance Container Workshop @ ISC-HPC 2025
Other related projects to which I have contributed and remain actively involved include:
- Extending CRIU with plugins to enable GPU checkpointing capabilities in Kubernetes.
- checkpointctl -- a tool for forensic analysis of container checkpoints.
- checkpoint-restore-operator -- a Kubernetes operator for managing checkpoints.
- criu-coordinator -- a tool for coordinated checkpoint/restore of distributed applications
- go-criu -- Go Bindings for CRIU