Senior Cloud Platform Engineer at W.W. Grainger, Inc., CNCF Golden Kubestronaut, and Oracle ACE Associate. Deep expertise in cloud-native GPU/AI infrastructure, Kubernetes ecosystems, and platform engineering. Building open-source tools for GPU workload autoscaling, observability, and topology-aware incident response.
Stats updated on 2026-05-07 13:21 UTC
Kubestronaut - One of the elite professionals who have achieved all five Kubernetes certifications from the CNCF:
- KCNA (Kubernetes and Cloud Native Associate)
- CKA (Certified Kubernetes Administrator)
- CKAD (Certified Kubernetes Application Developer)
- CKS (Certified Kubernetes Security Specialist)
- KCSA (Kubernetes and Cloud Native Security Associate)
Oracle ACE Associate - Recognized by Oracle for community advocacy, technical content creation, and knowledge sharing around Oracle products and cloud technologies.
Actively contributing to CNCF (Cloud Native Computing Foundation) and ASWF (Academy Software Foundation) projects:
| Project | Description | Contributions |
|---|---|---|
| Dragonfly | P2P-based file distribution and image acceleration | client#1665 - Add Hugging Face backend support with hf:// protocol, client#1673 - Add ModelScope backend support with modelscope:// protocol, d7y.io#386 - Add hf:// protocol documentation, d7y.io#398 - Add P2P-accelerated AI model downloads blog post, helm-charts#455 - Add injector support to helm chart, helm-charts#480 - Replace deprecated bitnamilegacy/mysql with bitnami/mysql |
| Kubernetes | Production-Grade Container Orchestration | #53891 - Document deployment.kubernetes.io/* annotations, #53892 - Add kubectl apply view-last-applied documentation |
| TiKV | Distributed transactional key-value database | #19225 - Add AGENTS.md for AI agent guidance |
| Volcano | Cloud-native batch scheduling for AI/HPC | #5095 - GPU NUMA topology awareness in scheduler, apis#229 - Add GPUInfo type to NumatopoSpec CRD, resource-exporter#12 - GPU NUMA topology discovery via sysfs |
| KEDA | Kubernetes Event-driven Autoscaling | keda-docs#1658 - Removing metricName from the kedadocs, #7538 - GPU/AI inference scaler architectural analysis |
| Metal³ | Bare metal host provisioning for Kubernetes | #624 - Fix redirect links in tryit.md |
| OpenTelemetry | Observability framework | #8632 - Add .NET troubleshooting page |
| kpt | Kubernetes-native packaging and resource management | #4278 - Fix kpt fn doc command for KRM functions expecting input |
| Project | Description | Contributions |
|---|---|---|
| OpenColorIO | Color management library | #2229 - Add release signing workflow, #2230 - Add Dependabot configuration, #2243 - Add Vulkan unit test framework |
| OpenCue | Cloud rendering management system | #2134 - Add scheduled subscription recalculation task |
| OpenImageIO | Image processing library | #4976 - Fix IBA::compare_Yee() channel access |
| RAWtoACES | RAW to ACES image conversion | #222 - Add build developer documentation |
| xSTUDIO | Playback and review application | #186 - Fix broken build guide links |
Total: 26 PRs across 15 projects in CNCF and ASWF foundations!
| Project | Description | Contributions |
|---|---|---|
| keda-gpu-scaler | KEDA External gRPC Scaler for GPU/AI workloads | |
| otel-gpu-receiver | OpenTelemetry Collector receiver for GPU metrics | NVIDIA GPU metrics via NVML, OpenTelemetry-native, Prometheus exporter, multi-GPU support |
| kube-topology-agent | K8s topology discovery & automated root-cause analysis | Knowledge graph of cluster resources, AlertManager webhook integration, GPU workload classification, blast-radius analysis |
| Golden Kubestronaut Learning | Kubernetes certification study guides and resources | #23 - Dark mode persistence, #24 - PDF generation workflow |
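For readers new to KEDA external scalers: the scaler reports a metric value and a target, and the Kubernetes Horizontal Pod Autoscaler turns them into a replica count using `ceil(currentReplicas * currentMetric / targetMetric)`. The sketch below applies that formula to a GPU utilization percentage. It is illustrative only and is not taken from keda-gpu-scaler. The function name `desiredReplicas`, its parameters, and the min/max clamping are assumptions, and the HPA's tolerance band is omitted for brevity.

```go
package main

import (
	"fmt"
	"math"
)

// desiredReplicas is a hypothetical helper that applies the HPA formula
// ceil(current * currentUtil / targetUtil) and clamps the result to [minR, maxR].
// The real HPA also applies a tolerance band, which this sketch leaves out.
func desiredReplicas(current int, currentUtil, targetUtil float64, minR, maxR int) int {
	// Guard against division by zero and scaling from zero replicas.
	if targetUtil <= 0 || current <= 0 {
		return minR
	}
	d := int(math.Ceil(float64(current) * currentUtil / targetUtil))
	if d < minR {
		return minR
	}
	if d > maxR {
		return maxR
	}
	return d
}

func main() {
	// Two replicas averaging 90% GPU utilization against a 60% target.
	fmt.Println(desiredReplicas(2, 90, 60, 1, 8))
}
```

Clamping inside the helper keeps the example self-contained. In a real deployment the bounds would come from the `minReplicaCount` and `maxReplicaCount` fields of the KEDA `ScaledObject`.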
I'm actively developing these open source projects and welcome contributors of all skill levels!
- KEDA GPU Scaler - KEDA External gRPC Scaler for GPU/AI workloads. Tech Stack: Go, gRPC, NVIDIA NVML, Kubernetes, Helm. Referenced in KEDA #7538.
- otel-gpu-receiver - OpenTelemetry Collector receiver for GPU metrics. Tech Stack: Go, OpenTelemetry Collector SDK, NVML.
- kube-topology-agent - K8s knowledge graph & automated root-cause analysis. Tech Stack: Go, Kubernetes API, Gorilla Mux, Helm.
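To make "blast-radius analysis" concrete, here is a minimal sketch of one common way to compute it: a breadth-first traversal over a graph of "is depended on by" edges, starting from the failing resource. It is not the kube-topology-agent implementation. The `Graph` type, the `blastRadius` function, and the resource names are all hypothetical.

```go
package main

import (
	"fmt"
	"sort"
)

// Graph is a hypothetical adjacency list: each key maps a resource to the
// resources that depend on it directly.
type Graph map[string][]string

// blastRadius returns every resource reachable from root along dependent
// edges, excluding root itself, in sorted order.
func blastRadius(g Graph, root string) []string {
	seen := map[string]bool{root: true}
	queue := []string{root}
	var affected []string
	for len(queue) > 0 {
		cur := queue[0]
		queue = queue[1:]
		for _, dep := range g[cur] {
			if !seen[dep] {
				seen[dep] = true
				affected = append(affected, dep)
				queue = append(queue, dep)
			}
		}
	}
	sort.Strings(affected)
	return affected
}

func main() {
	// Illustrative topology: a GPU node hosts two pods behind one service.
	g := Graph{
		"node/gpu-a":    {"pod/infer-1", "pod/infer-2"},
		"pod/infer-1":   {"service/infer"},
		"pod/infer-2":   {"service/infer"},
		"service/infer": {"ingress/api"},
	}
	fmt.Println(blastRadius(g, "node/gpu-a"))
}
```

The `seen` map ensures that shared dependents, such as a service backed by several pods, are reported once, and it also keeps the traversal safe on cyclic graphs.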
All contributions welcome.
More projects: KubeAI Autoscaler · Ingress2Gateway · LLMOps
- AWS - Primary cloud platform for production workloads
- Azure - Previous experience with enterprise deployments
- Kubernetes - Production cluster management, multi-tenancy, and workload orchestration
- ArgoCD - GitOps-driven continuous delivery and application lifecycle management
- Docker - Container image building and runtime management
- Crossplane - Kubernetes-native infrastructure provisioning and composition
- Prometheus & Grafana - Metrics collection, alerting, and dashboard visualization
- Splunk - Enterprise log aggregation and security analytics
- Datadog - Full-stack monitoring and application performance management
- OpenTelemetry - Vendor-neutral distributed tracing and telemetry collection
- Kyverno - Kubernetes-native policy engine for security and compliance
- OPA (Open Policy Agent) - Unified policy enforcement across the stack
- GitHub Actions - Cloud-native workflow automation and CI/CD pipelines
- Jenkins - Enterprise CI/CD automation and pipeline orchestration
- Flux - GitOps toolkit for Kubernetes continuous delivery
- UrbanCode Deploy - Enterprise application release automation
- PrestoDB & Trino - High-performance distributed SQL query engines for analytics
- Apache Superset - Modern data exploration and business intelligence platform
- Alluxio - Unified data orchestration for compute and storage
- Jupyter Notebooks - Interactive data science and machine learning workflows
Deeply interested in the convergence of AI/ML and Kubernetes - enabling organizations to run machine learning workloads at scale on cloud-native infrastructure. Exploring MLOps practices, GPU scheduling, and AI platform engineering.
| Title | Publication | Date |
|---|---|---|
| Abstracting AI Infrastructure: Native GPU Scaling for Internal Developer Platforms | Platform Engineering | May 2026 |
Sharing insights on DevOps best practices, Kubernetes deep-dives, and cloud-native architecture:
pavanmadduri.wordpress.com
Always open to connecting with fellow engineers and enthusiasts in the cloud-native and AI/ML space!
- Collaborate - Open an issue or discussion on any of my repositories
- Partner - Interested in contributing to CNCF or ASWF projects together
- Network - Happy to exchange ideas and share experiences
Let's build something great together!




