I'm a DevOps & Cloud Engineer who builds production-grade, event-driven infrastructure on AWS. I focus on automation, observability, and cost efficiency — turning manual ops work into self-healing systems.
- Built an AI-powered SRE Incident Analysis System using AWS Step Functions, Lambda, and Amazon Bedrock that auto-detects and analyzes infrastructure incidents with zero human intervention
- Engineered an EKS-based GPU autoscaler using Karpenter to dynamically provision and de-provision nodes for LLM inference workloads
- Engineered a FinOps automation tool that identifies and eliminates unused cloud resources on a scheduled basis
- Passionate about CI/CD pipelines, IaC, and building systems that scale without breaking
Check out my cloud resume at ericchiu.page
Cloud & Infrastructure
CI/CD & Automation
AI & Observability
Languages & Config
| Project | Description | Stack |
|---|---|---|
| AI-SRE Incident Analysis System | Event-driven serverless pipeline that auto-detects infrastructure issues and generates AI-powered root-cause analysis using Amazon Bedrock | Step Functions · Lambda · Bedrock · DynamoDB · Terraform |
| Aura — EKS GPU Autoscaler | Cloud infrastructure automation for AI workloads on ephemeral EKS clusters with Karpenter-driven GPU autoscaling for LLM inference | EKS · Karpenter · Terraform · Python |
| FinOps Zombie Hunter | Scheduled Lambda that scans AWS accounts for unused resources, calculates cost waste, and auto-remediates on a weekly cadence | Lambda · Python · Terraform |
| Containerized 3-Tier App | Full-stack application deployed via Terraform IaC with a CI/CD pipeline that enforces tests before every deployment | Docker · ECS · RDS · Terraform · GitHub Actions |
| MCP DevOps Mentor | Dockerized MCP server acting as a senior DevOps mentor — reviews infra, CI/CD pipelines, and cloud architecture with real-world best practices | Docker · Python · GitHub API |
| Cloud Resume | Serverless resume site on AWS with auto-deploy via GitHub Actions, visitor counter Lambda, and full Terraform IaC | CloudFront · Lambda · DynamoDB · Terraform |

