Skip to content
View iEric0228's full-sized avatar

Highlights

  • Pro

Block or report iEric0228

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
iEric0228/README.md

LinkedIn Email Resume Portfolio Profile Views Open to Work

About Me

I'm a DevOps & Cloud Engineer who builds production-grade, event-driven infrastructure on AWS. I focus on automation, observability, and cost efficiency — turning manual ops work into self-healing systems.

  • Built an AI-powered SRE Incident Analysis System using AWS Step Functions, Lambda, and Amazon Bedrock that auto-detects and analyzes infrastructure incidents with zero human intervention
  • Engineered an EKS-based GPU autoscaler using Karpenter to dynamically provision and de-provision nodes for LLM inference workloads
  • Engineered a FinOps automation tool that identifies and eliminates unused cloud resources on a scheduled basis
  • Passionate about CI/CD pipelines, IaC, and building systems that scale without breaking

Check out my cloud resume at ericchiu.page

Certifications

AWS CCP AWS SAA


Tech Stack

Cloud & Infrastructure

AWS ECS EKS Terraform Docker Kubernetes Linux

CI/CD & Automation

GitHub Actions AWS Lambda Step Functions

AI & Observability

Amazon Bedrock Prometheus Grafana CloudWatch

Languages & Config

Python Bash HCL YAML


Featured Projects

Project Description Stack
AI-SRE Incident Analysis System Event-driven serverless pipeline that auto-detects infrastructure issues and generates AI-powered root-cause analysis using Amazon Bedrock Step Functions · Lambda · Bedrock · DynamoDB · Terraform
Aura — EKS GPU Autoscaler Cloud infrastructure automation for AI workloads on ephemeral EKS clusters with Karpenter-driven GPU autoscaling for LLM inference EKS · Karpenter · Terraform · Python
FinOps Zombie Hunter Scheduled Lambda that scans AWS accounts for unused resources, calculates cost waste, and auto-remediates on a weekly cadence Lambda · Python · Terraform
Containerized 3-Tier App Full-stack application deployed via Terraform IaC with a CI/CD pipeline that enforces tests before every deployment Docker · ECS · RDS · Terraform · GitHub Actions
MCP DevOps Mentor Dockerized MCP server acting as a senior DevOps mentor — reviews infra, CI/CD pipelines, and cloud architecture with real-world best practices Docker · Python · GitHub API
Cloud Resume Serverless resume site on AWS with auto-deploy via GitHub Actions, visitor counter Lambda, and full Terraform IaC CloudFront · Lambda · DynamoDB · Terraform

Currently Learning

Terraform Associate CKA Helm


GitHub Stats

Pinned Loading

  1. AI-SRE-INCIDENT-ANALYSIS-SYSTEM-AWS AI-SRE-INCIDENT-ANALYSIS-SYSTEM-AWS Public

    AI-powered SRE Incident Analysis System built on AWS. Automates log ingestion, incident classification, root cause analysis, and remediation suggestions using serverless architecture and AI. Demons…

    Python

  2. Aura_AI-driven-Under-utilized-Resource-Autoscaler Aura_AI-driven-Under-utilized-Resource-Autoscaler Public

    This is a cloud infrastructure automation project for AI workloads, specifically designed to deploy and manage ephemeral AWS EKS clusters for running large language models (Llama 3) with GPU autosc…

    Shell

  3. Cloud-resume Cloud-resume Public

    Serverless resume website on AWS with CloudFront, Lambda, DynamoDB, and Terraform IaC. Auto-deploys via GitHub Actions on push.

    HCL

  4. Containerized-3-Tier-Application Containerized-3-Tier-Application Public

    Build a Python/Node.js app with a database. Focus heavily on Infrastructure as Code (deploying the infra via Terraform, not the console) and writing a proper CI/CD pipeline that runs tests before d…

    HCL

  5. finops-zombie-hunter finops-zombie-hunter Public

    A scheduled/manual trigger AWS Lambda function (The Hunter) wakes up every Sunday midnight, scans your AWS account for unused resources (EBS volumes, RDS, NAT-gateway, Elastic IPs), calculates the …

    Python

  6. MCP-DevOps-Mentor MCP-DevOps-Mentor Public

    A dockerized MCP server acting as a senior DevOps mentor, integrating GitHub APIs and web search to review infrastructure, CI/CD pipelines, and cloud architectures with real-world best practices.

    Python