I am a Data and Platform Engineer with over four years of experience designing, building, and operating scalable, production-grade data platforms on AWS and GCP.
My work spans real-time streaming, batch analytics, and infrastructure as code, with a strong focus on automation, reliability, and CI/CD-driven deployments. I design systems end to end, from ingestion and transformation to orchestration, governance, and analytics.
With six years of prior experience in real estate, I bring strong domain knowledge and business context to data problem-solving. I hold a Bachelor's degree in Real Estate and multiple certifications in data analytics and data engineering.
- Data Pipelines: dbt, Databricks, Databricks Asset Bundles, Snowflake, BigQuery, Redshift, Apache Kafka
- Cloud & Platform: AWS (S3, Glue, Athena, Redshift, IAM), GCP (GCS, BigQuery), Terraform
- Orchestration & CI/CD: Apache Airflow, GitHub Actions
- Programming: Python, SQL, R
- Data Visualization: Power BI, Tableau, Looker, QuickSight
- Version Control: Git, GitHub
- Data Platform Architecture: Designing and operating cloud-based data platforms with clearly defined raw, curated, and analytics layers
- Streaming & Batch Processing: Building Kafka-based streaming pipelines and dbt-driven ELT workflows
- Infrastructure as Code: Provisioning and managing data infrastructure using Terraform with remote state and CI/CD integration
- Automation & Reliability: Implementing automated testing, validation, and deployment workflows for data and infrastructure projects
- Analytics & Decision Support: Developing analytical models and dashboards to support operational monitoring and data-driven decision-making
Visit my YouTube channel for demonstrations of my projects: @Data_Pipeline_Lab


