Data Engineer with a background in computer systems engineering. I design and deliver reliable data pipelines that enable analytics and reporting at scale.
- Cloud & Lakehouse: Azure (Data Factory, Databricks, Storage), Delta Lake
- Data Processing: Databricks, Spark, Python
- Data Management: SQL (Azure SQL, PostgreSQL, MySQL), dbt fundamentals
- Orchestration & DevOps: CI/CD for data workloads, Git, Docker
Currently building production-grade pipelines on Azure and Databricks, improving data quality, and optimizing SQL transformations for analytics teams.
- Sales analytics pipeline: Ingests multi-source sales data into Azure Data Lake, transforms it with Databricks notebooks, and publishes curated tables for BI.
- Operational dashboards: Real-time metrics powered by event-driven ingestion and Spark streaming jobs.
- Data quality framework: Reusable validation checks for SQL and PySpark workloads to keep datasets trustworthy.
- Email: carolinafnicasio@gmail.com
- LinkedIn: carolinanicasio
- GitHub: CarolinaNicasio
