Dockerized ETL pipeline using Kafka, Airflow, and PostgreSQL to simulate streaming movie review ingestion and aggregation.
-
Updated
Feb 19, 2026 - Python
Dockerized ETL pipeline using Kafka, Airflow, and PostgreSQL to simulate streaming movie review ingestion and aggregation.
A Spotify Wrapped-style analytics project using Python, SQL Server, and Power BI.
Pipeline de dados end-to-end para analytics de RevOps em uma empresa SaaS B2B fictícia, com Python, BigQuery, dbt, Airflow e Power BI.
A Python-based automation pipeline that converts multiple CSV files from a folder into a structured Excel workbook, eliminating manual Excel work. Built as part of an AI Powered Python course with a Data Analyst perspective.
A user-friendly Python GUI application that automates common data cleaning tasks for Excel and CSV files, reducing manual effort and improving dataset quality for analytics workflows.
Portfolio snapshot of a Python compiler backend that generates RISC-V-style assembly from typed AST nodes.
Rule-driven validation engine for AI document extraction outputs with fuzzy matching, normalization, risk scoring, batch processing, and audit reporting.
African football data pipeline — raw scraper to Snowflake/dbt warehouse to Databricks lakehouse. Three-phase DE portfolio project.
Add a description, image, and links to the porfolio-project topic page so that developers can more easily learn about it.
To associate your repository with the porfolio-project topic, visit your repo's landing page and select "manage topics."