- 👨‍💻 All of my data analysis projects are available at GitHub Projects
- ⏮ Previous Project: EURO 2024 - Real Time Data Warehouse Streaming
- ⚡ My Current Work: 50 SQL Leetcode Problems
- 🏆 Kaggle Competition Project: 🛳️ Titanic Survival Analysis & Classification Model (Kaggle Competition)
- 🕸️ Web Scraping: 🔴 YouTube Data Scraping
- 🌱 I’m currently learning Data Engineering
- 💼 My Portfolio: datascienceportfol.io/evansajumathew
- 📄 Resume: Google Drive
- ⚡ Fun fact: I'm actually good at graphic design
Globallogic Technology Limited
- Gurugram
- https://www.datascienceportfol.io/evansajumathew
- in/evansajumathew
Pinned
- ETL-University-Course-Extraction-Using-Spark-Snowflake (Public)
  This project automates the extraction of university course details (e.g., schedules, professors, course codes) from text files using regex patterns and a spaCy NLP model, and processes them using PyS…
  Python
- euro-2024-kafka-pinot-pipeline (Public)
  This project implements a real-time data pipeline for EURO 2024 football data, utilizing Apache Kafka for streaming, Apache Pinot for fast querying, and Apache Superset for data visualization. The …
  Python
- Reddit_ETL_DE (Public)
  This project demonstrates a complete data pipeline for extracting, transforming, and loading (ETL) Reddit data into an Amazon Redshift data warehouse. The pipeline uses various AWS services and too…
  Python 1
- Apache-Kafka-Kraft-and-Apache-Druid (Public)
  Integrated Apache Kafka (KRaft mode) with Apache Druid for real-time streaming and high-performance analytics.
  Python
- Data-Analysis-Projects (Public)
  This repository hosts multiple data analysis projects, showcasing a variety of real-time and batch processing pipelines. Each project highlights different tools and technologies, offering comprehen…
  Jupyter Notebook 1