Skip to content
View evanmathew's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report evanmathew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
evanmathew/README.md

Hi 👋, I'm Evan Saju Mathew

Analyst at GlobalLogic Technology Limited

evanmathew

Connect with me:

evansajumathew evansajumathew evansajumathew

💻 Tech Stack:

Apache Spark Apache Kafka Apache Airflow MySQL Git GitHub Bash Script Docker Snowflake Canva Python Matplotlib Adobe Illustrator Adobe XD Python Figma CSS3 HTML5 PowerShell Postgres AWS Oracle Anaconda NumPy Pandas Plotly Linux

Pinned Loading

  1. ETL-University-Course-Extraction-Using-Spark-Snowflake ETL-University-Course-Extraction-Using-Spark-Snowflake Public

    This project automates the extraction of university course details (e.g., schedules, professors, course codes) from text files using Regex pattern and SpaCy NLP Model and , processes them using PyS…

    Python

  2. euro-2024-kafka-pinot-pipeline euro-2024-kafka-pinot-pipeline Public

    This project implements a real-time data pipeline for EURO 2024 football data, utilizing Apache Kafka for streaming, Apache Pinot for fast querying, and Apache Superset for data visualization. The …

    Python

  3. Reddit_ETL_DE Reddit_ETL_DE Public

    This project demonstrates a complete data pipeline for extracting, transforming, and loading (ETL) Reddit data into an Amazon Redshift data warehouse. The pipeline uses various AWS services and too…

    Python 1

  4. Apache-Kafka-Kraft-and-Apache-Druid Apache-Kafka-Kraft-and-Apache-Druid Public

    Integrated Apache Kafka (KRaft mode) with Apache Druid for real-time streaming and high-performance analytics.

    Python

  5. Data-Analysis-Projects Data-Analysis-Projects Public

    This repository hosts multiple data analysis projects, showcasing a variety of real-time and batch processing pipelines. Each project highlights different tools and technologies, offering comprehen…

    Jupyter Notebook 1

  6. evanmathew evanmathew Public

    1