Skip to content
View seanhuvaya's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report seanhuvaya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
seanhuvaya/README.md

Hi, I'm Sean Huvaya 👋

I'm a Data Engineer with a solid backend engineering foundation and growing expertise in machine learning operations (MLOps).

I've worked across international teams optimizing SQL queries, automating data pipelines, refactoring legacy systems, deploying Dockerized apps on AWS, and integrating ML features into real-time workflows.

Recently completed my MSc in Artificial Intelligence from Yeshiva University (Dec 2025), with coursework in Machine Learning, Neural Networks, Data Science, and Cloud Computing.

🔭 I’m currently working on real-time data streaming by exploring the NYC 311 Service Requests dataset — one of the largest open civic datasets in the world.

My goal is to build a robust streaming pipeline using Apache Kafka to ingest, process, and analyze complaints/requests in near real-time. This project is helping me master event-driven architectures, data ingestion at scale, and fault-tolerant streaming — key skills for production data engineering and MLOps.

📫 How to reach me:

→ LinkedIn: linkedin.com/in/seanhuvaya
→ GitHub: github.com/seanhuvaya
→ Personal site: seanhuvaya.dev

Open to collaborations, opportunities in data engineering, streaming, MLOps, or AI infrastructure — let's build something impactful! 🚀

Pinned Loading

  1. nyc311-kafka-airflow-spark nyc311-kafka-airflow-spark Public

    Python

  2. ktxdev.github.io ktxdev.github.io Public

    TypeScript