Skip to content
View debu-sinha's full-sized avatar

Block or report debu-sinha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
debu-sinha/README.md

Hey, I'm Debu Sinha

Lead Applied AI/ML Engineer (Solutions Architecture) @ Databricks | Author | Open Source Contributor

Building ML platforms at scale. Helping enterprises ship AI from prototype to production.


Tech Stack


Book

Practical Machine Learning on Databricks

Packt Publishing, 2023 | 244 pages

End-to-end guide for building production ML systems on Databricks - from data engineering to MLOps. Reached best seller status in its category within 2 weeks of release.



Research

Research Affiliate, Johns Hopkins University

Paper Journal
DEMYSTIFYING LARGE LANGUAGE MODELS: A TECHNICAL DEEP DIVE IJCET
Exploring the Latest Innovations in Reinforcement Learning for Real-World Impact IJSRCET
THE TRANSFORMATIVE IMPACT OF AI ON EDUCATION: OPPORTUNITIES AND CHALLENGES IAEME
AI IN HEALTHCARE: BRIDGING THE GAP BETWEEN DATA AND BETTER PATIENT OUTCOMES IRJETS

Open Source Contributions

Active contributor to MLflow (23K+ stars) - the leading open-source ML lifecycle platform.

Recent PRs:

  • #19152 - inference_params support for LLM Judges (Approved)
  • #19237 - Phoenix & TruLens third-party scorer integrations
  • #19238 - Async predict support for ChatModel/ChatAgent
  • #19248 - Configurable parallelism for GenAI evaluation

Speaking

  • TechFutures 2025 (NYC) - End-to-End MLOps Pipelines Workshop (GitHub)
  • Data Con LA 2022 - Simplifying AI/ML using Databricks Feature Store (YouTube)
  • Data Con LA 2021 - Detecting Fake Reviews at Scale using Spark and John Snow Labs (YouTube)
  • NYU Guest Lecture - ML Pipeline with Apache Spark

Professional Memberships


Connect


Pinned Loading

  1. Databricks-GenAI-Series Databricks-GenAI-Series Public

    All the resources related to GenAI hands on workshop.

    Python 24 47

  2. PacktPublishing/Practical-Machine-Learning-on-Databricks PacktPublishing/Practical-Machine-Learning-on-Databricks Public

    Practical Machine Learning on Databricks, published by packt

    Python 21 37

  3. cross-region-model-serving-dab cross-region-model-serving-dab Public

    Production-ready Databricks Asset Bundle for cross-region ML model serving using Delta Sharing. Deploy models and feature tables across workspaces with zero-copy data access and automated online fe…

    Python 1

  4. mlflow mlflow Public

    Forked from mlflow/mlflow

    The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

    Python