Skip to content
View JagrutN's full-sized avatar
  • Mountain View, CA

Organizations

@T-I-P

Block or report JagrutN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JagrutN/README.md

Hi, I’m Jagrut Nemade

MS in Data Science @ UW–Madison
Experience: AI/ML Engineer (Research) | Data Analyst (Deloitte)
Based in Mountain View, CA


About Me

I’m passionate about designing scalable AI and data systems that bring research into real-world impact.
My work spans machine learning algorithms, deep neural networks, and statistical analysis, with a strong focus on end-to-end data pipelines and applied AI systems.
I specialize in efficient training of ML models using Active Learning, smart data selection, and robust evaluation to reduce compute and labeling costs while preserving accuracy.
I have hands-on experience with LLMs, RAG pipelines, vector databases, and big-data engineering, and I deploy solutions across ETL, analytics dashboards, and cloud integrations.
Alongside research, I actively contribute to open-source projects and collaborate on advancing efficient ML methods.


Tech Stack

Languages: Python, SQL, R, C++, MATLAB, Java, JavaScript

Frameworks & Libraries: PyTorch, TensorFlow, Scikit-Learn, Pandas, NumPy, Matplotlib, NLTK

Big Data & Cloud: Spark, Kafka, Hive, BigQuery, GCP, Oracle ERP, Docker, CI/CD

Tools & Platforms: Tableau, Power BI, Git, Weights & Biases, LangChain, FastAPI, Flask

Specialties:

  • Machine Learning Algorithms & Deep Neural Networks
  • Active Learning & Efficient Fine-Tuning
  • Statistical Modeling, Hypothesis Testing & A/B Testing
  • LLMs, RAG Systems & Vector Databases
  • Data Engineering, ETL Pipelines & Cloud Integration
  • Big Data Systems, Streaming Pipelines & Real-Time Analytics

Projects & Research

  • Agentic RAG for Radiology → Built with LangGraph + LLaMA, deployed into clinical workflows
  • Efficient LLM Fine-Tuning → Novel data selection strategy; reduced training data by 67% while maintaining accuracy
  • SQL Injection Detection → ML-based real-time query classification (published at IEEE CONECCT)
  • Big Data Systems → Spark + Kafka + Hive pipeline for large-scale loan prediction and real-time analytics
  • Hope Speech Detection → Developed NLP pipeline for multilingual social media moderation (published book chapter)
  • Image Classification → U-Net based model for multicultural wedding classification (published in Expert Systems with Applications)

📄 Publications in EMNLP (under review), Expert Systems with Applications, IEEE CONECCT, CCSM Book Chapter


Currently

  • Exploring hybrid RAG systems combining vector databases with knowledge graphs
  • Advancing research in Active Learning, Deep Neural Networks, and Statistical Modeling
  • Contributing to open-source AI/ML and data engineering frameworks
  • Investigating deployment strategies for LLMs in real-world applications
  • Building scalable pipelines that merge big data systems with applied machine learning

Connect


⭐️ Always open to collaboration on AI and ML projects!

Popular repositories Loading

  1. RAG RAG Public

    AI chatbot for Medical Data

    Python 1

  2. Website_Orio Website_Orio Public

    Dart 1

  3. node3-weather-website node3-weather-website Public

    JavaScript

  4. datasciencecoursera datasciencecoursera Public

  5. datasharing datasharing Public

    Forked from jtleek/datasharing

    The Leek group guide to data sharing

  6. ProgrammingAssignment2 ProgrammingAssignment2 Public

    Forked from rdpeng/ProgrammingAssignment2

    Repository for Programming Assignment 2 for R Programming on Coursera

    R