Skip to content

abhijitdeshpande83/GenAI

Repository files navigation

🧠 GenAI Portfolio

A curated collection of production-oriented Generative AI projects spanning RAG, LLM fine-tuning, conversational AI, and agentic systems.

Each project lives in its own folder with a dedicated README covering architecture, implementation choices, and trade-offs. The main goals across this portfolio: production deployment, hybrid (deterministic + LLM) design, and honest documentation of what worked and what didn’t.


πŸš€ Featured Projects

πŸ€– Agentic AI

πŸ›‘οΈ AURA: Hybrid Conversational AI Support System

A production-oriented customer support system that combines a deterministic ML intent router (DeBERTa) with LangGraph-orchestrated workflows and bounded LLM reasoning (Groq). Designed to solve a real production tension: balancing LLM flexibility with deterministic control.

Highlights

  • Hybrid architecture: DeBERTa for intent routing, Groq LLM scoped to bounded reasoning only (Inquiry Agent), LangGraph for traceable workflow execution
  • Three execution paths: Inquiry Agent (LLM + tools), Complaint Workflow (ticket creation), Retention Workflow (churn mitigation)
  • 89% intent classification accuracy, sub-800ms p95 latency
  • Documented design decision: explicitly rejected a pure-LLM orchestration approach after observing tool-selection loops and inconsistent workflow behavior
  • Deployed via AWS Lambda + API Gateway + ECR (containerized)

πŸ“‚ Full Documentation


🌍 ATLAS: Agentic Tool-Enabled Web Assistant

A LangGraph-powered autonomous agent that performs real-time web research using Groq LLM and Tavily Search, with session-based memory. (Named after Atlas the Titan: holding up a world of real-time information.)

Highlights

  • Dynamic tool orchestration: the agent decides when to respond directly vs invoke external tools
  • Session memory via LangGraph MemorySaver for contextual multi-turn conversations
  • Modular separation of agent definition from execution via a run_agent interface
  • Deployed serverlessly on AWS Lambda + API Gateway, with CORS handled by routing requests through a Django backend proxy
  • Containerized with Docker

πŸ“‚ Full Documentation


🎯 LLM Fine-Tuning

🧠 INTELLA: Intent Classification with LoRA-Fine-Tuned Language Assistant

A two-phase fine-tuning project for multi-domain conversational AI, demonstrating end-to-end LLM customization from intent classification through natural response generation.

Phase I: Intent Classification Foundation

  • Fine-tuned RoBERTa-large with LoRA to classify ~150 user intents across multiple domains
  • Built scalable training and deployment pipelines on AWS SageMaker + Hugging Face Trainer

Phase II: Natural Language Generation for Core Intents

  • Reduced 150 raw intents β†’ 20 core intents via a custom label-mapping wrapper
  • Fine-tuned FLAN-T5 to generate natural responses for 10 core intents
  • Cost-efficient serverless inference via AWS Lambda + API Gateway

Honest limitation surfaced: FLAN-T5 handles single-turn queries fluently but does not reliably manage structured multi-turn dialogues requiring slot filling. This motivated the Rasa project below.

πŸ“‚ Full Documentation


πŸ“š Retrieval-Augmented Generation

πŸ” IntelliQA: RAG-based Document Q&A

A conversational Q&A system over uploaded documents (PDF, DOCX, TXT, and more) built on a containerized, production-deployed RAG pipeline.

Highlights

  • Document parsing and embedding via LangChain + Apache Tika
  • Multi-format support: PDF, DOCX, TXT, and additional file types
  • Operational guardrails: duplicate detection on upload, 5-file session cap, daily cleanup cron for storage hygiene
  • Modular Dockerized deployment package
  • Deployed on AWS for production usage

πŸ“‚ Project Folder (detailed README in progress)


πŸ’¬ Dialogue Management

🎬 Rasa: Movie Booking Chatbot

A deterministic, slot-filling chatbot for end-to-end movie booking, directly addressing the multi-turn dialogue limitation identified in INTELLA Phase II.

Highlights

  • Multi-turn dialogue collecting ZIP code, movie, showtime, theater, and seat selection
  • Form validation and business-rule enforcement (e.g. one seat per show, no past or ongoing dates)
  • Gracenote API integration for real-time movie listings, theaters, and showtimes
  • Automated HTML email confirmations via Python’s smtplib
  • Docker-based training and deployment, handling MacOS β†’ Linux model compatibility issues

πŸ“‚ Full Documentation


πŸ“ Additional Projects

βœ‚οΈ TextSummarization

NLP-based text summarization project. (Documentation in progress.)

πŸ“‚ Project Folder


πŸ—‚ Repository Structure

GenAI/
β”œβ”€β”€ Agentic Tool-Enabled Web Assistant (ATLAS)/      # LangGraph + Groq + Tavily research agent
β”œβ”€β”€ Agentic User Resolution Assistant (AURA)/        # Hybrid production support system
β”œβ”€β”€ Intent Classification with LoRA-Fine-Tuned       # RoBERTa + LoRA + FLAN-T5 fine-tuning
β”‚   Language Assistant(INTELLA)/
β”œβ”€β”€ IntelliQA/                                       # RAG-based document Q&A
β”œβ”€β”€ TextSummarization/                               # NLP text summarization (WIP)
β”œβ”€β”€ rasa/                                            # Deterministic dialogue management
└── README.md                                        # You are here

🧰 Skills Demonstrated

Area Tools & Techniques
LLMs & Fine-Tuning RoBERTa, DeBERTa, FLAN-T5, LoRA / PEFT, Hugging Face Transformers
Agentic Systems LangGraph, LangChain, Groq LLM, Tavily Search, MemorySaver checkpointing
RAG LangChain, Apache Tika, vector retrieval, multi-format document parsing
Dialogue Management Rasa (slot filling, form validation, business-rule enforcement)
Cloud & Deployment AWS SageMaker, Lambda, API Gateway, ECR, Docker, serverless inference
Backend & Data Python, Django, Supabase
System Design Hybrid ML + LLM architectures, deterministic-first routing, graph-based orchestration

πŸ‘€ About

I build practical GenAI systems with a focus on production deployment, hybrid architectures (deterministic + LLM), and transparent trade-off documentation. This portfolio reflects a progression from foundational RAG and fine-tuning toward agentic and hybrid system design.

πŸ“« Connect: LinkedIn β€’ Website β€’ GitHub