AI Engineer · Azure Data Engineer (DP-203) · Python Developer
Building production multi-agent LLM systems, RAG pipelines, and cloud-scale ETL solutions
Click to explore skills, projects, timeline, and career achievements as a live interactive page
- Microsoft Certified Azure Data Engineer Associate (DP-203)
- 3+ years building production Generative AI: multi-agent platforms, RAG pipelines, LLM orchestration
- 7+ years of backend engineering: Azure Data Factory, ETL pipelines, cloud migration
- Delivered solutions for U.S. government clients: Contra Costa County, LA County, California State Agencies
- Open to AI Engineer and Azure Data Engineer roles (remote or Hyderabad)
AppNova AI: 11 specialised AI agents analyse legacy codebases in parallel and generate full modernisation reports
- Agents: Architecture · Security · Code Generation · DevOps · Testing · Migration Planning · Documentation · UI/UX · Business Rules · Integration · Data Migration
- Backend: FastAPI + LangGraph + asyncio + SSE real-time streaming
- LLM Cascade: Claude → Gemini → Groq (LLaMA 3) → Ollama, with automatic key rotation across 25+ API keys and zero-downtime fallback
- RAG: ChromaDB with AST file summaries, corpus deduplication, SHA-256 content fingerprinting
- Features: Spectator Mode, DOCX/MD export, self-correcting agents, batch context via Gemini's 1M-token window
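
The LLM cascade above can be pictured with a minimal sketch. It assumes each provider is wrapped in a plain callable taking `(api_key, prompt)`; the names `Provider`, `ProviderError`, and `cascade` are hypothetical and are not taken from the AppNova AI codebase, which may handle rotation and fallback differently.

```python
from dataclasses import dataclass, field
from itertools import cycle
from typing import Callable, Iterator


class ProviderError(Exception):
    """Raised by a provider call on rate limits, auth failures, or outages."""


@dataclass
class Provider:
    """One tier of the cascade (hypothetical wrapper, not a real SDK class)."""
    name: str
    keys: list[str]
    call: Callable[[str, str], str]  # (api_key, prompt) -> completion text
    _rotation: Iterator[str] = field(init=False, repr=False)

    def __post_init__(self) -> None:
        # Round-robin over the keys; the position persists between calls.
        self._rotation = cycle(self.keys)

    def complete(self, prompt: str) -> str:
        # Try every key at most once, resuming where the last call stopped.
        last_error = None
        for _ in range(len(self.keys)):
            key = next(self._rotation)
            try:
                return self.call(key, prompt)
            except ProviderError as exc:
                last_error = exc
        raise ProviderError(f"{self.name}: all keys failed") from last_error


def cascade(providers: list[Provider], prompt: str) -> tuple[str, str]:
    """Return (provider_name, completion) from the first provider that succeeds."""
    for provider in providers:
        try:
            return provider.name, provider.complete(prompt)
        except ProviderError:
            continue  # fall through to the next tier
    raise ProviderError("every provider in the cascade failed")
```

Passing the providers in the order Claude, Gemini, Groq, Ollama reproduces the fallback order listed above: a tier is abandoned only after each of its keys has failed once.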
| Project | What it does | Stack |
|---|---|---|
| AppNova AI | 11 AI agents analyse legacy codebases in parallel | FastAPI · LangGraph · Claude · Gemini · ChromaDB |
| GovGenie | RAG-powered RFP generator; cuts bid writing by ~70% | LangChain · Ollama · ChromaDB · HuggingFace |
| ReferenceFiller | Resume → fills DOCX template in 2 min (was 45 min) | FastAPI · Ollama · ChromaDB · python-docx |
| Video to Narrative | Surveillance footage → law-enforcement incident report | Flask · Whisper · ViT-GPT2 · Groq LLaMA 3 |
| Skill Matrix App | PDF/DOCX resume → HR skill matrix via RAG | Flask · Mistral · ChromaDB · Sentence Transformers |
| Gemini YouTube Bot | Zero-input AI video creation + auto-publish | Gemini · MoviePy · gTTS · GitHub Actions |
Project Details
GovGenie: Retrieves past proposal language from ChromaDB via semantic search and drafts government bid responses in the company's voice. Conversational chat interface with LangChain memory for follow-up clause questions. Reduced bid prep from 5+ days to under 24 hours.
ReferenceFiller: Upload resume → LLM extracts structured data → fills DOCX template via semantic field mapping. Session-based UUID stores, chunk-based text splitting, rotating log handlers. Replaced a 45-min manual task with a 2-min automated workflow.
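
As an illustration of chunk-based splitting combined with a per-session UUID store, here is a minimal sketch. It assumes character-based chunks with a fixed overlap and an in-memory dict as the store; the function names, the default sizes, and the dict itself are hypothetical stand-ins, and the project may split on different boundaries or keep its sessions elsewhere.

```python
import uuid


def split_into_chunks(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into character chunks that share `overlap` characters."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
    return chunks


def create_session(store: dict[str, list[str]], resume_text: str) -> str:
    """Store the chunks of one upload under a fresh UUID and return that id."""
    session_id = str(uuid.uuid4())
    store[session_id] = split_into_chunks(resume_text)
    return session_id
```

Keying each upload by its own UUID keeps one candidate's chunks from mixing with another's during retrieval.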
Video to Narrative: OpenCV frame extraction → ViT-GPT2 captions → Whisper transcript → Groq LLaMA 3 synthesis. Returns structured JSON with timestamps, transcript, general summary, and formal law-enforcement narrative.
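
The shape of that structured JSON could look roughly like the sketch below. The field names, the `Segment` and `IncidentReport` classes, and the `summarise` callable (a stand-in for the LLM synthesis step) are all assumptions made for illustration; the project's actual schema is not shown here.

```python
import json
from dataclasses import asdict, dataclass
from typing import Callable


@dataclass
class Segment:
    start_seconds: float  # timestamp of the sampled frame
    caption: str          # frame caption (ViT-GPT2 in the project)
    transcript: str       # speech near this timestamp (Whisper in the project)


@dataclass
class IncidentReport:
    segments: list[Segment]
    transcript: str
    summary: str
    narrative: str

    def to_json(self) -> str:
        # asdict() recurses into the nested Segment dataclasses.
        return json.dumps(asdict(self), indent=2)


def build_report(
    segments: list[Segment],
    summarise: Callable[[str, str, str], str],  # (kind, timeline, transcript) -> text
) -> IncidentReport:
    """Merge per-frame results and ask `summarise` for both written outputs."""
    transcript = " ".join(s.transcript for s in segments if s.transcript)
    timeline = "\n".join(f"[{s.start_seconds:.1f}s] {s.caption}" for s in segments)
    return IncidentReport(
        segments=list(segments),
        transcript=transcript,
        summary=summarise("summary", timeline, transcript),
        narrative=summarise("narrative", timeline, transcript),
    )
```

Keeping the timestamped captions alongside the merged transcript lets the summary and the formal narrative be generated from the same timeline.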
Skill Matrix App: Fully local LLM (Mistral via Ollama), so no candidate data is sent externally. RAG pipeline cuts hallucination to near zero vs baseline LLM-only approaches.
Gemini YouTube Bot: Gemini scripts → gTTS voiceover → MoviePy video → YouTube API auto-publish. Custom thumbnails, dynamic TextClip overlays. Runs entirely on a GitHub Actions cron schedule, with no server needed.
30+ production ADF pipelines for U.S. government agencies
| Metric | Detail |
|---|---|
| Pipelines | 30+ ADF pipelines covering person, address, offense, narrative, gang, and property records |
| Performance | 50% faster data processing via optimised Mapping Data Flows |
| Migration | ARIES: 40+ tables, on-prem SQL Server → Azure Cloud, zero data loss |
| Automation | ADF validation + deduplication → 40% less manual review effort |
| Infrastructure | Azure VMs · Elastic Pools · Azure SQL · Data Lake Gen2 · Key Vault |
| Clients | Contra Costa County · LA County · California State Agencies |

