Iβm an AI Engineer who focuses on making models actually usable in the real world β deployed, optimized, and running reliably.
- π 3rd-year Computer Science Engineering student.
- π§ Work spans LLMs, RAG systems, agent-based architectures, and local model inference
- βοΈ Strong focus on deployment, servers, tunnels, automation, and infra
- π Interested in privacy-first AI (local & decentralized models)
- π§ͺ Prefer building end-to-end systems over isolated demos
I care less about model hype and more about systems that work under constraints.
- Local LLMs (Ollama, Phi, Qwen, Gemma)
- RAG pipelines (CSV / vector DB / symptom-based retrieval)
- Tokenization & dataset preprocessing at scale
- Agent-based & multi-agent reasoning systems
- FastAPI-based AI services
- API orchestration & fallback strategies
- Real-time transcription & processing pipelines
- Cloudflare Tunnels (no open ports, ISP-safe)
- Nginx reverse proxy setups
- Self-hosted AI servers (CPU-constrained environments)
- GitHub Pages, Render, Streamlit deployments
- π§ Micropulse β Local & distributed SLM system (privacy-first AI)
- π§© RAG-based ERM Assistant β Reduced API calls by storing symptom-level abstractions
- π Live Audio Transcription + Keyword Extraction for medical workflows
- π Location-based Chat App β Geo-matching + WebSocket privacy model
- π€ Multi-agent Deliberative AI System (Project AETHER)
- π§ͺ Trained & optimized small language models under limited compute
- π Published 2 AI/ML research papers
- π Participated in multiple ML hackathons
- πΌ Internships: Doctors App, Mindray
If you care about AI that actually runs, not just models that look good in notebooks β weβll get along.
π© yashas.vaddi12@gmail.com π linkedin.com/in/yashasvaddi π github.com/YashasVaddi
βBuild systems, not demos. Optimize for reality.β