Skip to content
View vinaybudideti's full-sized avatar

Block or report vinaybudideti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
vinaybudideti/README.md

Vinay Kumar Reddy Budideti

Full-Stack Engineer · Java + React + Python · AI Systems · Lawrence, Kansas

I build full-stack applications and ship AI into production. Three years of industry experience across enterprise Java backends, React frontends, and LLM pipelines — owning features end-to-end from database schema to deployed UI. Graduated MSCS at University of Kansas.

Looking for: Full-time SWE / Full-Stack / AI-integrated product roles · Remote or On-Site · US-authorized

📍 Portfolio  ·  LinkedIn  ·  Email


What I've shipped

AgentFuse — Production Python SDK for LLM cost control. Two-tier Redis + FAISS semantic cache achieving 87.5% hit rate and 71.8% cost reduction ($0.24 vs $0.87 per workload). Graduated budget policies auto-downgrade models, compress context, and terminate gracefully instead of burning budget. Supports 12 providers, 22+ models. Published on PyPI as agentfuse-runtime. 260 unit tests, 86% core coverage.

TradeFlow — Event-driven loan processing platform built on microservices. Spring Boot · Apache Kafka · CQRS · Outbox pattern · JWT gateway · React frontend. 4-stage GitHub Actions CI/CD pipeline deploying to AWS ECS. Kubernetes-ready with HPA auto-scaling 2–10 replicas. AI-powered underwriting explanations via Claude API.

CodeMind — RAG application that lets you chat with any GitHub repository. NestJS + React + pgvector + LangChain + Claude AI. Context-aware code chunking, streaming responses, source citations. Live →

JobRadar — AI job board that fetches 600+ SWE jobs daily, scores each against my profile using Claude, and ranks them APPLY / MAYBE / SKIP based on tech fit and H-1B sponsorship signals.

Intent Atoms — Sub-query level semantic caching for LLM APIs. 3-tier hybrid FAISS engine. 87.5% cache hit rate, 71.8% cost savings measured across 100 real API calls.

NutriBot — AI nutrition assistant. Next.js 15 + Vercel AI SDK. Real-time macro tracking, personalized meal recommendations, streaming conversational responses. Live →


Stack

Languages     Java · TypeScript · Python · SQL
Frontend      React · Next.js · Tailwind CSS
Backend       Spring Boot · Spring MVC · FastAPI · NestJS · Flask
AI / LLM      LangChain · RAG · FAISS · OpenAI · Anthropic · Rasa · pgvector
Databases     PostgreSQL · MongoDB · MySQL · Oracle · Redis
Messaging     Apache Kafka · Avro · Schema Registry
DevOps        Docker · Kubernetes · AWS (ECS, ECR) · GCP · GitHub Actions · Jenkins
Observability Splunk · OpenTelemetry · Prometheus · Structured Logging
Testing       JUnit 5 · Mockito · Jest · TestContainers · WireMock

Experience

Vosyn — AI Software Engineer Oct – Nov 2025
Vosyn — Full-Stack Engineering Intern Aug – Oct 2025
KU NCCS Research Lab — Full-Stack Software Developer Jan – Mar 2024
University of Kansas — IT Support Specialist Apr – Aug 2024
Cognizant — Full Stack Developer · Java Jul 2022 – Jul 2023
Cognizant — Programming Analyst Intern Jan – Jun 2022
SVEC — Software Developer Co-op Jun – Dec 2020

Education

MS Computer Science · University of Kansas · 2023 – 2025
BTech Computer Science · Sree Vidyanikethan Engineering College · 2019 – 2023


Numbers that matter

  • 71.8% LLM cost reduction · 87.5% semantic cache hit rate — AgentFuse
  • 75% API response time improvement — 800ms → 200ms at Cognizant
  • 50+ features shipped end-to-end across internships and production systems
  • 285+ unit tests authored across AgentFuse, TradeFlow, NCCS Lab
  • 4 peer-reviewed research publications

Open to full-time opportunities · US work-authorized · vinaykumarreddy.budideti@gmail.com

Pinned Loading

  1. agentfuse agentfuse Public

    Intelligent LLM agent cost optimization runtime.

    Python

  2. TradeFlow TradeFlow Public

    TypeScript

  3. codemind codemind Public

    Chat with any GitHub repository using AI — RAG-powered Q&A with streaming responses, source citations, and auto bug finder built with NestJS, React, pgvector, and Claude AI.

    TypeScript 1

  4. Masters-Project Masters-Project Public

    AI-powered nutrition assistant with personalized meal planning, real-time nutrition data from Nutritionix API, and streaming AI responses. Built with Next.js 15, TypeScript, Tailwind CSS, Zustand, …

    TypeScript 1

  5. JobRadar JobRadar Public

    AI job board that fetches 600+ SWE jobs daily, scores each with Claude AI, and ranks them as APPLY / MAYBE / SKIP based on tech fit and H-1B sponsorship signals.

    JavaScript 1

  6. intent-atoms intent-atoms Public

    Sub-query level semantic caching for LLM APIs — 3-tier hybrid engine with FAISS vector search. 87.5% cache hit rate, 71.8% cost savings on 100 real API calls.

    Python 1