Production-ready agentic RAG system for medical document Q&A. Built with LangChain, GPT-4o, Pinecone, FastAPI, and Docker. Features ReAct agent reasoning, source-cited responses, and a RAGAS evaluation pipeline.


MedQuery: Agentic RAG System for Clinical Document Q&A

MedQuery is a production-ready RAG (Retrieval-Augmented Generation) system that:

  • Ingests clinical PDFs (guidelines, research papers, discharge summaries) into a vector database
  • Answers natural language questions using a LangChain ReAct agent grounded in your documents
  • Cites sources with filename and page number for every response
  • Evaluates itself using RAGAS metrics to measure and track answer quality

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                        INGESTION PIPELINE                       │
│  PDF → PyPDF → RecursiveCharacterTextSplitter → OpenAI Embeds   │
│             → ChromaDB (dev) / Pinecone (prod)                  │
└─────────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────────┐
│                     AGENTIC RAG LAYER                           │
│  Question → LangChain ReAct Agent → MMR Retrieval (top-5)       │
│           → Grounded Prompt → GPT-4o → Answer + Citations       │
└─────────────────────────────────────────────────────────────────┘
                              ↓
┌─────────────────────────────────────────────────────────────────┐
│                    RAGAS EVALUATION LAYER                       │
│  Faithfulness | Answer Relevancy | Context Precision | Recall   │
└─────────────────────────────────────────────────────────────────┘
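The "MMR Retrieval (top-5)" step above re-ranks candidate chunks so the agent sees passages that are both relevant to the question and not redundant with each other. A minimal, illustrative sketch of the Maximal Marginal Relevance idea on toy vectors (the real system uses OpenAI embeddings and the vector store's built-in MMR search; the vectors and `lambda_mult` default here are assumptions for demonstration only):

```python
from math import sqrt

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (sqrt(sum(x * x for x in a)) * sqrt(sum(x * x for x in b)))

def mmr(query_vec, doc_vecs, k=5, lambda_mult=0.5):
    """Pick k document indices, trading off query relevance vs. novelty.

    lambda_mult=1.0 -> pure relevance; lambda_mult=0.0 -> pure diversity.
    """
    selected = []
    candidates = list(range(len(doc_vecs)))
    while candidates and len(selected) < k:
        best, best_score = None, float("-inf")
        for i in candidates:
            relevance = cosine(query_vec, doc_vecs[i])
            # Penalise similarity to chunks we already picked
            redundancy = max(
                (cosine(doc_vecs[i], doc_vecs[j]) for j in selected),
                default=0.0,
            )
            score = lambda_mult * relevance - (1 - lambda_mult) * redundancy
            if score > best_score:
                best, best_score = i, score
        selected.append(best)
        candidates.remove(best)
    return selected
```

With a low `lambda_mult`, the second pick skips a near-duplicate of the first chunk in favour of a more diverse one, which is why MMR tends to give the agent broader grounding context than plain top-k similarity.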

Tech Stack

| Layer | Technology | Purpose |
|---|---|---|
| LLM | OpenAI GPT-4o | Answer generation |
| Orchestration | LangChain ReAct | Agentic retrieval & reasoning |
| Embeddings | OpenAI text-embedding-3-small | Document vectorisation |
| Vector DB (dev) | ChromaDB | Local persistent store |
| Vector DB (prod) | Pinecone | Serverless scalable store |
| Backend | FastAPI | REST API |
| Frontend | Streamlit | UI + evaluation dashboard |
| Containers | Docker + Compose | Deployment |
| Cloud Storage | AWS S3 | PDF storage |
| Evaluation | RAGAS | RAG quality metrics |

Quick Start

Prerequisites

  • Python 3.11+
  • Docker + Docker Compose
  • OpenAI API key

1. Clone and configure

git clone https://github.com/yourusername/medquery.git
cd medquery

# Copy and fill in your API keys
cp backend/.env.example backend/.env

Edit backend/.env:

OPENAI_API_KEY=sk-your-key-here
USE_PINECONE=false          # use ChromaDB locally

2. Run with Docker

docker-compose up --build

3. Run locally (without Docker)

# Backend
cd backend
pip install -r requirements.txt
uvicorn app.main:app --reload

# Frontend (new terminal)
cd frontend
pip install -r requirements.txt
streamlit run app.py

API Reference

Upload Document

POST /api/documents/upload
Content-Type: multipart/form-data

file: <PDF file>
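A standard-library-only sketch of building the `multipart/form-data` body for this endpoint (the `http://localhost:8000` base URL is an assumption; any HTTP client, e.g. `curl -F "file=@doc.pdf"`, works equally well):

```python
import uuid

def build_multipart(field_name, filename, content, content_type="application/pdf"):
    """Return (body, content_type_header) for a single-file multipart upload."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return head + content + tail, f"multipart/form-data; boundary={boundary}"

# Sending (requires the API to be running):
# import urllib.request
# body, ctype = build_multipart("file", "guideline.pdf", open("guideline.pdf", "rb").read())
# req = urllib.request.Request("http://localhost:8000/api/documents/upload",
#                              data=body, headers={"Content-Type": ctype})
# urllib.request.urlopen(req)
```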

Query

POST /api/query/
Content-Type: application/json

{
  "question": "What are the contraindications for this medication?",
  "doc_id": "optional-filter-by-doc"
}
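The query body can be built and sent with the standard library alone; `doc_id` is omitted when you want to search across all ingested documents (the base URL below is an assumption):

```python
import json

def build_query(question, doc_id=None):
    """JSON body for POST /api/query/; doc_id is an optional per-document filter."""
    payload = {"question": question}
    if doc_id is not None:
        payload["doc_id"] = doc_id
    return json.dumps(payload).encode()

# Sending (requires the API to be running):
# import urllib.request
# req = urllib.request.Request("http://localhost:8000/api/query/",
#                              data=build_query("What are the contraindications?"),
#                              headers={"Content-Type": "application/json"})
# print(urllib.request.urlopen(req).read().decode())
```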

Run RAGAS Evaluation

POST /api/evaluation/run
Content-Type: application/json

{
  "question": "What is the recommended dosage?",
  "ground_truth": "The recommended dosage is 500mg twice daily."
}

Get Evaluation Summary

GET /api/evaluation/summary

RAGAS Metrics

| Metric | Description | Target |
|---|---|---|
| Faithfulness | Is the answer supported by retrieved context? | > 0.85 |
| Answer Relevancy | Does the answer address the question? | > 0.80 |
| Context Precision | Are retrieved chunks actually relevant? | > 0.75 |
| Context Recall | Were all relevant pieces retrieved? | > 0.75 |
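To make the faithfulness idea concrete: it is the fraction of answer statements that the retrieved context supports. The toy version below approximates "supported" with crude word overlap; RAGAS itself uses an LLM judge to decompose and verify claims, so this is an illustration of the metric's shape, not how RAGAS computes it:

```python
import re

def toy_faithfulness(answer, contexts, support_threshold=0.5):
    """Fraction of answer sentences whose words mostly appear in the context.

    Illustrative simplification only; real faithfulness scoring is LLM-based.
    """
    context_words = set(re.findall(r"\w+", " ".join(contexts).lower()))
    sentences = [s for s in re.split(r"[.!?]+", answer) if s.strip()]
    if not sentences:
        return 0.0
    supported = 0
    for sentence in sentences:
        words = set(re.findall(r"\w+", sentence.lower()))
        if words and len(words & context_words) / len(words) >= support_threshold:
            supported += 1
    return supported / len(sentences)
```

An answer restating the retrieved dosage scores 1.0, while a hallucinated claim with no contextual support scores 0.0, which is the behaviour the > 0.85 faithfulness target is guarding.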

Production Deployment (AWS EC2)

# On your EC2 instance (Ubuntu 22.04)
sudo apt update && sudo apt install docker.io docker-compose -y

git clone https://github.com/yourusername/medquery.git
cd medquery

# Set production env
cp backend/.env.example backend/.env
# Edit .env: USE_PINECONE=true, add Pinecone + AWS keys

docker-compose up -d --build

Important Notes

  • MedQuery is a research and portfolio project. It is not validated for clinical use.
  • Never upload real patient data. Use de-identified or synthetic documents only.
  • API keys should never be committed to version control.

Author: Jesmine Ting, 2026
