A RAG (Retrieval-Augmented Generation) powered assistant that lets you chat with your own documents. Upload PDFs, text files, or markdown notes and get AI answers with exact source citations. A excellent local file reader using RAG framework.
- Upload PDF, TXT, and Markdown documents
- Vector search using ChromaDB + sentence transformers
- Conversational chat with full context memory
- Source citations with relevance scores on every answer
- Drag and drop file upload
- Document management — add and remove from knowledge base
- Backend: Python, Flask, SQLAlchemy
- Vector DB: ChromaDB with all-MiniLM-L6-v2 embeddings
- AI: Groq (Llama 3.3 70B)
- Frontend: Vanilla HTML/CSS/JS
- Clone the repo
python -m venv venvthen activatepip install -r requirements.txt- Create
.env: python app.py- Open
http://localhost:5002
- Documents are chunked into 500-word segments with 50-word overlap
- Each chunk is embedded using sentence-transformers (all-MiniLM-L6-v2)
- Embeddings stored in ChromaDB vector database
- On each question, top 5 most relevant chunks are retrieved by cosine similarity
- Retrieved chunks + question sent to Groq LLM for synthesized answer with citations