Skip to content
#

retrieval-augmented-generation-rag

Here are 80 public repositories matching this topic...

Client-side retrieval firewall for RAG systems — blocks prompt injection and secret leaks, re-ranks stale or untrusted content, and keeps all data inside your environment.

  • Updated Sep 4, 2025
  • Python

AI-Rag-ChatBot is a complete project example with RAGChat and Next.js 14, using Upstash Vector Database, Upstash Qstash, Upstash Redis, Dynamic Webpage Folder, Middleware, Typescript, Vercel AI SDK for the Client side Hook, Lucide-React for Icon, Shadcn-UI, Next-UI Library Plugin to modify TailwindCSS and deploy on Vercel.

  • Updated Jul 10, 2025
  • TypeScript
RAG-LCC

A hands‑on RAG experimentation lab. Largely configurable with debug insights. Classification‑driven corpus construction, filter chains, document loading, chat interaction, Open WebUI integration. Experimental by design and not production‑ready.

  • Updated May 28, 2026
  • Python

An advanced, fully local, and GPU-accelerated RAG pipeline. Features a sophisticated LLM-based preprocessing engine, state-of-the-art Parent Document Retriever with RAG Fusion, and a modular, Hydra-configurable architecture. Built with LangChain, Ollama, and ChromaDB for 100% private, high-performance document Q&A.

  • Updated Aug 11, 2025
  • Python

🩺 RAGnosis — An AI-powered clinical reasoning assistant that retrieves real diagnostic notes (from MIMIC-IV-Ext-DiReCT) and generates explainable medical insights using Mistral-7B & FAISS, wrapped in a clean Gradio UI. ⚡ GPU-ready, explainable, and open-source.

  • Updated Jul 12, 2025
  • Jupyter Notebook

A doctor-assistive AI system that interprets medical knowledge and patient images simultaneously. It utilizes a Dual-Encoder architecture to cross-reference textbook theory with visual pathology, generating clinically grounded diagnoses.

  • Updated Dec 13, 2025
  • Python

Enterprise Agentic RAG platform built with FastAPI, LangGraph, OpenAI, PostgreSQL, Qdrant Cloud, Railway, and Streamlit. Features multi-agent orchestration, hybrid retrieval (Vector Search + BM25), CrossEncoder reranking, conversational memory, tool-calling, web-search augmentation, observability, and cloud deployment.

  • Updated Jun 3, 2026
  • Python

Enterprise-grade local RAG API assistant backend running on traditional Azure CPU infrastructure. Serves a quantized Llama 3.2 GGUF model via llama-server with offline ChromaDB ingestion and stateful MySQL transaction logging.

  • Updated May 31, 2026
  • Python

Improve this page

Add a description, image, and links to the retrieval-augmented-generation-rag topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the retrieval-augmented-generation-rag topic, visit your repo's landing page and select "manage topics."

Learn more