A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.
-
Updated
Nov 26, 2025 - Python
A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.
Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.
🤖 Production-ready samples for building multi-modal AI agents that understand images, documents, videos, and text using Amazon Bedrock and Strands Agents. Features Claude integration, MCP tools, streaming responses, and enterprise-grade architecture.
RAG Gateway Service 🚪🤖: FastAPI gateway that auto-detects query topics using OpenAI embeddings 🧠🔍 and routes requests to topic-specific RAG agents 🎯, with fallback support and Docker-ready 🚀🐳.
Turn any LLM into a self-extending knowledge agent powered by a graph-structured memory - complete with PDF-to-graph ingestion, budget-aware optimisation, and dual-engine orchestration.
This project implements a Retrieval-Augmented Generation (RAG) based chatbot designed to handle university-related queries using natural language understanding. It combines semantic search with generative AI to provide precise, context-aware answers to students, faculty, and visitors.
Experimenting with different kinds of RAGs Systems
The course provides a comprehensive guide to optimizing retrieval systems in large-scale RAG applications. It covers tokenization, vector quantization, and search optimization techniques to enhance search quality, reduce memory usage, and balance performance in vector search systems.
🚀 Complete AI Development Toolkit Template - Add RAG, MCP, and AI assistance to any project in 2 minutes
Training Data Generator for SPLADE Model Fine-tuning
Production-grade RAG system for Singapore government documents with OpenAI integration
This repository covers extensive tutorials on how to integrate LangSmith with LangChain with LangGraph to incorporate observability, monitoring, alerting, evaluation, etc. within complex LLM workflows and applications.
Advanced Retrieval-Augmented Generation system supporting multimodal document processing (text, tables, images) with multiple reasoning strategies and comprehensive evaluation framework.
This project processes and retrieves information from PDF file or PDF collection. It leverages Qdrant as a vector database for similarity searches and employs a Retrieval-Augmented Generation (RAG).
Implements a Retrieval-Augmented Generation (RAG) system.
Four Tests Standard (4TS) - Vendor-neutral specification for verifiable AI governance
AI_Security_Engineers_Roadmap
⚡ Generate dynamic CRUD and Auth routes effortlessly with FastAPI Auto Routes for SQLModel—no repetitive boilerplate needed.
Add a description, image, and links to the rag-systems topic page so that developers can more easily learn about it.
To associate your repository with the rag-systems topic, visit your repo's landing page and select "manage topics."