A comprehensive open-source framework for building production-ready Retrieval-Augmented Generation (RAG) systems. This blueprint simplifies the development of RAG applications while providing full control over performance, resource usage, and evaluation capabilities.
The open-source community offers a wide range of RAG-related frameworks, each focused on a specific area such as monitoring, visualization, or processing. While building or buying RAG systems has become increasingly accessible, deploying them as production-ready data products remains challenging. Our framework bridges this gap by providing a streamlined development experience with easy configuration and customization options, while maintaining complete oversight of performance and resource usage.
It comes with built-in monitoring and observability tools for better troubleshooting, integrated LLM-based metrics for evaluation, and human feedback collection capabilities. Whether you're building a lightweight knowledge base or an enterprise-grade application, this blueprint offers the flexibility and scalability needed for production deployments.

Figure 1: High-level architecture of the RAG Blueprint framework showing the main components and data flow
- Multiple Knowledge Base Integration: Seamless extraction from several data sources (Confluence, Notion, PDF)
- Wide Model Support: Numerous embedding and language models available
- Vector Search: Efficient similarity search using vector stores
- Interactive Chat: User-friendly Chainlit interface for querying the knowledge base
- Performance Monitoring: Query and response tracking with Langfuse
- Evaluation: Comprehensive evaluation metrics using RAGAS
- Flexible Setup: Easy and configurable pipeline setup
Python • LlamaIndex • Chainlit • Langfuse • RAGAS
Notion • Confluence • PDF files • BundestagMine
VoyageAI • OpenAI • Hugging Face
LiteLLM - Availability of many LLMs via providers like OpenAI, Google or Anthropic as well as local LLMs
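As a rough illustration of what LiteLLM enables, the sketch below sends the same prompt to a hosted and a local model; the model names, API keys, and Ollama setup are placeholders, not part of this blueprint's configuration.

```python
from litellm import completion

messages = [{"role": "user", "content": "Summarize the onboarding guide."}]

# Hosted provider (expects OPENAI_API_KEY in the environment)
hosted = completion(model="gpt-4o-mini", messages=messages)

# Local model served via Ollama (expects a running Ollama instance)
local = completion(model="ollama/llama3", messages=messages)

print(hosted.choices[0].message.content)
print(local.choices[0].message.content)
```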
Check the detailed Quickstart Setup
- Extraction:
  - Fetches content from the respective data sources
  - Preprocesses the retrieved resources and parses them to Markdown
- Embedding (see the sketch after this list):
  - Applies Markdown-aware splitting
  - Embeds the resulting nodes using the selected embedding model
  - Saves the embeddings in the selected vector store
- Augmentation (see the chat-engine sketch further below):
  - Defines the retrieval and augmentation pipeline, encapsulated in a chat engine
  - Integrates Chainlit as the user interface
  - Integrates Langfuse for observability of user queries and generated responses
- Evaluation:
  - Uses the Chainlit and Langfuse platforms to gather human feedback
  - Employs the Ragas package to evaluate the performance of the current setup
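To make the embedding stage concrete, here is a minimal sketch using LlamaIndex's Markdown-aware node parser; the embedding model, vector store, and document content below are placeholders, since the blueprint selects these via its configuration.

```python
from llama_index.core import Document, VectorStoreIndex
from llama_index.core.node_parser import MarkdownNodeParser
from llama_index.embeddings.openai import OpenAIEmbedding

# Markdown produced by the extraction step (placeholder content)
documents = [Document(text="# Onboarding\n\n## Accounts\nRequest access via the IT portal.")]

# Markdown-aware splitting into nodes
nodes = MarkdownNodeParser().get_nodes_from_documents(documents)

# Embed the nodes and index them (in-memory here; the blueprint
# writes to the configured vector store instead)
index = VectorStoreIndex(nodes, embed_model=OpenAIEmbedding(model="text-embedding-3-small"))
```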
For more information, refer to the dedicated READMEs of the Extraction, Embedding, Augmentation, and Evaluation modules.
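The following sketch roughly illustrates the augmentation stage: a retrieval-augmented chat engine built over an index. The Chainlit and Langfuse wiring is omitted, and the model and documents are placeholders.

```python
from llama_index.core import Document, VectorStoreIndex
from llama_index.llms.openai import OpenAI

# Tiny in-memory index standing in for the embedded knowledge base
index = VectorStoreIndex.from_documents(
    [Document(text="# Onboarding\n\nAccounts are requested via the IT portal.")]
)

# Chat engine that retrieves context before answering
chat_engine = index.as_chat_engine(
    chat_mode="condense_plus_context",
    llm=OpenAI(model="gpt-4o-mini"),          # placeholder model choice
    system_prompt="Answer strictly from the retrieved context.",
)
print(chat_engine.chat("How do I request an account?"))
```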
For the user interface the codebase uses Chainlit, which is integrated with Langfuse for observability and tracing of the system. This integration also enables building evaluation datasets from user feedback on the system's answers. Feedback is saved in Langfuse datasets and later used by the Evaluation module.
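A minimal sketch of an offline evaluation run, assuming the classic Ragas `evaluate()` API; in the blueprint the records would come from Langfuse datasets rather than being hard-coded as below.

```python
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import answer_relevancy, context_precision, faithfulness

# Placeholder records; in practice these are assembled from Langfuse traces and feedback
eval_dataset = Dataset.from_dict({
    "question": ["How do I request an account?"],
    "answer": ["Accounts are requested via the IT portal."],
    "contexts": [["Accounts are requested via the IT portal."]],
    "ground_truth": ["Submit a request through the IT portal."],
})

result = evaluate(eval_dataset, metrics=[faithfulness, answer_relevancy, context_precision])
print(result)
```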
```
.
├── build/              # Build and deployment scripts
│   └── workstation/    # Build scripts for workstation setup
├── configurations/     # Configuration and secrets files
├── data/               # Data for local testing
├── res/                # Assets
├── src/                # Source code
│   ├── augmentation/   # Chainlit, Langfuse, and RAG processing components
│   ├── core/           # Base package
│   ├── extraction/     # Data sources extraction
│   ├── embedding/      # Data embedding
│   └── evaluate/       # Evaluation system
└── tests/              # Unit tests
```
For detailed documentation on setup, configuration, and development: