StaffConnect: Multi-Agent SQL Analytics Pipeline (Open WebUI + LangGraph)

A professional-grade, agentic blueprint for enterprise-level Text-to-SQL analytics, built on the Open WebUI Pipelines framework. This system uses LangGraph to orchestrate specialized LLM agents that generate safe SQL, query enterprise databases, run higher-level analytics (trends/anomalies), and return a structured response with optional matplotlib visualizations.

Architecture Overview

Host Framework: Open WebUI Pipelines
Orchestration: LangGraph (Router → Experts → Aggregation)
Pipeline Type: Manifold (Exposes multiple model IDs like gpt-4o-mini, o3-mini)
Database: Agnostic (Oracle, PostgreSQL, SQL Server via SQLAlchemy)

Who this is for

AI Engineers building agentic analytics over enterprise databases
Data/Platform teams that want a modular Text-to-SQL stack with guardrails
Anyone who wants a clean reference architecture for Router → Experts → Aggregation → Visualization

Key Concepts

Open WebUI Manifold Pipeline

This project is implemented as a Manifold Pipeline. Unlike a standard pipe, a manifold allows a single backend script to register multiple models in the Open WebUI interface. In this case, it exposes different LLM "brains" for the same SQL analytics engine.

Router-Expert Pattern

The system uses a Router to classify user intent and dispatch the query to one specialized Expert Agent (or sometimes multiple, depending on your graph design). Each expert is optimized for a particular domain:

Audit trail / activity logs
Error log analysis (ELMAH-style)
Trends / time-series analytics
Anomaly detection

LangGraph as the Orchestrator

LangGraph models the workflow as a state machine:

Each node reads/writes to a shared AgentState
Conditional edges route to the right expert agent
You can add loops (repair/refine) when SQL fails or outputs are incomplete

High-Level Architecture

graph TD
    UI[Open WebUI Interface] --> Gateway[FastAPI Pipeline Gateway]
    Gateway --> Pipeline[StaffConnect Manifold Pipeline]
    
    subgraph LangGraph Orchestrator
        Pipeline --> Router{Intent Router}
        Router -- "Audit" --> AA[Audit Trail Agent]
        Router -- "Errors" --> EA[ELMAH Agent]
        Router -- "Trends" --> TA[Trend Agent]
        Router -- "Anomaly" --> ANA[Anomaly Agent]
        
        AA & EA & TA & ANA --> Collector[Unified Context Collector]
    end

    Collector --> Viz[Visualization Engine]
    Viz --> UI

Repository Layout

Top-level folders and files in this repo:

.github/                  # CI workflows and repo automation
blueprints/               # Reusable LangGraph blueprints / templates
docs/                     # Documentation notes / design references
examples/                 # Example queries, scripts, or sample runs
pipelines/                # Core agent graph(s) + agent implementations
  staffconnect_chat.py    # Main Open WebUI Pipeline entry point
  staffconnect_chat_files/# Agent logic, chains, and LangGraph definitions
  common_files/           # Modularized utilities (SQL, Viz, LLM, Logging)
tests/                    # Unit/integration tests
utils/pipelines/          # Framework-level helpers (Auth, Misc, Stream)

config.py                 # Central configuration loader (env, flags)
schemas.py                # Pydantic / typed schemas used across agents
main.py                   # Entry point (FastAPI Gateway serving the pipelines)
docker-compose.yaml       # Container orchestration
Dockerfile                # Container build
env.example               # Example env vars
dev.sh / start.sh         # Convenience scripts
CONTRIBUTING.md           # Contribution guide
LICENSE                   # MIT license

Where to start reading the code

If you’re new to the codebase, read in this order:

main.py → app entry, graph build, execution loop
pipelines/staffconnect_chat.py → Open WebUI Pipeline entry & Gateway wiring
pipelines/staffconnect_chat_files/ → router + agent nodes + graph wiring
schemas.py → state + response contracts
pipelines/common_files/ → SQL cleaning, execution, chart creation
tests/ → what “correct behavior” means

Open WebUI Integration

Valves (Configuration)

Configuration is handled via Open WebUI Valves. You can set these directly in the WebUI settings:

DATABASE_URL: Generic SQLAlchemy connection string.
OPENAI_API_KEY: LLM access.
CHART_DIRECTORY_STAFFCONNECT: Path where generated analytic plots are stored.

Gateway Server

The main.py file at the root acts as a standalone FastAPI server that is OpenAI-API compatible. You can connect Open WebUI to this gateway to host your custom pipelines.

Execution Flow: What Happens on a Query

This is the conceptual runtime pipeline (matches the repo’s structure and file responsibilities):

User query enters the system: (CLI loop, API layer in main.py, or via Open WebUI).
State is initialized: (AgentState) with raw user input, conversation history (optional), and db connection settings.
Router node runs: Classifies intent (audit/errors/trends/anomaly/general sql) and writes state.route = <agent_name>.
Expert agent runs: Loads schema context, generates SQL or analysis plan, executes SQL safely, and returns structured data + explanation.
Collector node aggregates: Normalizes outputs into one common contract and attaches metadata (timing, rows, warnings).
Visualization node optionally runs: If agent output requests a plot (or data looks plottable), it creates chart images / base64 payload.
Final formatter returns: human-readable answer + machine-readable JSON payload + optional chart payload.

Agents

This repo’s modular design supports 4 core expert agents.

1) Audit Trail Agent

Intent: audit/activity investigation Typical outputs:

top events by user/action/time window
suspicious access patterns
drill-down by entity IDs

2) ELMAH Agent

Intent: error logs / failures / stack traces Typical outputs:

error frequency trends
top exception types + root cause hints
correlation with deployments/time windows

3) Trend Agent

Intent: time-series / KPI style analytics Typical outputs:

daily/weekly aggregation
moving averages
seasonality hints
plot-ready series

4) Anomaly Agent

Intent: detecting outliers Typical outputs:

z-score/IQR based outliers
abrupt shifts
rare event spikes
“why flagged” explanations

Implementation note (recommended pattern) Each agent should implement a consistent interface like run(state: AgentState) -> AgentState or invoke(state) -> dict then merged into state. Refer to BaseAgent in pipelines/staffconnect_chat_files/base_agent.py.

State Model

State is the backbone of LangGraph workflows. Refer to schemas.py for the actual schema.

Recommended AgentState fields

query: User input
route: Router decision
messages: Conversation history (optional)
sql: Generated SQL
params: Bound parameters (if supported)
rows: Query results (normalized)
analysis: Computed analytics (trend/anomaly)
chart: Chart payload (path/base64/bytes)
errors: Error stack / LLM failures
warnings: Safety filters triggered
metadata: Timing, rowcount, tokens, model name, etc.

SQL Safety & Guardrails

Text-to-SQL systems fail in predictable ways; this repo includes tests aimed at SQL cleaning + LLM parsing.

Recommended safety constraints

Read-only by default: allow only SELECT.
Block keywords: DROP, TRUNCATE, ALTER, GRANT, REVOKE, etc.
Enforce row limits: (FETCH FIRST N ROWS ONLY).
Enforce schema/table allowlist: (optional).
Parameterize values: whenever possible.
Validate against known schema: (table/column existence).

Common repair loop

If SQL fails:

capture DB error (ORA-xxxxx).
feed error back to the agent (repair prompt).
regenerate SQL with constraints.
retry with capped attempts.

Visualization & Output Contracts

Visualization Engine

Agent returns data + suggested plot type (line/bar/hist).
Viz layer generates matplotlib chart.
Chart is returned inline (base64) or saved to disk (path) depending on configuration in Valves.

Structured Response

Your final response should be both:

Human readable: explanation + key insights.
Machine readable: JSON including route used, SQL executed, rows summary, chart payload, and warnings/errors.

Configuration

Configuration is centralized in config.py and Open WebUI Valves.

Environment variables (recommended)

# LLM
OPENAI_API_KEY=...
OPENAI_MODEL=...

# Database (SQLAlchemy URL)
DATABASE_URL=your_database_url_here

# Runtime
LOG_LEVEL=INFO
MAX_ROWS=500
SQL_READ_ONLY=true

env.example is the canonical list—keep it updated as you add features.

Local Setup

Requirements

Python 3.11+
Appropriate Database Drivers (e.g., Oracle Instant Client, Psycopg2 for Postgres, PyODBC for SQL Server)
An OpenAI API Key

Install

git clone https://github.com/s8m21/Multi-Agent-SQL-Pipeline.git
cd Multi-Agent-SQL-Pipeline
pip install -r requirements.txt

Env Config

cp env.example .env
# edit .env with your values

Connection String Examples

Oracle: oracle+cx_oracle://user:password@host:port/?service_name=service
PostgreSQL: postgresql://user:password@host:port/dbname
SQL Server: mssql+pyodbc://user:password@host:port/dbname?driver=ODBC+Driver+17+for+SQL+Server

Running the App

Option A: Pipeline Gateway (Recommended)

python main.py

Add http://localhost:9099 to Open WebUI as an OpenAI connection.

Option B: Direct Python Execution

For testing purposes, you can execute individual agent scripts or the main entry.

Testing

A test suite is included in tests/.

pytest

Focus on: SQL cleaning functions, Router classification, and Repair loop behavior.

Docker

Build: docker build -t staffconnect-sql-pipeline .
Compose: docker compose up --build

Examples

Check examples/ for runnable demo scripts or sample prompts.

"Show top 10 users by failed logins last 7 days"
"Plot error count by day for the last 30 days"

Extending the System

Add a Agent: Create in pipelines/staffconnect_chat_files/, implement run(state), and add to Router.
Add a Tool: Place in pipelines/common_files/ and keep it pure and safe-by-default.

Troubleshooting

Database client: Ensure the appropriate client library (e.g., Oracle Instant Client) is in your PATH.
LLM parsing: Use strict output prompts or JSON repair passes.
SQL failures: Feed database-specific error codes back to the repair loop.

Contributing

See CONTRIBUTING.md.

License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github		.github
blueprints		blueprints
docs		docs
examples		examples
pipelines		pipelines
tests		tests
utils/pipelines		utils/pipelines
.dockerignore		.dockerignore
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.py		config.py
dev-docker.sh		dev-docker.sh
dev.sh		dev.sh
docker-compose.yaml		docker-compose.yaml
env.example		env.example
main.py		main.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements-minimum.txt		requirements-minimum.txt
requirements.txt		requirements.txt
schemas.py		schemas.py
start.bat		start.bat
start.sh		start.sh

Uh oh!

License

s8m21/Multi-Agent-SQL-Pipeline

Folders and files

Latest commit

History

Repository files navigation

StaffConnect: Multi-Agent SQL Analytics Pipeline (Open WebUI + LangGraph)

Table of Contents

Key Concepts

Open WebUI Manifold Pipeline

Router-Expert Pattern

LangGraph as the Orchestrator

High-Level Architecture

Repository Layout

Where to start reading the code

Open WebUI Integration

Valves (Configuration)

Gateway Server

Execution Flow: What Happens on a Query

Agents

1) Audit Trail Agent

2) ELMAH Agent

3) Trend Agent

4) Anomaly Agent

State Model

Recommended AgentState fields

SQL Safety & Guardrails

Recommended safety constraints

Common repair loop

Visualization & Output Contracts

Visualization Engine

Structured Response

Configuration

Environment variables (recommended)

Local Setup

Requirements

Install

Env Config

Connection String Examples

Running the App

Option A: Pipeline Gateway (Recommended)

Option B: Direct Python Execution

Testing

Docker

Examples

Extending the System

Troubleshooting

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Languages

Packages