NyRAG (pronounced knee-RAG) is a simple tool for building RAG applications: it crawls websites or processes documents, then deploys to Vespa for hybrid search with an integrated chat UI.
When a user asks a question, NyRAG performs a multi-stage retrieval process:
- Query Enhancement: An LLM generates additional search queries based on the user's question and initial context to improve retrieval coverage
- Embedding Generation: Each query is converted to embeddings using the configured SentenceTransformer model
- Vespa Search: Queries are executed against Vespa using nearestNeighbor search with the `best_chunk_score` ranking profile to find the most relevant document chunks
- Chunk Fusion: Results from all queries are aggregated, deduplicated, and ranked by score to select the top-k most relevant chunks
- Answer Generation: The retrieved context is sent to an LLM (via OpenRouter) which generates a grounded answer based only on the provided chunks
This multi-query RAG approach with chunk-level retrieval ensures answers are comprehensive and grounded in your actual content, whether from crawled websites or processed documents.
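The fusion step above can be sketched in a few lines of Python. This is an illustrative sketch with hypothetical data shapes, not NyRAG's actual internals: each query's hits are merged, deduplicated by chunk id (keeping the best score), and the top-k survive.

```python
def fuse_chunks(results_per_query, top_k=5):
    """Merge scored chunks from several queries, dedupe by chunk id,
    and return the top_k highest-scoring (chunk_id, score, text) tuples."""
    best = {}
    for results in results_per_query:
        for chunk_id, score, text in results:
            # Keep only the best score seen for each chunk.
            if chunk_id not in best or score > best[chunk_id][0]:
                best[chunk_id] = (score, text)
    ranked = sorted(best.items(), key=lambda kv: kv[1][0], reverse=True)
    return [(cid, score, text) for cid, (score, text) in ranked[:top_k]]

fused = fuse_chunks([
    [("doc1#0", 0.91, "chunk A"), ("doc2#3", 0.55, "chunk B")],
    [("doc1#0", 0.70, "chunk A"), ("doc3#1", 0.80, "chunk C")],
], top_k=2)
# keeps doc1#0 (best score 0.91) and doc3#1 (0.80)
```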
```
pip install nyrag
```

We recommend uv:

```
uv init --python 3.10
uv venv
uv sync
source .venv/bin/activate
uv pip install -U nyrag
```

For development:

```
git clone https://github.com/abhishekkrthakur/nyrag.git
cd nyrag
pip install -e .
```

nyrag operates in two deployment modes (Local or Cloud) and two data modes (Web or Docs):
| Deployment | Data Mode | Description |
|---|---|---|
| Local | Web | Crawl websites → Local Vespa Docker |
| Local | Docs | Process documents → Local Vespa Docker |
| Cloud | Web | Crawl websites → Vespa Cloud |
| Cloud | Docs | Process documents → Vespa Cloud |
Runs Vespa in a local Docker container. Great for development and testing.
```
export NYRAG_LOCAL=1
nyrag --config configs/example.yml
```

Example config for web crawling:

```yaml
name: mywebsite
mode: web
start_loc: https://example.com/
exclude:
  - https://example.com/admin/*
  - https://example.com/private/*
crawl_params:
  respect_robots_txt: true
  follow_subdomains: true
  user_agent_type: chrome
rag_params:
  embedding_model: sentence-transformers/all-MiniLM-L6-v2
  chunk_size: 1024
  chunk_overlap: 50
```

To process documents instead:

```
export NYRAG_LOCAL=1
nyrag --config configs/doc_example.yml
```

Example config for document processing:
```yaml
name: mydocs
mode: docs
start_loc: /path/to/documents/
exclude:
  - "*.csv"
doc_params:
  recursive: true
  file_extensions:
    - .pdf
    - .docx
    - .txt
    - .md
rag_params:
  embedding_model: sentence-transformers/all-mpnet-base-v2
  chunk_size: 512
  chunk_overlap: 50
```

After crawling/processing is complete:

```
export NYRAG_CONFIG=configs/example.yml
export OPENROUTER_API_KEY=your-api-key
export OPENROUTER_MODEL=openai/gpt-5.1
uvicorn nyrag.api:app --host 0.0.0.0 --port 8000
```

Open http://localhost:8000/chat
Deploys to Vespa Cloud for production use.
```
export NYRAG_LOCAL=0
export VESPA_CLOUD_TENANT=your-tenant
nyrag --config configs/example.yml
```

To process documents instead:

```
export NYRAG_LOCAL=0
export VESPA_CLOUD_TENANT=your-tenant
nyrag --config configs/doc_example.yml
```

After crawling/processing is complete:

```
export NYRAG_CONFIG=configs/example.yml
export VESPA_URL="https://<your-endpoint>.z.vespa-app.cloud"
export OPENROUTER_API_KEY=your-api-key
export OPENROUTER_MODEL=openai/gpt-5.1
uvicorn nyrag.api:app --host 0.0.0.0 --port 8000
```

Open http://localhost:8000/chat
| Parameter | Type | Default | Description |
|---|---|---|---|
| `respect_robots_txt` | bool | `true` | Respect robots.txt rules |
| `aggressive_crawl` | bool | `false` | Faster crawling with more concurrent requests |
| `follow_subdomains` | bool | `true` | Follow links to subdomains |
| `strict_mode` | bool | `false` | Only crawl URLs matching start pattern |
| `user_agent_type` | str | `chrome` | One of `chrome`, `firefox`, `safari`, `mobile`, `bot` |
| `custom_user_agent` | str | `None` | Custom user agent string |
| `allowed_domains` | list | `None` | Explicitly allowed domains |
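The `respect_robots_txt` flag follows standard robots.txt semantics. Python's standard library illustrates the check a compliant crawler performs (a sketch of the convention, not NyRAG's internals):

```python
from urllib.robotparser import RobotFileParser

# Parse a robots.txt body directly (a crawler normally fetches it
# from the site root before requesting any page).
rp = RobotFileParser()
rp.parse("""User-agent: *
Disallow: /admin/
""".splitlines())

rp.can_fetch("*", "https://example.com/admin/users")  # disallowed -> False
rp.can_fetch("*", "https://example.com/blog/post")    # allowed -> True
```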
| Parameter | Type | Default | Description |
|---|---|---|---|
| `recursive` | bool | `true` | Process subdirectories |
| `include_hidden` | bool | `false` | Include hidden files |
| `follow_symlinks` | bool | `false` | Follow symbolic links |
| `max_file_size_mb` | float | `None` | Max file size in MB |
| `file_extensions` | list | `None` | Only process these extensions |
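The combined effect of these filters can be sketched with `pathlib`. The function name and signature below are illustrative, not NyRAG's API:

```python
from pathlib import Path

def iter_docs(root, file_extensions=None, recursive=True,
              include_hidden=False, max_file_size_mb=None):
    """Yield files under root, applying doc_params-style filters (sketch)."""
    pattern = "**/*" if recursive else "*"
    for p in sorted(Path(root).glob(pattern)):
        if not p.is_file():
            continue
        # Skip hidden files/directories unless explicitly included.
        if not include_hidden and any(
                part.startswith(".") for part in p.relative_to(root).parts):
            continue
        if file_extensions is not None and p.suffix.lower() not in file_extensions:
            continue
        if max_file_size_mb is not None and p.stat().st_size > max_file_size_mb * 1024 * 1024:
            continue
        yield p
```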
| Parameter | Type | Default | Description |
|---|---|---|---|
| `embedding_model` | str | `sentence-transformers/all-MiniLM-L6-v2` | Embedding model |
| `embedding_dim` | int | `384` | Embedding dimension |
| `chunk_size` | int | `1024` | Chunk size for text splitting |
| `chunk_overlap` | int | `50` | Overlap between chunks |
| `distance_metric` | str | `angular` | Distance metric |
| `max_tokens` | int | `8192` | Max tokens per document |
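How `chunk_size` and `chunk_overlap` interact can be sketched with a character-level splitter (the real splitter may work on tokens; this only shows the overlap mechanic):

```python
def chunk_text(text, chunk_size=1024, chunk_overlap=50):
    """Split text into windows of chunk_size characters where each chunk
    repeats the last chunk_overlap characters of the previous one."""
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size]
            for i in range(0, max(len(text) - chunk_overlap, 1), step)]

chunks = chunk_text("abcdefghij" * 10, chunk_size=40, chunk_overlap=10)
# 100 chars -> windows starting at 0, 30, 60; adjacent chunks share 10 chars
```

The overlap means a sentence falling on a chunk boundary is still seen whole in at least one chunk, at the cost of some duplicated text in the index.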
| Variable | Description |
|---|---|
| `NYRAG_LOCAL` | `1` for local Docker, `0` for Vespa Cloud |

| Variable | Description |
|---|---|
| `NYRAG_VESPA_DOCKER_IMAGE` | Docker image (default: `vespaengine/vespa:latest`) |

| Variable | Description |
|---|---|
| `VESPA_CLOUD_TENANT` | Your Vespa Cloud tenant |
| `VESPA_CLOUD_APPLICATION` | Application name (optional) |
| `VESPA_CLOUD_INSTANCE` | Instance name (default: `default`) |
| `VESPA_CLOUD_API_KEY_PATH` | Path to API key file |
| `VESPA_CLIENT_CERT` | Path to mTLS certificate |
| `VESPA_CLIENT_KEY` | Path to mTLS private key |

| Variable | Description |
|---|---|
| `NYRAG_CONFIG` | Path to config file |
| `VESPA_URL` | Vespa endpoint URL (optional for local, required for cloud) |
| `OPENROUTER_API_KEY` | OpenRouter API key for LLM |
| `OPENROUTER_MODEL` | LLM model (e.g., `openai/gpt-4o`) |
