This project is a simple implementation of a RAG system over YouTube data, using LlamaIndex for efficient data retrieval and Qdrant or Chroma as the VectorDB to store and search the vectors. It also includes an optional Web Research Workflow that leverages real-time web data.
- Replace `<query>` with your query and `<youtube_url>` with the YouTube URL.

  ```bash
  yt-dlp -f bestaudio --extract-audio --audio-format mp3 <youtube_url> -o "audio/audio.mp3"
  cd src/rag
  uv run whisper.py  # Stop here if you only want to load the data
  uv run rag.py --query <query> --path "../../qdrant" --collection "yt" --qdrant
  ```
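  For context, `whisper.py` presumably transcribes the downloaded audio before it is chunked and embedded. A minimal sketch of that step, assuming the `openai-whisper` package and a `base` model size (both assumptions; the actual script may differ):

  ```python
  # Minimal sketch of the transcription step (assumes the openai-whisper
  # package; the model size and output handling are assumptions).
  import whisper

  model = whisper.load_model("base")
  result = model.transcribe("audio/audio.mp3")  # the file produced by yt-dlp above
  print(result["text"])  # transcript to be chunked and embedded by the RAG loader
  ```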
- Explanation of args for `rag.py` (a sketch of how they might be parsed follows below):
  - `--query`: The query you want to search for
  - `--path`: The path to the VectorDB on disk
  - `--collection`: The collection name in the VectorDB
  - `--qdrant`: Use Qdrant as the VectorDB (default)
  - `--chroma`: Use Chroma as the VectorDB
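  A minimal sketch of how these flags could be wired up with `argparse` (illustrative only; the actual `rag.py` may differ):

  ```python
  # Sketch of rag.py's CLI, matching the flags documented above.
  import argparse

  parser = argparse.ArgumentParser(description="Query the RAG system")
  parser.add_argument("--query", required=True, help="The query to search for")
  parser.add_argument("--path", default="../../qdrant", help="Path to the VectorDB on disk")
  parser.add_argument("--collection", default="yt", help="Collection name in the VectorDB")
  backend = parser.add_mutually_exclusive_group()
  backend.add_argument("--qdrant", action="store_true", help="Use Qdrant as the VectorDB (default)")
  backend.add_argument("--chroma", action="store_true", help="Use Chroma as the VectorDB")
  args = parser.parse_args()
  ```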
- If using Qdrant (note: Pinecone is not supported yet), copy your `QDRANT_API_KEY` and `QDRANT_URL` to the `.env` file:

  ```bash
  cp .env.example .env
  ```
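  A sketch of how those credentials might then be read at runtime, assuming `python-dotenv` and `qdrant-client` are installed (the variable names match the `.env` keys above):

  ```python
  # Sketch: load Qdrant credentials from .env (assumes python-dotenv and
  # qdrant-client; the repo's own loading code may differ).
  import os
  from dotenv import load_dotenv
  from qdrant_client import QdrantClient

  load_dotenv()  # reads QDRANT_API_KEY and QDRANT_URL from .env
  client = QdrantClient(
      url=os.environ["QDRANT_URL"],
      api_key=os.environ["QDRANT_API_KEY"],
  )
  ```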
- This workflow adds RAG to the workflow implemented in Ollama Deep Researcher; see that project for more details.
- By default, the RAG system answers the query using the `hf_docs` dataset.
- Modified to use DuckDuckGo as the search API (see the sketch below).
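  A sketch of what a DuckDuckGo search call could look like, assuming the `duckduckgo_search` package (which client library the workflow actually wraps is an assumption):

  ```python
  # Sketch: web search via the duckduckgo_search package (an assumption
  # about the client library; the workflow's wrapper may differ).
  from duckduckgo_search import DDGS

  with DDGS() as ddgs:
      results = ddgs.text("Model Context Protocol", max_results=5)

  for r in results:
      print(r["title"], r["href"])  # each result also carries a "body" snippet
  ```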
- Graph Workflow:
- Spin up the Ollama server:

  ```bash
  ollama serve
  ```

  NOTE: Pull the model you want first, for example:

  ```bash
  ollama pull deepseek-r1:8b
  ```
- See Ollama Deep Researcher for details on the environment variables.

  ```bash
  cp .env.example .env
  ```
- If you want to use your YouTube data as the dataset for the RAG system, follow the steps in the RAG-Only Usage section to load the data first. DON'T run the `rag.py` script.
- Run the workflow:

  ```bash
  uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.11 langgraph dev
  ```

  NOTE: In `graph.py`, in the `rag_research` function, see the comments if you want to use mock RAG data instead of the real data (a hypothetical sketch of such a toggle follows below).
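  A hypothetical sketch of what such a mock-data switch can look like; apart from the `rag_research` name, everything here is illustrative, not the repo's actual code:

  ```python
  # Hypothetical sketch of the mock-data switch in graph.py's rag_research
  # node; query_rag and the state keys are illustrative names.
  USE_MOCK_RAG = False  # flip to True to exercise the graph without a VectorDB

  def query_rag(topic: str) -> str:
      """Illustrative stand-in for the real retrieval call."""
      raise NotImplementedError

  def rag_research(state: dict) -> dict:
      if USE_MOCK_RAG:
          return {"rag_results": "canned context for testing the graph"}
      return {"rag_results": query_rag(state["research_topic"])}
  ```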
- RAG-Only Usage: uses the HF Docs dataset.

  ```bash
  cd src/rag
  uv run hf_docs.py
  uv run rag.py --query "How to create a pipeline object?" --path "../../qdrant" --collection "hf_docs" --qdrant
  ```

  See `llama3.1_hf_qdrant.txt` for the output.
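  Under the hood, the query step plausibly looks something like the following LlamaIndex sketch; the embedding model id, the Ollama LLM choice, and the hybrid settings are assumptions based on the Embeddings/VectorDBs notes below, not the repo's exact code:

  ```python
  # Sketch: querying an on-disk Qdrant collection with LlamaIndex
  # (model ids and hybrid settings are assumptions).
  from llama_index.core import Settings, VectorStoreIndex
  from llama_index.embeddings.huggingface import HuggingFaceEmbedding
  from llama_index.llms.ollama import Ollama
  from llama_index.vector_stores.qdrant import QdrantVectorStore
  from qdrant_client import QdrantClient

  Settings.embed_model = HuggingFaceEmbedding(model_name="thenlper/gte-small")
  Settings.llm = Ollama(model="llama3.1")  # matches the llama3.1_hf_qdrant.txt output

  client = QdrantClient(path="../../qdrant")  # on-disk Qdrant at --path
  vector_store = QdrantVectorStore(collection_name="hf_docs", client=client, enable_hybrid=True)
  index = VectorStoreIndex.from_vector_store(vector_store)
  query_engine = index.as_query_engine(vector_store_query_mode="hybrid")
  print(query_engine.query("How to create a pipeline object?"))
  ```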
- Web Research Workflow:
  - Uses the `deepseek-r1:8b` model:

    ```bash
    ollama pull deepseek-r1:8b
    ollama serve
    uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.11 langgraph dev
    ```

  - Prompt 1: What's Model Context Protocol?
    - See `output_What's Model Context Protocol?.md` for the output.
  - Prompt 2: What are the FAANG companies?
    - See `output_What are the FAANG companies?.md` for the output.
  - Prompt 3: How to create a custom huggingface pipeline object?
    - See `output_How to create a custom huggingface pipeline object?.md` for the output.
- Embeddings (loaded from HuggingFace):
  - Dense vectors: gte-small
  - Sparse vectors: Splade_PP_en_v1
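  A sketch of loading these two models, assuming the ids `thenlper/gte-small` for the dense model and FastEmbed's `prithvida/Splade_PP_en_v1` for the sparse one (both ids are assumptions):

  ```python
  # Sketch: dense embeddings via llama-index's HuggingFace wrapper, sparse
  # embeddings via fastembed (model ids are assumptions).
  from fastembed import SparseTextEmbedding
  from llama_index.embeddings.huggingface import HuggingFaceEmbedding

  dense_model = HuggingFaceEmbedding(model_name="thenlper/gte-small")
  dense_vec = dense_model.get_text_embedding("hello world")
  print(len(dense_vec))  # gte-small produces 384-dim dense vectors

  sparse_model = SparseTextEmbedding(model_name="prithvida/Splade_PP_en_v1")
  sparse_vec = next(iter(sparse_model.embed(["hello world"])))
  print(sparse_vec.indices[:5], sparse_vec.values[:5])  # token-weighted sparse vector
  ```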
- VectorDBs:
  - Qdrant:
    - Supports hybrid vectors (dense + sparse)
    - Note: sparse vectors default to `prithvida/Splade_PP_en_v1`
  - Chroma:
    - Dense vectors: Chroma
    - Sparse vectors: BM25
    - Supports hybrid vectors (dense + sparse), see the sketch below
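  Since Chroma stores only dense vectors, hybrid retrieval there plausibly pairs it with a BM25 retriever and fuses the results. A sketch under those assumptions (the data directory, paths, and fusion settings are illustrative):

  ```python
  # Sketch: Chroma (dense) + BM25 (sparse) hybrid retrieval in LlamaIndex;
  # the "data" folder and Chroma path are illustrative.
  import chromadb
  from llama_index.core import Settings, SimpleDirectoryReader, StorageContext, VectorStoreIndex
  from llama_index.core.retrievers import QueryFusionRetriever
  from llama_index.embeddings.huggingface import HuggingFaceEmbedding
  from llama_index.retrievers.bm25 import BM25Retriever
  from llama_index.vector_stores.chroma import ChromaVectorStore

  Settings.embed_model = HuggingFaceEmbedding(model_name="thenlper/gte-small")

  docs = SimpleDirectoryReader("data").load_data()  # hypothetical docs folder
  nodes = Settings.node_parser.get_nodes_from_documents(docs)

  collection = chromadb.PersistentClient(path="../../chroma").get_or_create_collection("docs")
  vector_store = ChromaVectorStore(chroma_collection=collection)
  storage_context = StorageContext.from_defaults(vector_store=vector_store)
  index = VectorStoreIndex(nodes, storage_context=storage_context)

  dense = index.as_retriever(similarity_top_k=5)                         # Chroma-backed dense search
  sparse = BM25Retriever.from_defaults(nodes=nodes, similarity_top_k=5)  # keyword-based sparse search
  hybrid = QueryFusionRetriever([dense, sparse], similarity_top_k=5, num_queries=1)
  print(hybrid.retrieve("How to create a pipeline object?"))
  ```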
- Reranker:
- Language Models (loaded from HuggingFace):