A Retrieval-Augmented Generation (RAG) system built with Go that uses Ollama for embeddings and LLM capabilities, with Chroma as the vector database for document storage and retrieval.
This project implements a document Q&A system with the following components:
- Go Backend: Uses the Gin framework for HTTP routing (see the routing sketch after this list)
- Chroma DB: Vector database running in a Docker container
- Ollama: LLM and embedding model running locally on the host machine
- Docker: Containerizes the application while connecting to local Ollama
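A rough sketch of how the Gin backend might wire up the two endpoints described later in this README; the handler names `uploadDocument` and `searchDocuments` are placeholders, not the repository's actual function names:

```go
package main

import "github.com/gin-gonic/gin"

func main() {
	r := gin.Default()

	// The two routes documented below; handlers are illustrative stubs.
	r.POST("/document", uploadDocument)
	r.POST("/document/search", searchDocuments)

	r.Run() // Gin listens on :8080 by default
}

func uploadDocument(c *gin.Context)  { /* parse form, chunk, embed, store in Chroma */ }
func searchDocuments(c *gin.Context) { /* embed query, retrieve chunks, ask the LLM */ }
```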
The system provides the following features:

- Upload PDF documents for processing and indexing
- Split documents into manageable chunks for embedding
- Generate embeddings using Ollama's embedding models
- Store document chunks and embeddings in Chroma vector database
- Search for relevant information using semantic similarity
- Generate comprehensive answers to questions using retrieved context
To run the project you will need:

- Docker and Docker Compose
- Ollama running locally on your host machine
- Go 1.23+ (only needed for development)
To get the application running:

- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/rag-go-ollama.git
  cd rag-go-ollama
  ```

- Install Ollama locally first (download it from the Ollama website: https://ollama.com).

- Make sure Ollama is running locally with the required models:

  ```bash
  # Install models if you haven't already
  ollama pull gemma3:1b
  ollama pull nomic-embed-text

  # Ensure the Ollama service is running
  ollama serve
  ```

- Start the application using Docker Compose:

  ```bash
  docker compose up -d
  ```
The application uses environment variables for configuration, which are defined in the docker-compose.yml file:
- `CHROMA_URL`: URL to connect to the Chroma database (`http://chroma:8000`)
- `OLLAMA_MODEL`: Model to use for text generation (`gemma3:1b`)
- `OLLAMA_EMBEDDING_MODEL`: Model to use for generating embeddings (`nomic-embed-text`)
- `OLLAMA_HOST`: Host for the Ollama service (`host.docker.internal`)
- `OLLAMA_URL`: URL for the Ollama API (`http://host.docker.internal:11434`)
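A minimal sketch of how these variables might be read at startup; the `getenv` helper, the `Config` struct, and the fallback values are illustrative, not the repository's actual code:

```go
package main

import (
	"fmt"
	"os"
)

// Config collects the environment-driven settings listed above.
type Config struct {
	ChromaURL      string
	OllamaModel    string
	EmbeddingModel string
	OllamaHost     string
	OllamaURL      string
}

// getenv returns the value of key, or fallback when the variable is unset.
func getenv(key, fallback string) string {
	if v := os.Getenv(key); v != "" {
		return v
	}
	return fallback
}

func loadConfig() Config {
	return Config{
		ChromaURL:      getenv("CHROMA_URL", "http://chroma:8000"),
		OllamaModel:    getenv("OLLAMA_MODEL", "gemma3:1b"),
		EmbeddingModel: getenv("OLLAMA_EMBEDDING_MODEL", "nomic-embed-text"),
		OllamaHost:     getenv("OLLAMA_HOST", "host.docker.internal"),
		OllamaURL:      getenv("OLLAMA_URL", "http://host.docker.internal:11434"),
	}
}

func main() {
	cfg := loadConfig()
	fmt.Println("using Ollama at", cfg.OllamaURL)
}
```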
The API exposes two endpoints.

`POST /document`

Form parameters:

- `id`: Document identifier
- `title`: Document title (optional)
- `file`: PDF file to upload
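For example, a document can be uploaded from Go with a standard multipart request. The base URL `http://localhost:8080` is an assumption (Gin's default port); adjust it to match the port mapping in your docker-compose.yml:

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"mime/multipart"
	"net/http"
	"os"
)

func main() {
	var body bytes.Buffer
	w := multipart.NewWriter(&body)

	// Plain form fields: document id and optional title.
	w.WriteField("id", "doc-1")
	w.WriteField("title", "Example PDF")

	// The PDF file itself, sent under the "file" field.
	f, err := os.Open("example.pdf")
	if err != nil {
		panic(err)
	}
	defer f.Close()

	part, err := w.CreateFormFile("file", "example.pdf")
	if err != nil {
		panic(err)
	}
	if _, err := io.Copy(part, f); err != nil {
		panic(err)
	}
	w.Close()

	resp, err := http.Post("http://localhost:8080/document", w.FormDataContentType(), &body)
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()
	fmt.Println("status:", resp.Status)
}
```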
`POST /document/search`

JSON payload:

```json
{
  "query": "Your question about documents here"
}
```

The project uses a singleton connection manager (`conn_manager.go`) to handle connections to the Ollama and Chroma services. This ensures efficient resource usage and provides thread-safe access to those services; a sketch of the pattern follows.
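The actual `conn_manager.go` is not reproduced here; the sketch below shows the standard `sync.Once` singleton pattern it describes, with placeholder client fields standing in for whatever Ollama and Chroma client types the project uses:

```go
package rag

import "sync"

// ConnManager holds shared clients for Ollama and Chroma.
// The concrete client types are placeholders for illustration.
type ConnManager struct {
	mu           sync.Mutex
	ollamaClient any // e.g. an Ollama API client
	chromaClient any // e.g. a Chroma collection handle
}

var (
	manager *ConnManager
	once    sync.Once
)

// GetConnManager returns the process-wide connection manager,
// creating it exactly once in a thread-safe way.
func GetConnManager() *ConnManager {
	once.Do(func() {
		manager = &ConnManager{}
	})
	return manager
}
```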
Documents go through the following pipeline:
- PDF loading using langchaingo's document loader
- Text splitting into chunks of roughly 1,000 characters with a 100-character overlap (a simplified chunking sketch follows this list)
- Embedding generation using Ollama's embedding model
- Storage in Chroma DB with metadata about the source document
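As an illustration of the chunking step, here is a simplified character-based splitter (not the exact splitter the project uses):

```go
package rag

// chunkText splits text into pieces of at most chunkSize characters,
// where consecutive chunks overlap by `overlap` characters. This mirrors
// the project's ~1000-character chunks with a 100-character overlap.
func chunkText(text string, chunkSize, overlap int) []string {
	if chunkSize <= 0 || overlap >= chunkSize {
		return nil
	}
	runes := []rune(text) // work on runes so multi-byte characters are not split
	var chunks []string
	step := chunkSize - overlap
	for start := 0; start < len(runes); start += step {
		end := start + chunkSize
		if end > len(runes) {
			end = len(runes)
		}
		chunks = append(chunks, string(runes[start:end]))
		if end == len(runes) {
			break
		}
	}
	return chunks
}
```

Calling `chunkText(doc, 1000, 100)`, embedding each chunk with the embedding model, and writing chunk, embedding, and source metadata to Chroma corresponds to steps 2–4 of the pipeline.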
When a query is received (a code sketch follows this list):
- The query is embedded using the same embedding model
- Similar documents are retrieved from Chroma using cosine similarity
- Retrieved documents are used as context for the LLM
- The LLM generates a comprehensive answer based on the provided context
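A hedged sketch of that flow; `embedQuery`, `searchChroma`, and `askLLM` are hypothetical helpers standing in for the project's actual Ollama and Chroma calls:

```go
package rag

import (
	"context"
	"fmt"
	"strings"
)

// Placeholder helpers: in the real application these wrap the Ollama
// embedding API, a Chroma similarity query, and the Ollama generate API.
func embedQuery(ctx context.Context, query string) ([]float32, error)                { return nil, nil }
func searchChroma(ctx context.Context, embedding []float32, k int) ([]string, error) { return nil, nil }
func askLLM(ctx context.Context, prompt string) (string, error)                      { return "", nil }

// Answer runs the retrieve-then-generate loop for a single question.
func Answer(ctx context.Context, query string) (string, error) {
	// 1. Embed the query with the same embedding model used for documents.
	vec, err := embedQuery(ctx, query)
	if err != nil {
		return "", err
	}

	// 2. Retrieve the most similar chunks from Chroma (cosine similarity).
	chunks, err := searchChroma(ctx, vec, 4)
	if err != nil {
		return "", err
	}

	// 3. Use the retrieved chunks as context for the LLM.
	prompt := fmt.Sprintf(
		"Answer the question using only the context below.\n\nContext:\n%s\n\nQuestion: %s",
		strings.Join(chunks, "\n---\n"), query)

	// 4. Generate the final answer.
	return askLLM(ctx, prompt)
}
```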
To develop or modify the application:
- Install Go 1.23+
- Clone the repository
- Run `go mod download` to install dependencies
- Make your changes
- Build with `go build -o main .`, or build with Docker:

  ```bash
  docker build -t rag-go-ollama .
  ```
This project demonstrates a well-architected RAG system with:
- Efficient Resource Management: Using connection pooling and singletons
- Docker Integration: Running the application in containers while connecting to host services
- Separation of Concerns: Clear separation between routes, models, and connection management
- Thread Safety: Proper mutex usage for concurrent operations
- Configuration Flexibility: Environment variable-based configuration
- Error Handling: Comprehensive error propagation
Potential improvements could include:
- Adding authentication/authorization
- Supporting more document formats
- Implementing caching for frequently asked questions
- Adding logging and monitoring
- Creating a user interface