Express-Ollama is an open-source example project demonstrating a local LLM (Large Language Model) pipeline built on Ollama: fully offline and ready to integrate with any document, regulation, or law database. This project showcases:
- Automatic parsing and chunking of PDF files
- Local embedding generation using Ollama's embedding model
- Embedding and document storage to SQL database (MySQL/PostgreSQL)
- Full-text search and semantic search (vector similarity)
- Question-answering (QA) API powered by local LLM (Ollama)
- Fully offline: All AI/embedding/QA runs on your machine
- Express.js backend: Simple and easy to extend
- Local LLM & embeddings: no paid API, just Ollama on your Mac/PC/server
- Works with regulations, legal docs, books, or any PDF
- QA API endpoint: `/ask` for intelligent question-answering
- Node.js (18+ recommended)
- MySQL or PostgreSQL
- Ollama (installed locally)
- Docker (recommended for running database/vector DB)
- Git
```
git clone https://github.com/NeaByteLab/Express-Ollama.git
cd Express-Ollama
npm install
```

Download and install Ollama from https://ollama.com/download
Run Ollama:

```
ollama serve
```

Pull the latest LLM & embedding models:

```
ollama pull llama3.2
ollama pull nomic-embed-text
```

Edit `.env` with your database connection, for example:
```
DB_HOST=localhost
DB_USER=root
DB_PASSWORD=yourpassword
DB_NAME=express_ollama
OLLAMA_URL=http://localhost:11434
PORT=8080
```
Run the migrations:

```
npx knex migrate:latest --knexfile knexfile.js
```

Run the main script:
```
node index.js
```

- Place your PDF files in the `dataset/` folder.
- No need for a watcher service: just add the files and run the service to (re)process your dataset.
- The service will automatically scan and process all PDF files in `dataset/` on start.
POST to http://localhost:8080/ask with JSON:

```json
{ "query": "What are the main points of the Government Regulation on ..." }
```
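A minimal Node 18+ client might look like the following. Only the URL, method, and JSON request body come from this README; the shape of the response object is an assumption:

```javascript
// Build the fetch options for a POST to /ask with a JSON body.
function buildAskRequest(query) {
  return {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ query })
  }
}

// Send the question and return the parsed JSON response.
async function ask(query) {
  const res = await fetch('http://localhost:8080/ask', buildAskRequest(query))
  return res.json()
}
```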
```
├── config/
│   └── db.js
├── dataset/
│   └── [your-pdf-files.pdf]
├── db/
│   └── migrations/
│       └── 001_init.js
├── helpers/
│   ├── ollama.js
│   └── parsePdf.js
├── index.js
├── knexFile.js
├── package.json
├── service/
│   └── pipeline.js
```
1. **PDF Ingestion**
   - Place PDF files into the `dataset/` folder.
   - The main service (`index.js`) scans and processes all PDF files on startup.
   - No background watcher required: add files to `dataset/` before running, or rerun the service after adding new files.
2. **PDF Parsing & Chunking**
   - Each PDF is parsed using the helper in `helpers/parsePdf.js`.
   - The content is split into chunks (e.g. per page or per 500 words) for optimal semantic processing.
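The per-500-words chunking can be sketched as a standalone function. The project's real logic lives in `helpers/parsePdf.js`; this version only illustrates the idea:

```javascript
// Split extracted PDF text into fixed-size word chunks for embedding.
function chunkByWords(text, chunkSize = 500) {
  const words = text.split(/\s+/).filter(Boolean)
  const chunks = []
  for (let i = 0; i < words.length; i += chunkSize) {
    chunks.push(words.slice(i, i + chunkSize).join(' '))
  }
  return chunks
}
```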
3. **Embedding Generation**
   - Each text chunk is sent to Ollama (`helpers/ollama.js`) to generate vector embeddings using `nomic-embed-text`.
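A hedged sketch of that call, using Ollama's HTTP embeddings endpoint (`POST /api/embeddings` with `{ model, prompt }`, returning `{ embedding }`); `helpers/ollama.js` presumably wraps something similar:

```javascript
// Build the fetch options for an Ollama embedding request.
function buildEmbeddingRequest(text) {
  return {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ model: 'nomic-embed-text', prompt: text })
  }
}

// Request an embedding for one text chunk; returns an array of floats.
async function embed(text, baseUrl = process.env.OLLAMA_URL || 'http://localhost:11434') {
  const res = await fetch(`${baseUrl}/api/embeddings`, buildEmbeddingRequest(text))
  const data = await res.json()
  return data.embedding
}
```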
4. **Database Storage**
   - Metadata, text chunks, and their embeddings are saved to your SQL database (MySQL/PostgreSQL) via Knex (`config/db.js`).
   - All migrations/schema are managed in `db/migrations/`.
5. **Question Answering API**
   - The user sends a POST request to `/ask` (handled in `index.js` or your Express API layer).
   - The system:
     - Embeds the user query (with Ollama)
     - Searches the database for the most relevant text chunks using full-text search and/or cosine similarity on embeddings
     - Selects the top-N best-matching chunks as context
     - Builds a prompt for the LLM (e.g. llama3.2 via Ollama)
     - Generates and returns the answer to the user
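The retrieval step above can be sketched as plain cosine-similarity ranking over rows loaded from the database. Function and field names here are illustrative; the project's version lives in `service/pipeline.js`:

```javascript
// Cosine similarity between two equal-length embedding vectors.
function cosineSimilarity(a, b) {
  let dot = 0, normA = 0, normB = 0
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i]
    normA += a[i] * a[i]
    normB += b[i] * b[i]
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB))
}

// Rank stored chunks against the query embedding and keep the top N
// as context for the LLM prompt. rows: [{ text, embedding }].
function topChunks(queryEmbedding, rows, n = 5) {
  return rows
    .map((row) => ({ ...row, score: cosineSimilarity(queryEmbedding, row.embedding) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, n)
}
```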
6. **Result Delivery**
   - The user receives the answer and (optionally) references to the most relevant documents/chunks.
MIT © NeaByteLab