A containerized AI-powered web assistant. This solution leverages a microservices architecture to crawl university data, process it via NLP pipelines, and generate context-aware responses using local Large Language Models (LLMs).
The system is composed of the following Docker services:
- `webapp`: Node.js/Express/TypeScript frontend (MVC) with Bootstrap 5. Handles user interaction, request concurrency locking, and dynamic dashboard rendering.
- `ai-service`: Python/FastAPI backend. Orchestrates the NLP pipeline using LangChain and integrates with the Google Gemini API. It handles:
  - Classification: a Scikit-learn Logistic Regression model to filter relevant queries.
  - RAG (Retrieval-Augmented Generation): summarization and QA chains powered by Qwen models via Ollama, with context retrieved from PostgreSQL with `pgvector`.
  - Text-to-SQL: generates SQL queries for data analytics based on user input.
  - Data-to-Visualization: generates EJS/Chart.js snippets for dynamic dashboards.
- `db`: PostgreSQL 16 with the `pgvector` extension. Stores RAG documents, database schema information for Text-to-SQL, and dashboard history.
- `crawler`: Python/FastAPI service using `crawl4ai` (Playwright) to fetch live content from UnivPM and populate the `rag_documents` table in PostgreSQL.
- `ollama`: (optional) containerized LLM inference server. Can be replaced by a local instance for better performance on Apple Silicon/GPU.
- `ngrok`: exposes the application to the public internet.
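The classification step that gates incoming queries can be pictured as a small Scikit-learn pipeline. The following is a minimal, hypothetical sketch (toy data, and the feature setup may differ from the project's real `train_pipeline.py`):

```python
# Sketch of a relevance gate: TF-IDF features + Logistic Regression.
# The training examples below are illustrative only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# 1 = relevant (university-related), 0 = irrelevant
questions = [
    "When does the enrollment period open?",
    "Where is the engineering department located?",
    "How do I pay tuition fees?",
    "What is the best pizza topping?",
    "Tell me a joke about cats",
    "Who won the football match yesterday?",
]
labels = [1, 1, 1, 0, 0, 0]

clf = make_pipeline(TfidfVectorizer(), LogisticRegression())
clf.fit(questions, labels)

def is_relevant(query: str) -> bool:
    """Gate a query before it reaches the RAG/Text-to-SQL chains."""
    return bool(clf.predict([query])[0])
```

Only queries that pass this gate would be forwarded to the heavier LLM-backed chains.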
- Docker Desktop
- Ollama (Recommended for local execution)
- Python 3.10+ (Optional, for local training)
Copy the example environment file and configure it:
```
cp .env.example .env
```

Edit `.env` to set your configuration. If using local Ollama (recommended for Mac M1/M2/M3), ensure:

```
OLLAMA_URL=http://host.docker.internal:11434
```

To utilize your host's GPU and share models with the project, you must configure your local Ollama instance to listen on all interfaces and store models in the project directory.
Stop any running Ollama instance (e.g., from the menu bar), then run:
```
# Run this from the project root
export OLLAMA_HOST=0.0.0.0
export OLLAMA_MODELS=$(pwd)/models/models

# Start the server
ollama serve
```

Note: keep this terminal open. The `OLLAMA_MODELS` path ensures that models pulled by the project are stored within the repo structure.
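On the container side, the `OLLAMA_URL` value from `.env` can be read with a host-gateway fallback. A minimal sketch (the helper name is hypothetical, not taken from the project's code):

```python
import os

def get_ollama_url() -> str:
    # Fall back to the Docker host-gateway address, which is what the
    # containers use when Ollama runs directly on the host machine.
    return os.environ.get("OLLAMA_URL", "http://host.docker.internal:11434")
```

This is why `host.docker.internal` appears in the `.env` example: from inside a container, `localhost` would point at the container itself, not at the Ollama server running on your host.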
If you don't have a dataset, you can generate a synthetic one with university-related questions:
```
# Generate 2000 samples in data/raw/training_dataset.csv
python scripts/generate_dataset.py
```

Before starting the services, you must train the relevance classifier.
Option A: Using Docker (easiest). Start the services first, then run the training script inside the container:

```
docker-compose up -d
docker-compose exec ai-service python scripts/train_pipeline.py --input-file "data/raw/training_dataset.csv" --text-column "question" --label-column "label" --model logistic_regression
```

Option B: Local Python

```
pip install -r src/inference/requirements.txt
python scripts/train_pipeline.py \
  --input-file "data/raw/training_dataset.csv" \
  --text-column "question" \
  --label-column "label" \
  --model logistic_regression
```

Start the entire stack:

```
docker-compose up --build
```

- Web Interface: http://localhost:3000
- Public URL: check the `ngrok` service logs or visit http://localhost:4040.
- Hot Reload: the `webapp` service is configured with `nodemon`. Changes to `src/application` (TS, EJS, CSS) will trigger an automatic rebuild/restart.
- Logs: view logs for specific services:

```
docker-compose logs -f ai-service
```
```
├── data/                 # Datasets (raw, processed)
├── models/               # Trained .pkl models and Ollama blobs
├── scripts/              # Training and utility scripts
├── src/
│   ├── application/      # Node.js Web App (Frontend/BFF)
│   ├── crawler/          # Python Crawler Service
│   └── inference/        # Python AI/NLP Service
├── utils/                # Shared Python utilities (Preprocessing, Logger)
└── docker-compose.yml    # Orchestration
```
- Classifier: Logistic Regression (Scikit-learn)
- Orchestration: LangChain (Python)
- Summarization: `qwen3:0.6b` (via Ollama)
- QA/Chat: `qwen3:1.7b` (via Ollama)
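Calls to these models go through Ollama's HTTP API. A minimal sketch of building such a request (the payload fields follow Ollama's `/api/generate` endpoint; the function name is hypothetical and this does not send the request):

```python
import json
import urllib.request

def build_generate_request(model: str, prompt: str,
                           base_url: str = "http://host.docker.internal:11434"):
    """Build an HTTP request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Example: a QA request against the chat model listed above.
req = build_generate_request("qwen3:1.7b", "What is UnivPM?")
```

In the project itself this plumbing is handled by LangChain's Ollama integration rather than raw HTTP calls.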
