A self-tutoring AI assistant with document grounding, knowledge graphs, and deep research capabilities. Upload your learning materials and get intelligent, cited answers from your documents.

## Features

- 📄 **Document-Grounded Answers** - Responses based ONLY on your uploaded documents
- 🔍 **Hybrid RAG Search** - Combines vector similarity + knowledge graph traversal
- 🕸️ **Knowledge Graph** - Builds entity relationships using Neo4j (optional)
- 🔬 **Deep Research** - Web search + document synthesis
- 🤖 **Local LLM** - Uses Ollama (no API keys needed)
- 🛡️ **Security Guardrails** - Configurable content filtering and safety checks
- 💾 **Large File Support** - Up to 50MB per document
## Prerequisites

- Python 3.10+
- Ollama - download from the Ollama website
- Neo4j (optional) - For knowledge graph features
- Docker (optional) - For containerized deployment
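Before running the startup script, you can confirm Ollama is reachable. A minimal Python check (assuming the `requests` package; `GET /api/tags` is Ollama's endpoint for listing installed models):

```python
import requests

# Ollama lists its installed models at GET /api/tags
try:
    resp = requests.get("http://localhost:11434/api/tags", timeout=5)
    resp.raise_for_status()
    models = [m["name"] for m in resp.json().get("models", [])]
    print(f"Ollama is up with {len(models)} model(s): {models}")
except requests.ConnectionError:
    print("Ollama is not reachable - start it with 'ollama serve'")
```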
## Quick Start

**Linux/macOS:**

```bash
# Clone the repository
git clone <repository-url>

# Make scripts executable
chmod +x startup.sh

# Run the startup script (handles everything automatically)
./startup.sh
```

**Windows:**

```bat
REM Clone the repository
git clone <repository-url>

REM Run the startup script
startup.bat
```

The startup script will:
- ✅ Check system requirements
- ✅ Verify Ollama is running
- ✅ Check/download required models
- ✅ Install Python dependencies
- ✅ Start the application
Access the application at: http://localhost:5000
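To verify the server is up from a script rather than a browser, you can hit the `/config-status` endpoint listed in the API section below. A minimal sketch (the response fields depend on the app, so only the status code is checked):

```python
import requests

# A simple readiness check against the documented /config-status endpoint
resp = requests.get("http://localhost:5000/config-status", timeout=10)
print("App is up" if resp.status_code == 200 else f"Unexpected status: {resp.status_code}")
```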
## Usage

1. **Start the Application**

   ```bash
   ./startup.sh
   ```

2. **Upload Documents**
   - Go to http://localhost:5000
   - Upload PDFs, DOCX, TXT, or other supported formats
   - Maximum size: 50MB per file

3. **Ask Questions**
   - Type your question in the chat interface
   - Get answers grounded in your documents
   - See source citations for each answer
## Supported File Formats

- 📄 PDF
- 📝 DOCX
- 📃 TXT, MD
- 📊 CSV, JSON
- 🌐 HTML, XML
## Startup Modes

**Fast Mode (Default)** - Vector search only, no knowledge graph

```bash
./startup.sh --fast
```

**Full Mode** - With knowledge graph (requires Neo4j)

```bash
./startup.sh --full
```

**Docker Mode** - Containerized deployment

```bash
./startup.sh --docker
```

## Configuration

Create a `.env` file (copy from `.env.example`):
```env
# LLM Configuration
LLM_MODEL=llama3.2:3b              # Recommended: 3b or 8b variant
EMBEDDING_MODEL=nomic-embed-text   # Most reliable option

# Ollama Connection
OLLAMA_HOST=http://localhost:11434

# Optional: Neo4j (for knowledge graph)
# ENABLE_KNOWLEDGE_GRAPH=true
# NEO4J_URI=bolt://localhost:7687
# NEO4J_USER=neo4j
# NEO4J_PASSWORD=your-password
```

### Recommended LLM Models

| Model | Size | RAM Required | Best For | Performance |
|---|---|---|---|---|
| `llama3.2:3b` | 2GB | 4GB+ | Balanced use | ⭐⭐⭐⭐ Recommended |
| `llama3:8b` | 5GB | 8GB+ | High accuracy | ⭐⭐⭐⭐⭐ Best quality |
| `qwen2.5:7b` | 4GB | 8GB+ | Technical docs | ⭐⭐⭐⭐⭐ |
| `phi3:3.8b` | 2.3GB | 4GB+ | Low resources | ⭐⭐⭐ |
### Recommended Embedding Models

| Model | Size | Speed | Reliability |
|---|---|---|---|
| `nomic-embed-text` | 700MB | Fast | ✅ Most reliable |
| `all-minilm` | 80MB | Very Fast | ✅ Very reliable |
| `mxbai-embed-large` | 1.5GB | Slower | |
## Troubleshooting

**Problem:** Ran `ollama pull <model>` but model still shows 404 errors

**Solution:**

```bash
# Restart Ollama service
pkill ollama && ollama serve &

# Verify model appears
ollama list
```

**Problem:** Changing embedding models wipes the database

**Solution:**

```bash
# Stick with one embedding model, or re-index
./startup.sh
# Wait for documents to re-index automatically
```

For detailed troubleshooting, see TROUBLESHOOTING_INDEXING.md.
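To confirm re-indexing has finished, you can poll the `/chroma-status` diagnostic endpoint (documented below). A rough sketch; the `document_count` field name is an assumption, so inspect the real payload first:

```python
import time
import requests

# Poll /chroma-status until the indexed-document count stops changing.
last_count = -1
while True:
    status = requests.get("http://localhost:5000/chroma-status", timeout=10).json()
    count = status.get("document_count", 0)  # hypothetical field name - check the actual response
    if count == last_count:
        break
    last_count = count
    time.sleep(5)

print(f"Index stable at {last_count} documents")
```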
## Docker Deployment

```bash
# Fast Mode (uses host Ollama)
docker-compose up -d graphrag

# Full Mode (with Neo4j knowledge graph)
docker-compose --profile kg up -d

# With containerized Ollama (GPU)
docker-compose --profile ollama up -d
```

```bash
# Create .env file
cp .env.example .env

# Edit configuration
nano .env
```

## API Endpoints

### Core Endpoints

| Endpoint | Method | Description |
|---|---|---|
| `/ask` | POST | Ask a question |
| `/upload` | POST | Upload document |
| `/deep-research` | POST | Web research + synthesis |
| `/config-status` | GET | Get configuration |

### Diagnostic Endpoints

| Endpoint | Method | Description |
|---|---|---|
| `/chroma-status` | GET | Index statistics |
| `/debug-search` | POST | Test vector search |
| `/graph-stats` | GET | Knowledge graph stats |
| `/data-store-files` | GET | List indexed files |
### Example: Ask a Question

```bash
curl -X POST http://localhost:5000/ask \
  -H "Content-Type: application/json" \
  -d '{
    "question": "What are the security best practices?",
    "mode": "hybrid"
  }'
```

### Example: Upload a Document

```bash
curl -X POST http://localhost:5000/upload \
  -F "file=@document.pdf"
```
## Project Structure

```
graphrag_project/
├── graphrag_app.py # Main Flask application
├── config.py # Configuration with auto-detection
├── search.py # Hybrid RAG search
├── document_processor.py # Document parsing & chunking
├── deep_research.py # Web research functionality
├── guardrails_handler.py # Security guardrails
│
├── entity_extractor.py # LLM-based entity extraction
├── entity_resolver.py # Entity deduplication
├── neo4j_graph.py # Knowledge graph operations
├── ontology.py # Entity schemas
│
├── startup.sh # Linux/macOS startup script
├── startup.bat # Windows startup script
├── check_models.sh # Model verification
├── check_indexing.sh # Indexing diagnostics
├── upgrade_llm.sh # LLM upgrade helper
│
├── Dockerfile # Container build
├── docker-compose.yml # Multi-service orchestration
├── docker-entrypoint.sh # Container startup
│
├── templates/ # Web UI templates
├── guardrails/ # Guardrails configuration
│
├── requirements.txt # Python dependencies
├── .env.example # Environment template
└── README.md # This file
```
## Advanced Features

### Knowledge Graph

Enable entity extraction and relationship mapping:

```bash
# Start with knowledge graph
./startup.sh --full

# Or enable via API
curl -X POST http://localhost:5000/config/enable-kg
```

### Deep Research

Perform web-based research with document synthesis:

```bash
curl -X POST http://localhost:5000/deep-research \
  -H "Content-Type: application/json" \
  -d '{
    "topic": "Machine learning best practices",
    "include_web": true,
    "include_docs": true,
    "depth": "standard"
  }'
```

**Depth Levels:**

- `quick` - ~5 sources, 30 seconds
- `standard` - ~15 sources, 1 minute
- `deep` - ~25+ sources, 2 minutes
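For scripting longer-running research jobs, here is the same request from Python (a sketch mirroring the curl example above; `depth` takes the three levels just listed):

```python
import requests

report = requests.post(
    "http://localhost:5000/deep-research",
    json={
        "topic": "Machine learning best practices",
        "include_web": True,
        "include_docs": True,
        "depth": "standard",  # one of: quick, standard, deep
    },
    timeout=180,  # deep runs can take ~2 minutes
)
report.raise_for_status()
print(report.json())  # response shape depends on the app
```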
### Search Parameter Tuning

```bash
curl -X POST http://localhost:5000/config/search-params \
  -H "Content-Type: application/json" \
  -d '{
    "top_k": 8,
    "min_relevance": 0.4,
    "search_mode": "hybrid",
    "context_window": 8000
  }'
```

### Model Variant Auto-Detection

The application automatically detects model variants installed in Ollama:
- ✅ `.env` specifies `LLM_MODEL=llama3.2`
- ✅ You have `llama3.2:3b` installed
- ✅ App auto-detects and uses `llama3.2:3b`
**Console Output:**

```
🤖 Initializing LLM (llama3.2)...
ℹ️ Auto-detected model variant: llama3.2:3b (configured: llama3.2)
✅ LLM ready
```
No configuration needed - it just works!
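For the curious, variant matching like this can be implemented against Ollama's model list. A minimal sketch of the idea, not the app's actual code (assumes Ollama's `GET /api/tags` endpoint and the `requests` package):

```python
import requests

def resolve_model_variant(configured: str, host: str = "http://localhost:11434") -> str:
    """Map a base model name to an installed variant, e.g. 'llama3.2' -> 'llama3.2:3b'."""
    installed = [m["name"] for m in requests.get(f"{host}/api/tags", timeout=5).json()["models"]]
    if configured in installed:
        return configured  # exact match, nothing to resolve
    for name in installed:
        if name.split(":")[0] == configured:
            return name  # first installed variant sharing the base name
    raise ValueError(f"No installed variant of {configured!r}; try: ollama pull {configured}")
```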
## Contributing

Contributions are welcome! Please:

- Fork the repository
- Create a feature branch
- Make your changes
- Submit a pull request
## License

MIT License - See LICENSE file for details.
## Acknowledgments

Built with:

- Ollama - Local LLM runtime
- LangChain - LLM orchestration
- ChromaDB - Vector database
- Neo4j - Graph database
- Flask - Web framework
Built with ❤️ for self-directed learners