β οΈ Community Integration Fork | All credit goes to clusterzx for the original Paperless-AI project.
Paperless-AI is an AI-powered extension for Paperless-ngx created by clusterzx that brings automatic document classification, smart tagging, and semantic search using OpenAI-compatible APIs and Ollama.
This is a community-maintained integration fork that:
- π§ͺ Tests and merges pending upstream pull requests
- π¦ Provides optimized Docker images (Lite & Full variants)
- π Applies security updates and dependency maintenance
- π Integrates community bug fixes
- π Offers additional documentation
Important: This fork exists purely for experimentation and integration testing. All development credit belongs to the original author. Think of this as a "tinkering workshop" where community fixes are tested before potentially flowing back upstream.
Want the official version? β clusterzx/paperless-ai
Performance Enhancements:
- β‘ PERF-001: History table with SQL pagination (~25-50x faster with 1000+ documents)
- π― Tag caching with 5-minute TTL (95% reduction in API calls)
- π Force reload button to bypass cache when needed
Bug Fixes:
- β PR-772: Fixed infinite retry loop with exponential backoff
- β PR-747: History validation tool with real-time progress indicators
- β SSE buffering fix for instant progress feedback
- β Security: Authentication added to all history endpoints
Docker Optimizations:
- π¦ DOCKER-001: Optimized images (Lite ~400MB, Full ~1.2GB)
- ποΈ Separate build/push workflow for reliability
- π§ Multi-stage builds with clean dependency installation
Documentation:
- π Comprehensive COPILOT.md for developers and AI assistants
- π Detailed fix documentation in
Included_Fixes/ - π Swagger API documentation improvements
UI/UX:
- π Enhanced dark mode support
- π± Responsive mobile menu
- β±οΈ Real-time progress bars with step counts
- π¨ Improved loading indicators
All fixes are documented in Included_Fixes/ with detailed implementation notes:
| Category | Fix ID | Description | Status |
|---|---|---|---|
| Upstream PRs | PR-772 | Fix infinite retry loop | β Merged |
| PR-747 | History validation tool | β Merged | |
| Performance | PERF-001 | SQL pagination & tag caching | β Applied |
| Security | SEC-001 | SSRF & code injection fixes | β Applied |
| Docker | DOCKER-001 | Optimized Docker images | β Applied |
| Dependencies | DEP-001 | Remove unused sqlite3 | β Applied |
| CI/CD | CI-001 | Automatic version tagging | β Applied |
Full details: Included_Fixes/README.md
It enables fully automated document workflows, contextual chat, and powerful customization β all via an intuitive web interface.
π‘ Just ask:
βWhen did I sign my rental agreement?β
βWhat was the amount of the last electricity bill?β
βWhich documents mention my health insurance?β
Powered by Retrieval-Augmented Generation (RAG), you can now search semantically across your full archive and get precise, natural language answers.
- Detects new documents in Paperless-ngx automatically
- Analyzes content using OpenAI API, Ollama, and other compatible backends
- Assigns title, tags, document type, and correspondent
- Built-in support for:
- Ollama (Mistral, Llama, Phi-3, Gemma-2)
- OpenAI
- DeepSeek.ai
- OpenRouter.ai
- Perplexity.ai
- Together.ai
- LiteLLM
- VLLM
- Fastchat
- Gemini (Google)
- ...and more!
- Natural language document search and Q&A
- Understands full document context (not just keywords)
- Semantic memory powered by your own data
- Fast, intelligent, privacy-friendly document queries

- Web interface for manual AI tagging
- Useful when reviewing sensitive documents
- Accessible via
/manual
- Define rules to limit which documents are processed
- Disable prompts and apply tags automatically
- Set custom output tags for tracked classification

β οΈ First-time install: Restart the container after completing setup (API keys, preferences) to build RAG index.
π Not required for updates.
π Installation Wiki
Images are available on both GitHub Container Registry (GHCR) and Docker Hub:
# Lite version (~500-700 MB) - AI tagging only
docker pull ghcr.io/admonstrator/paperless-ai-patched:latest
# Full version (~1.5-2 GB) - AI tagging + RAG search
docker pull ghcr.io/admonstrator/paperless-ai-patched:latest-fullBenefits of GHCR:
- β No rate limits
- β Unlimited bandwidth
- β Free for public repositories
- β Integrated with GitHub Actions
# Lite version
docker pull admonstrator/paperless-ai-patched:latest
# Full version
docker pull admonstrator/paperless-ai-patched:latest-full- Multi-stage optimized builds for smaller image sizes
- Health monitoring and auto-restart
- Persistent volumes and graceful shutdown
- Works out of the box with minimal setup
- Multi-arch support (amd64, arm64)
# Install dependencies
npm install
# Start development/test mode
npm run test- β Multi-AI model support
- β Multilingual document analysis
- β Tag rules and filters
- β Integrated document chat with RAG
- β Responsive web interface
Note: This is an unofficial community fork. For core features and major changes, please contribute to the upstream project.
For this fork specifically:
- π Bug reports for integration issues
- π¦ Docker-related improvements
- π Documentation enhancements
- π§ͺ Testing feedback
Open an issue or PR if you have improvements to share!
- Upstream Issues: clusterzx/paperless-ai/issues
- Upstream Discord: Join Community
- Fork Issues: Admonstrator/paperless-ai-patched/issues
This project is licensed under the MIT License. See LICENSE for details.
Original work Copyright Β© clusterzx
Fork maintained by Admonstrator