auto-dedact

A self-learning sensitive data detection and redaction engine combining deterministic rules (regex) with LLM-based evaluation and feedback loops.

Auto-Dedact is intentionally named with dual meaning:

Automated Detect & Redact — the system automatically detects sensitive data and redacts it using deterministic rules and learned patterns.
A deliberate sound-pun on autodidact — reflecting that the engine learns by itself through LLM-driven feedback loops, validation, and rule refinement.

Early Development Screenshots

These screenshots reflect early development runs executed locally. Output is intentionally raw to demonstrate decision logic and failure handling.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.vscode		.vscode
app		app
docker/initdb		docker/initdb
docs		docs
poc_scripts_and_tools		poc_scripts_and_tools
synthetic_training_data		synthetic_training_data
tests/detect_redact		tests/detect_redact
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
TODOs.md		TODOs.md
docker-compose.yaml		docker-compose.yaml
main.py		main.py
main_self_learning.py		main_self_learning.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock