Text preprocessing and PII anonymisation for NLP/ML. ONNX NER ensemble, language detection, stopword removal. Built for statistical ML and language models.
Updated Feb 23, 2026 - Python
A secure utility for sanitizing logs, text files, and archives using customizable regex rules.
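As a rough illustration of what rule-based sanitizing with customizable regexes can look like, here is a minimal Python sketch. The `RULES` table, the `sanitize` function, and the two patterns are hypothetical names chosen for this example; they are not taken from the listed utility, whose actual rule format and interface are not described here.

```python
import re

# Hypothetical rule set: placeholder label -> compiled pattern.
# These patterns are illustrative assumptions, not the rules of any listed project.
RULES = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "IPV4": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
}


def sanitize(text, rules=RULES):
    """Replace every match of each rule with a [LABEL] placeholder."""
    for label, pattern in rules.items():
        text = pattern.sub(f"[{label}]", text)
    return text


if __name__ == "__main__":
    line = "login from 192.168.0.12 by alice@example.com"
    print(sanitize(line))  # prints: login from [IPV4] by [EMAIL]
```

Passing a different dictionary as `rules` swaps in custom patterns without changing the function. These simple patterns are deliberately loose (the IPv4 rule, for instance, does not check that each octet is at most 255), so a real rule set would likely need tightening.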