All-in-one text de-duplication
-
Updated
Jan 2, 2026 - Python
Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
All-in-one text de-duplication
CD4Py: Code De-Duplication for Python
Preprocesses the query logs which can be used by suggesters like Most Popular Suggester (MPS).
A secure image storage application written in Python
A misc project of python based function to track, survey and manage files from mutiple systems
Created by Halbert L. Dunn
Released 1946