😎 Finding duplicate images made easy!
-
Updated
Aug 15, 2025 - Python
😎 Finding duplicate images made easy!
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)
A Python tool to identify and remove similar-looking images from a dataset. Utilizes image preprocessing and hashing techniques for efficient comparison.
🏍️ A clustering tool providing exact and near de-duplication of images using vector embeddings.
高效的Python图像查重工具,支持百万级图片文件的重复检测。集成多种算法包括MD5哈希、感知哈希(dHash/pHash/aHash)和C++加速库,可识别完全相同、分辨率调整、部分截取和水印变更的重复图像。
a Python command-line tool that identifies and groups similar images using average hashing. It supports single-level and recursive directory scanning, adjustable similarity threshold, and presents results in JSON format. Ideal for image deduplication, organization, and content-based retrieval tasks.
The extended version of simhash supports fingerprint extraction of documents and images.
This Python script helps in identifying and moving duplicate images within a specified directory to a designated duplicates folder.
A python program to detect duplicate images in a specified folder.
Sobel Gradient Image Deduplication
Get Similarity adalah alat berbasis Python dengan antarmuka GUI yang memungkinkan pengguna menyaring gambar berkualitas rendah dan mengelompokkan gambar serupa secara otomatis menggunakan embedding CLIP + DINOv2 dan evaluasi kualitas berbasis MusIQ.
Add a description, image, and links to the image-deduplication topic page so that developers can more easily learn about it.
To associate your repository with the image-deduplication topic, visit your repo's landing page and select "manage topics."