DuplicateFileFinder is a fast, cross-platform command-line tool that detects duplicate files based on their exact content, not just file names.
It works with any file type:
- Videos 🎬
- Music 🎵
- Images 🖼️
- Documents (PDF, DOCX, TXT, etc.)
- Archives, executables, and more
If two files have identical content, they will be detected as duplicates — even if they have different names or are located in different folders.
- Recursive directory scanning
- Content-based duplicate detection (SHA-256)
- Optimized scanning using file size grouping
- Clear, color-coded output:
- 🟢 Green: file to keep
- 🔴 Red: duplicate copies
- 🔵 Helpful suggestions
- Human-readable statistics
- Cross-platform (Windows, Linux, macOS)
- No ads, no tracking, no data sent anywhere
DuplicateFileFinder does not rely on file names, dates, or metadata.
Instead, it uses a two-step verification process:
-
File size grouping
Files with different sizes are automatically excluded (they cannot be identical). -
SHA-256 hashing
Files with the same size are verified using the SHA-256 cryptographic hash algorithm.
SHA-256 guarantees that:
- If two files have the same hash, their content is 100% identical
- False positives are practically impossible
You can safely trust the results.
- Language: Python 3
- Libraries: Python Standard Library only
oshashlibpathlibcollections
No external dependencies are required.
- Python 3.8 or newer
python DuplicateFileFinder.py scan "C:\Path\To\Directory"If you like this project and want to support its development,
you can make a donation here: