Is a project that is intended to sort out the mess i made on my harddrives. I want to sort
pictures into a nested folder structure like this year/month/day/filename.extension without duplicates.
Since there are currently countless folders on my drives which contain the same files in different folders I started out with globbing through a filelist and pushing it to a RabbitMq Queue.
Start by running docker-compose -f docker-compose.yml up -d this will start rabbitmq
There is a datafolder in the root of this directory, it contains a source and a target folder, both are mounted as volumes in the docker-compose files. You should change the volume mappings according to your own wishes.
The project is split into a couple of steps:
-
sending the file list to the first queue
- this is done by running
python filelist.pyfrom within the src directory or - by running
make filelist
- this is done by running
-
scanning the exifdata for all the files from the filelist
- done by running
python exifscanner.pyormake exifscanner
- done by running
-
moving the files based on their original capture time (if available, leaves them untouched in the source folder otherwise)
- done by running
python filemover.pyormake filemover
- done by running
-
scan the duplicates, files with the same filename (and inherentlcy the same capturedate) will be scanned for likeness.
- exact duplicates will be posted to a seperate queue for the next process
- start using
python duplicate_scanner.pyormake duplicate-scanner
-
remove duplicates, originals will be removed in this step
- start using
python duplicate_remover.pyormake duplicate-remover
- start using