Having approximately 600GB of photos collected over 13 years - now stored on freebsd zfs/server.
Photos comes from family computers, from several partial backups to different external USB HDDs, reconstructed images from disk disasters, from different photo manipulation softwares (iPhoto, Picassa, HP and many others :( ) in several deep subdirectories - shortly = TERRIBLE MESS with many duplicates.
So in the first i done:
This helped a lot, but here are still MANY MANY duplicates:
Now the questions:
More complex
I'm able make complex scripts is BASH and "+-" :) know perl.. Can use FreeBSD/Linux utilities directly on the server and over the network can use OS X (but working with 600GB over the LAN not the fastest way)...
My rough idea:
use Image::ExifTool
script for collecting duplicate image data based on image-creation date, and camera model (maybe other exif data too).Any idea, help, any (software/algorithm) hint how to make order in the chaos?
Ps:
Here is nearly identical question: Finding Duplicate image files but i'm already done with the answer (md5). and looking for more precise checksuming and image comparing algorithms.
The Google picture search is the most widely used image search engine due to its extensive database that contains billions of images uploaded over the web. It is best to use image search Google when your aim is to find identical pictures against your queried image.
Definition: Photographic copy of original photograph, usually transparency or negative: copy should reproduce colours, contrast and detail of original as closely as possible. * Duplicate may be larger than original.
Assuming you can work with localy mounted FS:
rmlint
: fastest tool I've ever used to find exact duplicates findimagedupes
: automatize the whole ImageMagick way (as Randal Schwartz's script that I haven't tested? it seems)dupeguru-pe
(gui) : dedicated tool that is fast and does an excellent jobgeeqie
(gui) : I find it fast/excellent to finish the job, using the granular deduplication options. Also then you can generate an ordered collection of images such that 'simular images are next to each other, allowing you to 'flip' between the two to see the changes.If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With