Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Detecting Similar images [duplicate]

Possible Duplicate:
Image comparison algorithm

So basically i need to write a program that checks whether 2 images are the same or not. Consider the following 2 images:

http://i221.photobucket.com/albums/dd298/ramdeen32/starry_night.jpg

http://i221.photobucket.com/albums/dd298/ramdeen32/starry_night2.jpg

Well they are both the same images but how do i check to see if these images are the same. I am only limited to the media functions. All i can think of right now is the width height scaling and compare the RGB for each pixel but wouldnt the color be different?

Im completely lost on this one, any help is appreciated.

*Note this has to be in python and use the (media library)

like image 287
1337holiday Avatar asked Sep 24 '10 00:09

1337holiday


People also ask

How can I detect identical images across an entire system?

The undouble library can be used to detect identical images across an entire system or any input directory. Given the input data, preprocessing is performed on the images, the hashes are computed, and images are grouped based on the image hash.

How to find the difference between two images?

Go through each pixel in both images and store the difference in either each of the RGB channels, or just the difference in grayscale intensity. You would end up with an array the size of the image noting the difference between the pixel intensities on the two images.

How to detect (near-identical) images with undouble?

I touched on the concepts of hash functions for the detection of (near-)identical images. The perceptual hash function is recommended for the detection of duplicate images. With the undouble library it is relatively straightforward to detect (near-)identical images across an entire system or directory.

How much does visual similarity duplicate image finder cost?

The single-user license of the Visual Similarity Duplicate Image Finder program is available in a three-tier pricing structure: Standard ($24.95), Professional ($39.95), and Corporate ($499.00). Why Should You Pick Visual Similarity Duplicate Image Finder?


2 Answers

Wow - that is a massive question, and one that has a vast number of possible solutions. I'm afraid I'm not a python expert, but I thought your question was interesting - so I wanted to propose a method that I would implement if I were posed with this problem.

Obviously, the two images you posted are actually very different - so you will need to consider 'how much different is the same', especially when working with images and considering different image formats and compression etc.

Anyway, for a solution that allows for a given difference in colour values (but not for pixels to be in the wrong places), I would do something like the following;

  1. Pick two images.

  2. Rescale the largest image to the exact same height and width as the first (even distorting the image if necessary).

  3. Possibly grayscale the images to make the next steps simpler, without losing much in the way of effectiveness. Actually, possibly running edge detection here could work too.

  4. Go through each pixel in both images and store the difference in either each of the RGB channels, or just the difference in grayscale intensity. You would end up with an array the size of the image noting the difference between the pixel intensities on the two images.

  5. Now, I don't know the exact values, but you would probably then find that if you iterate over the array you could see whether the difference between each pixel in the two images is the same (or nearly the same) across all of the pixels. Perhaps iterate over the array once to find the average difference between the pixel intensities in the two images, then iterate over the image again to see if 90% of the differences fall within a certain threshold (5% difference?).

Just an idea. Of course, there might be some nice functions that I'm not aware of to make this easy, but I wouldn't hold my breath!

like image 185
John Wordsworth Avatar answered Oct 04 '22 03:10

John Wordsworth


ImageMagick has Python bindings and a comparison function. It should do most of the work for you, but I've never used it in Python.

like image 35
kindall Avatar answered Oct 04 '22 05:10

kindall