 

Building a simple image search using TensorFlow

I need to implement a simple image search in my app using TensorFlow. The requirements are these:

  1. The dataset contains around a million images, all of the same size, each containing one unique object and only that object.
  2. The search parameter is an image taken with a phone camera of some object that is potentially in the dataset.

I've managed to extract the object from the camera picture and straighten it into rectangular form; as a result, a reverse image search engine like TinEye was able to find a match.

Now I want to reproduce that indexer by using TensorFlow to create a model based on my dataset (making each image's file name a unique index).

Could anyone point me to tutorials/code that would explain how to achieve such a thing without diving too much into computer vision terminology?

Much appreciated!

asked Mar 27 '16 by reflog



1 Answer

The Wikipedia article on TinEye says that Perceptual Hashing will yield results similar to TinEye's, and references this detailed description of the algorithm, though TinEye itself has declined to confirm what it uses.


The biggest issue with the Perceptual Hashing approach is that while it's efficient for identifying the same image (subject to skews, contrast changes, etc.), it's not great at identifying a completely different image of the same object (e.g. the front of a car vs. the side of a car).
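To make the idea concrete, here's a minimal sketch of a perceptual-hash-style comparison in Python. Note this implements the simpler average hash (aHash) rather than pHash's DCT hash, and assumes the image is already a grayscale 2-D array; it's for illustration only.

```python
import numpy as np

def average_hash(gray, hash_size=8):
    """Simplified perceptual hash (aHash): block-average the image down to
    hash_size x hash_size, then threshold each cell at the overall mean."""
    h, w = gray.shape
    gray = gray[:h - h % hash_size, :w - w % hash_size]  # trim to a multiple
    bh, bw = gray.shape[0] // hash_size, gray.shape[1] // hash_size
    small = gray.reshape(hash_size, bh, hash_size, bw).mean(axis=(1, 3))
    return (small > small.mean()).flatten()  # 64-bit boolean fingerprint

def hamming(h1, h2):
    """Number of differing bits; a small distance suggests the same image."""
    return int(np.count_nonzero(h1 != h2))
```

Two near-identical images typically hash a few bits apart, while unrelated images differ in roughly half the bits. The real pHash adds a DCT step that makes the fingerprint more robust to contrast and scaling changes, but as noted above, no hash of this family will match two genuinely different photos of the same object.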

TensorFlow has great support for deep neural nets which might give you better results. Here's a high level description of how you might use a deep neural net in TensorFlow to solve this problem:

  1. Start with a pre-trained NN (such as GoogLeNet) or train one yourself on a dataset like ImageNet.
  2. Given a new picture you're trying to identify, feed it into the NN.
  3. Look at the activations of a fairly deep layer in the NN. This vector of activations is like a 'fingerprint' for the image.
  4. Find the picture in your database with the closest fingerprint. If it's sufficiently close, it's probably the same object.
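Assuming you've already run each database image through the network and stored its activation vector (how you extract activations depends on the model and toolkit, so that part is omitted here), the lookup step might be sketched like this. The 0.8 threshold is an arbitrary placeholder you'd tune on your own data:

```python
import numpy as np

def build_index(fingerprints):
    """Stack fingerprint vectors and L2-normalize each row,
    so a dot product against a normalized query equals cosine similarity."""
    arr = np.asarray(fingerprints, dtype=np.float64)
    return arr / np.linalg.norm(arr, axis=1, keepdims=True)

def nearest(index, query, threshold=0.8):
    """Return (row of the closest fingerprint, similarity),
    or (None, similarity) if nothing is close enough."""
    q = np.asarray(query, dtype=np.float64)
    q = q / np.linalg.norm(q)
    sims = index @ q
    best = int(np.argmax(sims))
    if sims[best] >= threshold:
        return best, float(sims[best])
    return None, float(sims[best])
```

With a million images, a brute-force dot product like this is still feasible, but an approximate nearest-neighbor index would be the usual next step at that scale.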

The intuition behind this approach is that unlike Perceptual Hashing, the NN is building up a high-level representation of the image including identifying edges, shapes, and important colors. For example, the fingerprint of an apple might include information about its circular shape, red color, and even its small stem.


You could also try something like this 2012 paper on image retrieval which uses a slew of hand-picked features such as SIFT, regional color moments and object contour fragments. This is probably a lot more work and it's not what TensorFlow is best at.


UPDATE

OP has provided an example pair of images from his application:

(Two images were shown here: one from the database, and one from the user that should match something in the database.)

Here are the results of using the demo on the pHash.org website on that pair of similar images as well as on a pair of completely dissimilar images.

Comparing the two images provided by the OP:

RADISH (radial hash): pHash determined your images are not similar with PCC = 0.518013

DCT hash: pHash determined your images are not similar with hamming distance = 32.000000.

Marr/Mexican hat wavelet: pHash determined your images are not similar with normalized hamming distance = 0.480903.

Comparing one of his images with a random image from my machine:

RADISH (radial hash): pHash determined your images are not similar with PCC = 0.690619.

DCT hash: pHash determined your images are not similar with hamming distance = 27.000000.

Marr/Mexican hat wavelet: pHash determined your images are not similar with normalized hamming distance = 0.519097.

Conclusion

We'll have to test more images to really know. But so far pHash does not seem to be doing very well. With the default thresholds it doesn't consider the similar images to be similar. And for one algorithm, it actually considers a completely random image to be more similar.

answered Sep 23 '22 by rafaelcosman