I'm currently trying to work with a brute-force feature matcher using SIFT in OpenCV, with Python. I want to use it for the image search function on my server: I input an image and compare it against others, in the hope that the matches will indicate a level of similarity. Is there a way to express that level of similarity using feature matching?
Currently, I'm playing around with what I found on this website, which I'll also post below:
import cv2
from matplotlib import pyplot as plt

img1 = cv2.imread('box.png', 0)           # queryImage (grayscale)
img2 = cv2.imread('box_in_scene.png', 0)  # trainImage (grayscale)

# Initiate SIFT detector (cv2.SIFT_create() in current OpenCV;
# older contrib builds use cv2.xfeatures2d.SIFT_create())
sift = cv2.SIFT_create()

# find the keypoints and descriptors with SIFT
kp1, des1 = sift.detectAndCompute(img1, None)
kp2, des2 = sift.detectAndCompute(img2, None)

# BFMatcher with default params
bf = cv2.BFMatcher()
matches = bf.knnMatch(des1, des2, k=2)

# Apply Lowe's ratio test
good = []
for m, n in matches:
    if m.distance < 0.75 * n.distance:
        good.append([m])

# cv2.drawMatchesKnn expects a list of lists as matches
img3 = cv2.drawMatchesKnn(img1, kp1, img2, kp2, good, None, flags=2)
plt.imshow(img3), plt.show()
What I'm using at the moment as a measure of 'similarity' is the number of 'good' matches that survive the ratio test, i.e. simply len(good).
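To make that concrete, the scoring step is essentially the single line below (good, kp1 and kp2 come from the snippet above); the keypoint-count normalisation is only a rough idea for keeping the number comparable between feature-rich and feature-poor images, not something I've settled on:

score = len(good)  # current similarity value: raw count of ratio-test survivors

# possible variation: normalise by the smaller keypoint count so that
# images with many keypoints don't automatically get higher scores
norm_score = len(good) / max(1, min(len(kp1), len(kp2)))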
This count is what I used to evaluate the similarity of the input image to each image in the database. However, I assume it's not as simple as this, because when I began testing with a picture of a shoe, images such as one of a banana received a higher number of 'good' matches than the other images of shoes, even scoring as more similar than the same shoe in a different colour.
I thought this might just be an anomaly, so I continued testing with a larger dataset of images, and again found that, against a shoe query, images such as a quad bike or a person received higher scores (more good matches) than the other shoes did.
So basically: how can I define the similarity of two images as a numerical value using feature matching?
Thank you.
The similarity of two images can be measured using the "imagehash" package. If two images are identical or almost identical, the imagehash difference will be 0; the closer the difference is to 0, the more similar the two images are.
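A minimal sketch of that approach (the file names are placeholders); subtracting two hashes gives a Hamming distance, so 0 means identical or near-identical and larger values mean less similar:

from PIL import Image
import imagehash

# perceptual hash of each image (average_hash, phash, dhash all work similarly)
hash1 = imagehash.phash(Image.open('query.png'))
hash2 = imagehash.phash(Image.open('candidate.png'))

difference = hash1 - hash2   # Hamming distance between the two hashes
print(difference)            # 0 = identical/near-identical, larger = less similar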
Cosine similarity is a commonly used similarity measure for real-valued vectors, used in (among other fields) information retrieval to score the similarity of documents in the vector space model.
The similarity measure is usually expressed as a numerical value: it gets higher when the data samples are more alike. It is often rescaled to a number between zero and one, where zero means low similarity (the data objects are dissimilar) and one means high similarity (the data objects are very similar).
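As a small illustration, assuming each image has already been reduced to a real-valued feature vector (for example a flattened colour histogram), cosine similarity can be computed with NumPy; the vectors below are made up:

import numpy as np

def cosine_similarity(a, b):
    # 1.0 = same direction (very similar), 0.0 = orthogonal (dissimilar)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

v1 = np.array([0.2, 0.5, 0.3])   # hypothetical feature vector of image 1
v2 = np.array([0.1, 0.6, 0.3])   # hypothetical feature vector of image 2
print(cosine_similarity(v1, v2))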
I think you need to choose better features in order to get better (i.e. more similar) results. SIFT is a local feature, and there is a good chance you will find matching SIFT features even between images that are semantically different (as with the shoe and the banana).
To improve similarity accuracy, I would suggest adding better features alongside SIFT, such as a colour histogram of the image: with colour histograms you will retrieve images that are similar in colour. You can use a good mix of features to measure similarity, and decide on that mix by checking what kinds of images are in your database and which features you feel would discriminate between the different semantic classes.
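A rough sketch of the colour-histogram idea in OpenCV (file names are placeholders); cv2.compareHist with the correlation method returns a value closer to 1.0 the more similar the colour distributions are:

import cv2

def colour_hist(path, bins=8):
    img = cv2.imread(path)  # BGR image
    # 3D histogram over the B, G and R channels, normalised so image size doesn't matter
    hist = cv2.calcHist([img], [0, 1, 2], None, [bins, bins, bins],
                        [0, 256, 0, 256, 0, 256])
    return cv2.normalize(hist, hist).flatten()

h1 = colour_hist('query.png')
h2 = colour_hist('candidate.png')
score = cv2.compareHist(h1, h2, cv2.HISTCMP_CORREL)  # higher = more similar colours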
If you are open to a slightly different method, I would suggest PLSA, which I have used myself. Probabilistic latent semantic analysis (PLSA) is an unsupervised learning algorithm that represents the data through a small number of hidden classes, i.e. in a lower-dimensional space. Similarity can then be found by computing the Euclidean distance between the new image's low-dimensional representation and those of all the other images; sort by distance to get the most similar images. Here too, choosing the right features is important, and you will also need to choose the number of hidden classes, which takes some experimentation.
I have a small project that uses PLSA for image retrieval, so if you don't mind the plug, here it is: PLSA Image retrieval. Unfortunately it is in Matlab, but you can see what is happening and try to use it. I used colour histograms as features, so choose features that will help you discern the different classes better.
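If you want to prototype the same pipeline in Python, scikit-learn has no PLSA implementation, but NMF is a closely related stand-in and the retrieval step (Euclidean distance in the reduced space) is the same; the feature matrix and the number of hidden classes below are placeholders you would have to tune:

import numpy as np
from sklearn.decomposition import NMF

# X: one row of (non-negative) features per database image, e.g. colour-histogram bins
X = np.random.rand(100, 96)        # placeholder feature matrix
model = NMF(n_components=10)       # number of hidden classes (needs experimenting)
Z = model.fit_transform(X)         # low-dimensional representation of each image

query = np.random.rand(1, 96)      # features of the query image
zq = model.transform(query)

# rank database images by Euclidean distance in the reduced space
distances = np.linalg.norm(Z - zq, axis=1)
ranking = np.argsort(distances)    # indices of the most similar images first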