i have some doubts incase of bag of words based image classification, i will first of tell what i have done <ol> <li>i have extracted the features from the training image with two different categories using SURF method,</li> <li>i have then made clustering of the features for the two categories.</li> <li>in order to classify my test image (i.e) to which of the two category the test image belongs to. for this classifying purpose i am using SVM classifier, but here is what i have a doubt , how do we input the test image do we have to do the same step from 1 to 2 again and then use it as a test set or is there any other method to do, </li> <li>also would be great to know the efficiency of the bow approach,</li> </ol> kindly some one provide me with an clarification

The classifier needs the representation for the test data to have the same meaning as the training data. So, when you're evaluating a test image, you extract the features and then make the histogram of which words from your original vocabulary they're closest to. That is: <ol> <li>Extract features from your entire training set.</li> <li>Cluster those features into a vocabulary V; you get K distinct cluster centers.</li> <li>Encode each training image as a histogram of the number of times each vocabulary element shows up in the image. Each image is then represented by a length-K vector.</li> <li>Train the classifier.</li> <li>When given a test image, extract the features. Now represent the test image as a histogram of the number of times each cluster center from V was closest to a feature in the test image. This is a length K vector again.</li> </ol> It's also often helpful to discount the histograms by taking the square root of the entries. This approximates a more realistic model for image features.

bag of words - image classification

Tags:

machine-learning

computer-vision

i have some doubts incase of bag of words based image classification, i will first of tell what i have done

i have extracted the features from the training image with two different categories using SURF method,
i have then made clustering of the features for the two categories.
in order to classify my test image (i.e) to which of the two category the test image belongs to. for this classifying purpose i am using SVM classifier, but here is what i have a doubt , how do we input the test image do we have to do the same step from 1 to 2 again and then use it as a test set or is there any other method to do,
also would be great to know the efficiency of the bow approach,

kindly some one provide me with an clarification

708

asked Dec 14 '12 11:12

user1903801

1 Answers

The classifier needs the representation for the test data to have the same meaning as the training data. So, when you're evaluating a test image, you extract the features and then make the histogram of which words from your original vocabulary they're closest to.

That is:

Extract features from your entire training set.
Cluster those features into a vocabulary V; you get K distinct cluster centers.
Encode each training image as a histogram of the number of times each vocabulary element shows up in the image. Each image is then represented by a length-K vector.
Train the classifier.
When given a test image, extract the features. Now represent the test image as a histogram of the number of times each cluster center from V was closest to a feature in the test image. This is a length K vector again.

It's also often helpful to discount the histograms by taking the square root of the entries. This approximates a more realistic model for image features.

answered Sep 30 '22 21:09

Danica

Related questions
                            
                                Cross entropy loss suddenly increases to infinity
                            
                                Homogeneous vs heterogeneous ensembles
                            
                                std::function has performances issues, how to avoid it?
                            
                                How does shuffling work with ImageDataGenerator in Machine Learning?
                            
                                How to model a shared layer in keras?
                            
                                sigmoid_cross_entropy loss function from tensorflow for image segmentation
                            
                                definition of error rate in classification and why some researchers use error rate instead of accuracy
                            
                                Column-dependent bounds in torch.clamp
                            
                                PyTorch LSTM input dimension
                            
                                Are the k-fold cross-validation scores from scikit-learn's `cross_val_score` and `GridsearchCV` biased if we include transformers in the pipeline?
                            
                                FastAi What does the slice(lr) do in fit_one_cycle()
                            
                                Implementing a trainable generalized Bump function layer in Keras/Tensorflow
                            
                                Sequence to Sequence - for time series prediction
                            
                                How to design a neural network to predict arrays from arrays
                            
                                Neural network in MATLAB
                            
                                Can k-means fall into an infinite loop ?
                            
                                NLTK/NLP buliding a many-to-many/multi-label subject classifier
                            
                                10*10 fold cross validation in scikit-learn?
                            
                                Disease named entity recognition
                            
                                How to approach Machine Learning problems with dynamically sized input collection?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With