Meaning of correctly classified instances in Weka

Tags:

weka

I recently started using Weka and I'm trying to classify tweets as positive or negative using Naive Bayes. I have a training set of tweets that I labeled myself and a test set of tweets that all have the label "positive". When I run Naive Bayes, I get the following results:

Correctly classified instances: 69 92%
Incorrectly classified instances: 6 8%

Then, if I change the labels of the tweets in the test set to "negative" and run Naive Bayes again, the results are inverted:

Correctly classified instances: 6 8%
Incorrectly classified instances: 69 92%

I thought that the correctly classified instances show the accuracy of Naive Bayes, and that it should be the same regardless of the labels of the tweets in the test set. Is there something wrong with my data, or do I misunderstand the meaning of correctly classified instances?

Thanks a lot for your time,

Nantia

asked Sep 03 '12 17:09

nadia

2 Answers

The labels on the test set are supposed to be the actual correct classification. Performance is computed by asking the classifier to give its best guess about the classification for each instance in the test set. Then the predicted classifications are compared to the actual classifications to determine accuracy. Therefore, if you flip the 'correct' values that you give it, the results will be flipped as well.
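The comparison described above can be sketched in a few lines of plain Python. This is only an illustration, not what Weka runs internally: the `accuracy` helper and the label lists are made up for the example, and it assumes the 69 and 6 in the question are instance counts out of 75 test tweets.

```python
def accuracy(predicted, actual):
    """Fraction of instances whose predicted label equals the given label."""
    matches = sum(p == a for p, a in zip(predicted, actual))
    return matches / len(actual)


# Assumed classifier output for the 75 test tweets: 69 predicted positive,
# 6 predicted negative. The predictions do not depend on the test labels.
predicted = ["positive"] * 69 + ["negative"] * 6

# The two labelings of the test set tried in the question.
all_positive = ["positive"] * 75
all_negative = ["negative"] * 75

acc_pos = accuracy(predicted, all_positive)
acc_neg = accuracy(predicted, all_negative)

print(f"test set labeled positive: {acc_pos:.0%}")
print(f"test set labeled negative: {acc_neg:.0%}")
```

The predictions are identical in both runs; only the reference labels they are compared against change, which is why the two percentages swap.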

answered Oct 26 '22 23:10

Antimony

Judging by the results you posted, the classifier trained on your training set labels 69 of your 75 test instances (92%) as positive. If the labels for the test set, that is, the correct answers, say that all instances are positive, then those 69 predictions (92%) are counted as correct. If the test set (and thus the classifier's output) stays the same but you switch the correct answers to negative, then only the 6 instances predicted as negative (8%) are counted as correct, so the percentages swap.

Keep in mind that in order to evaluate a classifier, you need the true labels of the test set. Otherwise you can't compare the classifier's answers with the true answers. It seems to me that you might have misunderstood this. You can obtain the labels for unseen data, if that is what you want, but in that case you can't evaluate classifier accuracy.
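To make the distinction between predicting and evaluating concrete, here is a small sketch. The `toy_classifier` function is a hypothetical keyword rule standing in for a trained model (it is not Naive Bayes and not Weka), and the tweets and hand-assigned labels are invented for the example.

```python
def toy_classifier(tweet):
    # Hypothetical stand-in for a trained model: a simple keyword rule.
    return "negative" if "bad" in tweet.lower() else "positive"


# Unseen tweets without labels: the classifier can still predict a label
# for each one, but there is nothing to compare the predictions against.
unlabeled = [
    "Great day",
    "Bad service",
    "Love waiting two hours for support",
]
predictions = [toy_classifier(t) for t in unlabeled]

# Evaluation needs the true labels, assigned independently of the classifier
# (here by hand; the third tweet is sarcastic, so it is labeled negative).
true_labels = ["positive", "negative", "negative"]
correct = sum(p == t for p, t in zip(predictions, true_labels))
accuracy = correct / len(true_labels)

print(predictions)
print(f"{correct} of {len(true_labels)} correct")
```

Setting every test label to the same value, as in the question, measures how often the classifier outputs that value, not how often it is right.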

answered Oct 26 '22 22:10

Junuxx

