Why scikit learn confusion matrix is reversed?

Tags:

I have 3 questions:

The confusion matrix for sklearn is as follows:

TN | FP
FN | TP

While when I'm looking at online resources, I find it like this:

TP | FP
FN | TN

Which one should I consider?

Since the above confusion matrix for scikit learn is different than the one I find in other rescources, in a multiclass confusion matrix, what's the structure will be? I'm looking at this post here: Scikit-learn: How to obtain True Positive, True Negative, False Positive and False Negative In that post, @lucidv01d had posted a graph to understand the categories for multiclass. is that category the same in scikit learn?

How do you calculate the accuracy of a multiclass? for example, I have this confusion matrix:

[[27  6  0 16]
 [ 5 18  0 21]
 [ 1  3  6  9]
 [ 0  0  0 48]]

In that same post I referred to in question 2, he has written this equation:

Overall accuracy

ACC = (TP+TN)/(TP+FP+FN+TN)

but isn't that just for binary? I mean, for what class do I replace TP with?

541

asked May 10 '19 12:05

John Sall

1 Answers

The reason why sklearn has show their confusion matrix like

TN | FP
FN | TP

like this is because in their code, they have considered 0 to be the negative class and one to be positive class. sklearn always considers the smaller number to be negative and large number to positive. By number, I mean the class value (0 or 1). The order depends on your dataset and class.

The accuracy will be the sum of diagonal elements divided by the sum of all the elements.p The diagonal elements are the number of correct predictions.

100

answered Nov 01 '22 08:11

secretive

Related questions
                            
                                Multi-label feature selection using sklearn
                            
                                TypeError: Expected sequence or array-like, got estimator
                            
                                Apply MinMaxScaler() on a pandas column
                            
                                Setting seed on train_test_split sklearn python
                            
                                Is it possible to specify handle_unknown = 'ignore' for certain columns and 'error' for others inside OneHotEncoder?
                            
                                Sklearn won't properly import plot_confusion_matrix
                            
                                Adjust size of ConfusionMatrixDisplay (ScikitLearn)
                            
                                Scikit Learn - Calculating TF-IDF from a corpus of arrays of features instead of from a corpus of raw documents
                            
                                Sklearn: Evaluate performance of each classifier of OneVsRestClassifier inside GridSearchCV
                            
                                How to update an SVM model with new data
                            
                                Plotting decision tree, graphvizm pydotplus
                            
                                python - sklearn Latent Dirichlet Allocation Transform v. Fittransform
                            
                                Why xgboost.cv and sklearn.cross_val_score give different results?
                            
                                What is row slicing vs What is column slicing?
                            
                                How to list all classification/regression/clustering algorithms in scikit-learn?
                            
                                Custom sklearn pipeline transformer giving "pickle.PicklingError"
                            
                                How to calculate the actual size of a .fit()-trained model in sklearn?
                            
                                What is the recommended way to persist (pickle) custom sklearn pipelines?
                            
                                RandomForestRegressor and feature_importances_ error
                            
                                AttributeError: 'int' object has no attribute 'lower' in TFIDF and CountVectorizer

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why scikit learn confusion matrix is reversed?

Tags:

scikit-learn

text-classification

confusion-matrix

performance-measuring

Overall accuracy

John Sall

People also ask

1 Answers

secretive

Recent Activity

Donate For Us