Is there an easy way to get confusion matrix for multiclass classification? (OneVsRest)

Tags:

I was using OneVsRest classifier on three class classification problem, (three random forests). Occurrence of each class is defined my dummy integer (1 for occurrence, 0 for otherwise). I was wondering if there is an easy alternative way to creating confusion matrix? As all approaches I came across, takes arguments in the form of y_pred, y_train = array, shape = [n_samples]. Ideally , I would like y_pred, y_train = array , shape = [n_samples, n_classes]

SOME SAMPLE , SIMILAR TO THE STRUCTURE OF THE PROBLEM:

y_train = np.array([(1,0,0), (1,0,0), (0,0,1), (1,0,0), (0,1,0)])
y_pred = np.array([(1,0,0), (0,1,0), (0,0,1), (0,1,0), (1,0,0)])


print(metrics.confusion_matrix(y_train, y_pred)

RETURNS: multilabel-indicator is not supported

431

asked Dec 07 '16 09:12

Gediminas Sadaunykas

Video Answer

2 Answers

I don't know what you have in mind since you didn't specify the output you're looking for, but here are two ways you could go about it:

1.One confusion matrix per column

In [1]:
for i in range(y_train.shape[1]):
    print("Col {}".format(i))
    print(metrics.confusion_matrix(y_train[:,i], y_pred[:,i]))
    print("")

Out[1]:
Col 0
[[1 1]
 [2 1]]

Col 1
[[2 2]
 [1 0]]

Col 2
[[4 0]
 [0 1]]

2.One confusion matrix altogether

For this, we are going to flatten the arrays:

In [2]: print(metrics.confusion_matrix(y_train.flatten(), y_pred.flatten()))

Out[2]:
[[7 3]
 [3 2]]

159

answered Oct 20 '22 18:10

Julien Marrec

You can try like below to get all the details in one go.

from sklearn.metrics import confusion_matrix
confusion_matrix(y_test.argmax(axis=1), y_pred.argmax(axis=1))

This will give you something like below:

array([[ 7,  0,  0,  0],
       [ 0,  7,  0,  0],
       [ 0,  1,  2,  4],
       [ 0,  1,  0, 11]])

-This means all diagonals are correctly predicted.

answered Oct 20 '22 18:10

Amaresh

Related questions
                            
                                How to define and use percentage in Pint
                            
                                How do i save many to many fields objects using django rest framework
                            
                                Tkinter - How to stop a loop with a stop button?
                            
                                How to capture website screenshot in high resolution?
                            
                                Pandas dataframe pivot not fitting in memory
                            
                                peewee and peewee-async: why is async slower
                            
                                Python: Open Excel Workbook using Win32 COM Api
                            
                                PyInstaller cannot add .txt files
                            
                                POST request works in Postman but not in Python
                            
                                Pandas find first nan value by rows and return column name
                            
                                Count number of tabs open in Selenium Python
                            
                                Set pandas.tseries.index.DatetimeIndex.freq with inferred_freq
                            
                                How do I read a 2 column csv file and create a dictionary?
                            
                                Check state of button in tkinter
                            
                                How to extract the cell state and hidden state from an RNN model in tensorflow?
                            
                                Packaging local module with pex
                            
                                GridSearchCV: How to specify test set?
                            
                                Securing Django OAuth Toolkit Views
                            
                                How to stop Django from escaping the # symbol
                            
                                Predict NA (missing values) with machine learning

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there an easy way to get confusion matrix for multiclass classification? (OneVsRest)

Tags:

python

pandas

numpy

scikit-learn

multilabel-classification