I'm doing a binary classification. Whenever my prediction equals the ground truth, I find <code>sklearn.metrics.confusion_matrix</code> to return a single value. Isn't there a problem? <pre class="prettyprint"><code>from sklearn.metrics import confusion_matrix print(confusion_matrix([True, True], [True, True]) # [[2]] </code></pre> I would expect something like: <pre class="prettyprint"><code>[[2 0] [0 0]] </code></pre>

You should fill-in <code>labels=[True, False]</code>: <pre class="prettyprint"><code>from sklearn.metrics import confusion_matrix cm = confusion_matrix(y_true=[True, True], y_pred=[True, True], labels=[True, False]) print(cm) # [[2 0] # [0 0]] </code></pre> <h3>Why?</h3> From the docs, the output of <code>confusion_matrix(y_true, y_pred)</code> is: <blockquote> C: ndarray of shape (n_classes, n_classes) </blockquote> The variable <code>n_classes</code> is either: <ul> <li>guessed as the number of unique values in <code>y_true</code> or <code>y_pred</code> </li> <li>taken from the length of optional parameters <code>labels</code> </li> </ul> In your case, because you did not fill in <code>labels</code>, the variable <code>n_classes</code> is guessed from the number of unique values in <code>[True, True]</code> which is 1. Hence the result.

Why is my confusion matrix returning only one number?

Tags:

python

scikit-learn

confusion-matrix

I'm doing a binary classification. Whenever my prediction equals the ground truth, I find sklearn.metrics.confusion_matrix to return a single value. Isn't there a problem?

from sklearn.metrics import confusion_matrix
print(confusion_matrix([True, True], [True, True])
# [[2]]

I would expect something like:

[[2 0]
 [0 0]]

511

asked Dec 11 '20 09:12

arnaud

1 Answers

You should fill-in labels=[True, False]:

from sklearn.metrics import confusion_matrix

cm = confusion_matrix(y_true=[True, True], y_pred=[True, True], labels=[True, False])
print(cm)

# [[2 0]
#  [0 0]]

Why?

From the docs, the output of confusion_matrix(y_true, y_pred) is:

C: ndarray of shape (n_classes, n_classes)

The variable n_classes is either:

guessed as the number of unique values in y_true or y_pred
taken from the length of optional parameters labels

In your case, because you did not fill in labels, the variable n_classes is guessed from the number of unique values in [True, True] which is 1. Hence the result.

137

answered Sep 22 '22 15:09

arnaud

Related questions
                            
                                settings.py, TypeError: unsupported operand type(s) for /: 'str' and 'str'
                            
                                How to drop row at certain index in every group in GroupBy object?
                            
                                Why doesn't my request.user have groups in Django?
                            
                                Tensorflow 2.3 and libcublas.so.10
                            
                                Finding local minimum values in pandas
                            
                                How to compare data from the same column in a dataframe (Pandas)
                            
                                how to give tuple via command line in python
                            
                                Order Pandas DataFrame by groups and Timestamp
                            
                                Azure kubernetes - python to read configmap?
                            
                                Airflow - call a operator inside a function
                            
                                How to apply regex over all the rows of a dataset?
                            
                                Using Playwright for Python, how do I select (or find) an element?
                            
                                How should a NamedTemporaryFile be annotated?
                            
                                How to properly insert pandas NaT datetime values to my postgresql table
                            
                                How to override hydra working dir from within a script?
                            
                                Numba data type error: Cannot unify array
                            
                                Get maximum subset in multidimensional array [closed]
                            
                                How do you list local profiles with boto3 from ~/.aws/.credentials and ~/.aws/.config files?
                            
                                How to extract info within a #shadow-root (open) using Selenium Python?
                            
                                Copying a section of a string from one column and putting it into a new pandas column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is my confusion matrix returning only one number?

Tags:

python

scikit-learn

confusion-matrix

arnaud

People also ask

1 Answers

Why?

arnaud

Recent Activity

Donate For Us