I am doing k-fold cross-validation on an existing dataframe, and I need to get the AUC score. The problem is that sometimes the test data only contains 0s, and no 1s!
I tried using this example, but with different numbers:
import numpy as np
from sklearn.metrics import roc_auc_score
y_true = np.array([0, 0, 0, 0])
y_scores = np.array([1, 0, 0, 0])
roc_auc_score(y_true, y_scores)
And I get this exception:
ValueError: Only one class present in y_true. ROC AUC score is not defined in that case.
Is there any workaround that can make it work in such cases?
The roc_auc_score always ranges from 0 to 1, and it measures how well the model ranks the predicted probabilities. 0.5 is the baseline for random guessing, so you always want to be above 0.5.
The AUC for the ROC can be calculated using the roc_auc_score() function. Like the roc_curve() function, the AUC function takes both the true outcomes (0,1) from the test set and the predicted probabilities for the 1 class. It returns the AUC score between 0.0 and 1.0 for no skill and perfect skill respectively.
The Area Under the Curve (AUC) is a measure of a classifier's ability to distinguish between classes and is used as a summary of the ROC curve. The higher the AUC, the better the model is at distinguishing between the positive and negative classes.
The ROC is also known as a relative operating characteristic curve, because it compares two operating characteristics, the True Positive Rate and the False Positive Rate, as the decision criterion changes. An ideal classifier has a ROC curve that reaches a true positive rate of 100% with zero false positives.
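For reference, here is a minimal call with made-up numbers where both classes are present in y_true, so the score is well defined:
import numpy as np
from sklearn.metrics import roc_auc_score

# Illustrative labels and scores containing both classes
y_true = np.array([0, 0, 1, 1])
y_scores = np.array([0.1, 0.4, 0.35, 0.8])
print(roc_auc_score(y_true, y_scores))  # 0.75 for these illustrative values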
You could use try-except to prevent the error:
import numpy as np
from sklearn.metrics import roc_auc_score
y_true = np.array([0, 0, 0, 0])
y_scores = np.array([1, 0, 0, 0])
try:
    roc_auc_score(y_true, y_scores)
except ValueError:
    pass
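A variation of the same idea, continuing with y_true and y_scores from the snippet above, is to record a fallback value for the fold instead of silently passing (the auc variable name is my own choice):
try:
    auc = roc_auc_score(y_true, y_scores)
except ValueError:
    # Only one class present in y_true for this fold, so the score is undefined;
    # fall back to 0.0 (or float("nan") if you prefer to skip the fold when averaging)
    auc = 0.0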
You can also set the roc_auc_score to a fixed value such as zero when only one class is present, as in the variation above. However, I wouldn't do this. I guess your test data is highly imbalanced, so I would suggest using stratified K-fold instead, so that you at least have both classes present in every fold.
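A minimal sketch of the stratified approach, assuming your features and labels are in arrays X and y; the random data and the LogisticRegression classifier here are placeholders for your own setup:
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# Placeholder data: replace with your own X and y
X = np.random.rand(100, 5)
y = np.random.randint(0, 2, size=100)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = []
for train_idx, test_idx in skf.split(X, y):
    clf = LogisticRegression().fit(X[train_idx], y[train_idx])
    # predict_proba[:, 1] is the predicted probability of the positive class
    y_scores = clf.predict_proba(X[test_idx])[:, 1]
    scores.append(roc_auc_score(y[test_idx], y_scores))

print(np.mean(scores))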
As the error notes, if only one class is present in the ground truth of a fold, the ROC AUC score is simply not defined in that case.
I'm against either throwing an exception (about what? This is the expected behaviour) or returning another metric (e.g. accuracy) in its place. The metric is not broken per se.
I don't feel like solving a data imbalance "issue" with a metric "fix". It would probably be better to use a different sampling strategy, if possible, or simply to join multiple batches that do satisfy the class population requirement.
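For example, joining the batches could mean pooling the out-of-fold predictions and computing a single AUC at the end. This is only a sketch with placeholder data and a placeholder classifier:
import numpy as np
from sklearn.model_selection import KFold
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# Placeholder data: replace with your own X and y
X = np.random.rand(100, 5)
y = np.random.randint(0, 2, size=100)

all_true, all_scores = [], []
for train_idx, test_idx in KFold(n_splits=5, shuffle=True, random_state=0).split(X):
    clf = LogisticRegression().fit(X[train_idx], y[train_idx])
    all_true.append(y[test_idx])
    all_scores.append(clf.predict_proba(X[test_idx])[:, 1])

# One pooled AUC over all folds: the pooled ground truth contains
# both classes even if an individual fold does not
print(roc_auc_score(np.concatenate(all_true), np.concatenate(all_scores)))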