Good ROC curve but poor precision-recall curve

Tags:

I have some machine learning results that I don't quite understand. I am using python sciki-learn, with 2+ million data of about 14 features. The classification of 'ab' looks pretty bad on the precision-recall curve, but the ROC for Ab looks just as good as most other groups' classification. What can explain that?

enter image description here

631

asked Oct 23 '15 03:10

KubiK888

1 Answers

Class imbalance.

Unlike the ROC curve, PR curves are very sensitive to imbalance. If you optimize your classifier for good AUC on an unbalanced data you are likely to obtain poor precision-recall results.

answered Oct 12 '22 21:10

Calimo

Related questions
                            
                                What's the difference between input_shape and batch_input_shape in LSTM
                            
                                Keras load_model with custom objects doesn't work properly
                            
                                module 'tensorflow.compat.v2.__internal__' has no attribute 'tf2'
                            
                                C/C++ Machine Learning Libraries for Clustering [closed]
                            
                                How to understand the output of Topic Model class in Mallet?
                            
                                "valid deviance" is nan for GBM model, What does this means and how to get rid of this?
                            
                                Pandas DataFrame RangeIndex
                            
                                fisher's linear discriminant in Python
                            
                                Machine learning for weighting adjustment
                            
                                Neural nets as universal approximators
                            
                                AttributeError list object has no attribute add
                            
                                Keras cifar10 example validation and test loss lower than training loss
                            
                                How to do point-wise categorical crossentropy loss in Keras?
                            
                                How to speed-up k-means from Scikit learn?
                            
                                Any python Support Vector Machine library around that allows online learning?
                            
                                Principal Component Analysis in MATLAB
                            
                                Decision Tree Learning and Impurity
                            
                                compare bayesian linear regression VS linear regression [closed]
                            
                                How to group nearby latitude and longitude locations stored in SQL
                            
                                Gradient descent convergence How to decide convergence?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Good ROC curve but poor precision-recall curve

Tags:

performance-testing

machine-learning

scikit-learn

roc

precision-recall

KubiK888

People also ask

1 Answers

Calimo

Recent Activity

Donate For Us