How to plot ROC curve in Python

Tags:

I am trying to plot a ROC curve to evaluate the accuracy of a prediction model I developed in Python using logistic regression packages. I have computed the true positive rate as well as the false positive rate; however, I am unable to figure out how to plot these correctly using matplotlib and calculate the AUC value. How could I do that?

509

asked Jul 29 '14 06:07

user3847447

2 Answers

Here are two ways you may try, assuming your model is an sklearn predictor:

import sklearn.metrics as metrics # calculate the fpr and tpr for all thresholds of the classification probs = model.predict_proba(X_test) preds = probs[:,1] fpr, tpr, threshold = metrics.roc_curve(y_test, preds) roc_auc = metrics.auc(fpr, tpr)  # method I: plt import matplotlib.pyplot as plt plt.title('Receiver Operating Characteristic') plt.plot(fpr, tpr, 'b', label = 'AUC = %0.2f' % roc_auc) plt.legend(loc = 'lower right') plt.plot([0, 1], [0, 1],'r--') plt.xlim([0, 1]) plt.ylim([0, 1]) plt.ylabel('True Positive Rate') plt.xlabel('False Positive Rate') plt.show()  # method II: ggplot from ggplot import * df = pd.DataFrame(dict(fpr = fpr, tpr = tpr)) ggplot(df, aes(x = 'fpr', y = 'tpr')) + geom_line() + geom_abline(linetype = 'dashed')

or try

ggplot(df, aes(x = 'fpr', ymin = 0, ymax = 'tpr')) + geom_line(aes(y = 'tpr')) + geom_area(alpha = 0.2) + ggtitle("ROC Curve w/ AUC = %s" % str(roc_auc))

answered Sep 23 '22 14:09

uniquegino

This is the simplest way to plot an ROC curve, given a set of ground truth labels and predicted probabilities. Best part is, it plots the ROC curve for ALL classes, so you get multiple neat-looking curves as well

import scikitplot as skplt import matplotlib.pyplot as plt  y_true = # ground truth labels y_probas = # predicted probabilities generated by sklearn classifier skplt.metrics.plot_roc_curve(y_true, y_probas) plt.show()

Here's a sample curve generated by plot_roc_curve. I used the sample digits dataset from scikit-learn so there are 10 classes. Notice that one ROC curve is plotted for each class.

ROC Curves

Disclaimer: Note that this uses the scikit-plot library, which I built.

answered Sep 21 '22 14:09

Reii Nakano

Related questions
                            
                                How do I use Django templates without the rest of Django?
                            
                                Pandas DataFrame stored list as string: How to convert back to list
                            
                                ValueError: unsupported pickle protocol: 3, python2 pickle can not load the file dumped by python 3 pickle?
                            
                                How do I get interactive plots again in Spyder/IPython/matplotlib?
                            
                                Method Resolution Order (MRO) in new-style classes?
                            
                                ImportError: No module named pandas
                            
                                "ImportError: No module named site" on Windows
                            
                                Scope of lambda functions and their parameters?
                            
                                Default filter in Django admin
                            
                                Change figure window title in pylab
                            
                                How can I check if a date is the same day as datetime.today()?
                            
                                Python Timezone conversion
                            
                                Pycharm and sys.argv arguments
                            
                                What is the most pythonic way to check if multiple variables are not None?
                            
                                AssertionError: View function mapping is overwriting an existing endpoint function: main
                            
                                Positional argument v.s. keyword argument
                            
                                NaN loss when training regression network
                            
                                Pythonic way to combine two lists in an alternating fashion?
                            
                                Differences between STATICFILES_DIR, STATIC_ROOT and MEDIA_ROOT
                            
                                How to obfuscate Python code effectively?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to plot ROC curve in Python

Tags:

python

matplotlib

plot

statistics

roc

user3847447

People also ask

2 Answers

uniquegino

Reii Nakano

Recent Activity

Donate For Us