Plotting a ROC curve in scikit yields only 3 points


TLDR: scikit's roc_curve function is only returning 3 points for a certain dataset. Why could this be, and how do we control how many points to get back?

I'm trying to draw a ROC curve, but consistently get a "ROC triangle".

    lr = LogisticRegression(multi_class = 'multinomial', solver = 'newton-cg')
    y = data['target'].values
    X = data[['feature']].values

    model = lr.fit(X,y)

    # get probabilities for clf
    probas_ = model.predict_log_proba(X)

Just to make sure the lengths are ok:

    print(len(y))
    print(len(probas_[:, 1]))

Returns 13759 on both.

Then running:

    false_pos_rate, true_pos_rate, thresholds = roc_curve(y, probas_[:, 1])
    print(false_pos_rate)

returns [ 0. 0.28240129 1. ]

If I inspect thresholds, I get array([ 0.4822225 , -0.5177775 , -0.84595197]) (always only 3 points).

It is therefore no surprise that my ROC curve looks like a triangle.

What I cannot understand is why scikit's roc_curve is only returning 3 points. Help hugely appreciated.


asked May 05 '15 11:05

sapo_cosmico

1 Answer

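The number of points depends on the number of unique values in the score vector passed to roc_curve: the function produces at most one threshold per distinct score, plus one extra point so that the curve starts at (0, 0). Your thresholds show that the scores take only 2 unique values (-0.5177775 and -0.84595197); the first threshold, 0.4822225, is just the maximum score plus 1, which roc_curve prepends in the version used here. Three points is therefore the correct output. There is no parameter to request more points; you only get more when the model produces more distinct scores, for example when the feature itself takes more than two values.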
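To see this in isolation, here is a minimal sketch on synthetic data (not the asker's dataset; the names two_valued_scores and continuous_scores, the score values 0.4 and 0.6, and the random labels are all assumptions made for illustration). It compares a score vector with two distinct values against a continuous one. The drop_intermediate argument of roc_curve is the only related knob: it controls whether collinear points are pruned, not how many thresholds exist in the first place.

```python
import numpy as np
from sklearn.metrics import roc_curve

rng = np.random.default_rng(0)

# Synthetic binary labels (assumption: stands in for data['target']).
y = rng.integers(0, 2, size=1000)

# Scores with only two distinct values, as a model fitted on a single
# two-valued feature would produce (hypothetical values 0.4 and 0.6).
binary_feature = rng.integers(0, 2, size=1000)
two_valued_scores = np.where(binary_feature == 1, 0.6, 0.4)

fpr, tpr, thresholds = roc_curve(y, two_valued_scores)
print("distinct scores:", len(np.unique(two_valued_scores)))
print("ROC points:", len(thresholds))

# Continuous scores give one threshold per distinct value, plus the
# prepended starting point, when intermediate points are not dropped.
continuous_scores = rng.random(1000)
fpr_c, tpr_c, thresholds_c = roc_curve(
    y, continuous_scores, drop_intermediate=False
)
print("distinct scores:", len(np.unique(continuous_scores)))
print("ROC points:", len(thresholds_c))
```

A quick check on real data along the same lines is to compare len(np.unique(probas_[:, 1])) with len(thresholds): if the first number is 2, a three-point "triangle" is expected.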

answered Sep 20 '22 17:09

pyan
