Does GridSearchCV use predict or predict_proba, when using auc_score as score function?
The predict function generates predicted class labels, which will always result in a triangular ROC curve. A more curved ROC curve is obtained from the predicted class probabilities, which are, as far as I know, the more accurate basis. If so, the area under the 'curved' ROC curve is probably the best measure of classification performance within the grid search.
Therefore I am curious whether the class labels or the class probabilities are used in the grid search when the area under the ROC curve is the performance measure. I tried to find the answer in the code, but could not figure it out. Does anyone here know the answer?
Thanks
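For illustration, here is a minimal sketch of the difference the question describes, computing AUC once from hard labels and once from probabilities. The dataset and the LogisticRegression classifier are only placeholder assumptions, written against the current scikit-learn API:

```python
# Sketch: AUC from hard labels vs. AUC from predicted probabilities.
# The data and classifier below are placeholders, not from the original question.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Hard labels: the ROC curve has a single corner, so it looks "triangular".
auc_labels = roc_auc_score(y_test, clf.predict(X_test))
# Probabilities of the positive class: the usual, properly curved ROC.
auc_proba = roc_auc_score(y_test, clf.predict_proba(X_test)[:, 1])
print(auc_labels, auc_proba)
```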
Note that GridSearchCV will use the same shuffling for each set of parameters validated by a single call to its fit method.
GridSearchCV tries all combinations of the values passed in the dictionary and evaluates the model for each combination using cross-validation. After running it we therefore get an accuracy/loss for every combination of hyperparameters, and we can choose the one with the best performance.
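A minimal sketch of that behaviour; the estimator, grid, and dataset below are only illustrative assumptions:

```python
# Sketch: GridSearchCV evaluates every parameter combination with cross-validation.
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
param_grid = {"C": [0.1, 1, 10], "gamma": [0.01, 0.1]}

search = GridSearchCV(SVC(), param_grid, cv=5)  # 5-fold CV for each of the 6 combinations
search.fit(X, y)

# Mean cross-validated score for every combination, and the best one
print(search.cv_results_["mean_test_score"])
print(search.best_params_, search.best_score_)
```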
The predict method is used to predict the actual class, while the predict_proba method can be used to infer the class probabilities (i.e. the probability that a particular data point falls into each of the underlying classes).
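A short sketch of that difference, using an arbitrary classifier on a toy dataset (both are assumptions for illustration):

```python
# Sketch: predict returns hard class labels; predict_proba returns one
# probability per class (columns follow clf.classes_).
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
clf = LogisticRegression(max_iter=1000).fit(X, y)

print(clf.predict(X[:3]))        # class labels, e.g. [0 0 0]
print(clf.predict_proba(X[:3]))  # shape (3, 3): probability for each class
print(clf.classes_)              # column order of predict_proba
```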
GridSearchCV is a technique for searching for the best parameter values over a given grid of parameters. It is essentially a cross-validation method: the model and the parameters need to be fed in, the best parameter values are extracted, and predictions are then made.
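A sketch of that workflow, assuming the default refit=True so that GridSearchCV refits the best estimator on the full training data and uses it for predictions; the estimator, grid, and data are placeholders:

```python
# Sketch: extract the best parameters, then predict with the refitted best estimator.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

X, y = make_classification(n_samples=300, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

search = GridSearchCV(RandomForestClassifier(random_state=0),
                      {"n_estimators": [50, 100], "max_depth": [3, None]},
                      cv=3)
search.fit(X_train, y_train)

print(search.best_params_)       # extracted best parameter values
y_pred = search.predict(X_test)  # predictions from the refitted best estimator
```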
We can also set the scoring parameter of the GridSearchCV model, as follows. By default a regressor is scored with the R-squared metric; here we pass score = make_scorer(mean_squared_error) instead, then fit the model and get the best estimator.
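A sketch of passing such a scorer; note that for an error metric like MSE one would normally also set greater_is_better=False, otherwise the search would favour the largest error. The dataset and estimator are placeholder assumptions:

```python
# Sketch: custom scoring for GridSearchCV via make_scorer.
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge
from sklearn.metrics import make_scorer, mean_squared_error
from sklearn.model_selection import GridSearchCV

X, y = make_regression(n_samples=200, noise=10.0, random_state=0)

# greater_is_better=False flips the sign so lower MSE ranks higher
score = make_scorer(mean_squared_error, greater_is_better=False)
search = GridSearchCV(Ridge(), {"alpha": [0.1, 1.0, 10.0]}, scoring=score, cv=5)
search.fit(X, y)

print(search.best_params_, search.best_estimator_)
```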
The scoring metric can be any metric of your choice. However, just like the estimator object, the scoring metric should be chosen based on the type of problem the project is trying to solve. The other two parameters in the grid search are where the limitations come into play.
It runs through all the different parameter combinations fed into the parameter grid and produces the best combination of parameters, based on a scoring metric of your choice (accuracy, f1, etc.). Obviously, nothing is perfect, and GridSearchCV is no exception.
To use auc_score for grid searching you really need to use predict_proba or decision_function, as you pointed out. This is not possible in the 0.13 release. If you do score_func=auc_score it will use predict, which doesn't make any sense.
Edit: since 0.14 it is possible to do a grid search using auc_score, by setting the new scoring parameter to roc_auc: GridSearchCV(est, param_grid, scoring='roc_auc'). It will do the right thing and use predict_proba (or decision_function if predict_proba is not available).
See the "what's new" page of the current dev version. You need to install the current master from GitHub to get this functionality, or wait until April (?) for 0.14.
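A sketch of the grid search this answer describes, written against the current API; the estimator, grid, and data are placeholder assumptions:

```python
# Sketch: scoring='roc_auc' makes GridSearchCV rank candidates by AUC computed
# from predict_proba / decision_function, not from predict.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, random_state=0)

# SVC exposes decision_function, which is what the AUC scorer falls back to
search = GridSearchCV(SVC(), {"C": [0.1, 1, 10]}, scoring='roc_auc', cv=5)
search.fit(X, y)

print(search.best_params_, search.best_score_)  # best_score_ is a mean AUC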
After performing some experiments with sklearn's SVC (which has predict_proba available), comparing some results from predict_proba and decision_function, it seems that roc_auc in GridSearchCV uses decision_function to compute AUC scores. I found a similar discussion here: Reproducing Sklearn SVC within GridSearchCV's roc_auc scores manually
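A sketch of the kind of manual comparison described above; the data and SVC parameters are placeholder assumptions:

```python
# Sketch: compare AUC computed from SVC's decision_function and from predict_proba.
from sklearn.datasets import make_classification
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=400, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = SVC(C=1.0, probability=True, random_state=0).fit(X_train, y_train)

auc_decision = roc_auc_score(y_test, clf.decision_function(X_test))
auc_proba = roc_auc_score(y_test, clf.predict_proba(X_test)[:, 1])

# The two can differ slightly because SVC's probabilities come from Platt scaling
# (an extra cross-validated calibration step), while decision_function is the raw margin.
print(auc_decision, auc_proba)
```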