I am using sklearn.svm.SVC from scikit-learn to do binary classification, and its predict_proba() method to get probability estimates. Can anyone tell me how predict_proba() calculates these probabilities internally?
The predict_proba() method accepts a single argument, the data over which the probabilities are to be computed, and returns an array in which each row contains the class probabilities for the corresponding input point.
One standard way to obtain a “probability” out of an SVM is to use Platt scaling, which is available in many decent SVM implementations. In the binary case, the probabilities are calibrated using Platt scaling: logistic regression on the SVM's scores, fit by an additional cross-validation on the training data.
With predict_proba(X_input), each row of the output has two columns, one per class, holding that class's probability; a sketch of this follows below.
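A minimal sketch of what that output looks like, using two classes of the iris dataset purely for illustration:

```python
# A minimal sketch of the output shape of predict_proba.
from sklearn import datasets, svm

X, y = datasets.load_iris(return_X_y=True)
X, y = X[y < 2], y[y < 2]            # keep two classes for a binary problem

clf = svm.SVC(probability=True)      # probability=True enables Platt scaling
clf.fit(X, y)

proba = clf.predict_proba(X[:3])
print(proba.shape)                   # (3, 2): one row per sample, one column per class
print(clf.classes_)                  # [0 1] -- the column order of predict_proba
print(proba.sum(axis=1))             # each row sums to 1
```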
The output of training is a decision function that tells us how close to the boundary we are (close to the boundary means a low-confidence decision). Positive decision values mean True, negative decision values mean False; a minimal sketch follows below.
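The original snippet here is cut off at `est = svm.`; the sketch below completes it as svm.SVC(), which is an assumption, to show the sign convention:

```python
# Sketch: the sign of decision_function gives the predicted class,
# its magnitude the confidence. `est = svm.SVC()` is assumed here.
from sklearn import datasets, svm

X, y = datasets.load_iris(return_X_y=True)
X, y = X[y < 2], y[y < 2]            # binary problem

est = svm.SVC()
est.fit(X, y)

scores = est.decision_function(X[:5])
print(scores)                        # signed distances from the hyperplane
print(scores > 0)                    # positive -> class 1, negative -> class 0
print(est.predict(X[:5]))            # matches the sign of the scores
```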
Scikit-learn uses LibSVM internally, and this in turn uses Platt scaling, as detailed in this note by the LibSVM authors, to calibrate the SVM to produce probabilities in addition to class predictions.
Platt scaling requires first training the SVM as usual, then optimizing scalar parameters A and B such that
P(y|X) = 1 / (1 + exp(A * f(X) + B))

where f(X) is the signed distance of a sample from the hyperplane (scikit-learn's decision_function method). You may recognize the logistic sigmoid in this definition, the same function that logistic regression and neural nets use for turning decision functions into probability estimates.
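A small sketch of that sigmoid, with made-up values for A and B; in a fitted SVC the learned parameters are exposed as the probA_ and probB_ attributes:

```python
# The Platt sigmoid applied to decision values. A and B are hypothetical here.
import numpy as np

def platt_probability(f, A, B):
    """P(y=1 | X) = 1 / (1 + exp(A * f(X) + B)) for decision values f."""
    return 1.0 / (1.0 + np.exp(A * f + B))

A, B = -1.5, 0.1                     # hypothetical fitted parameters; A is
                                     # usually negative, so larger f means
                                     # larger probability
f = np.array([-2.0, 0.0, 3.0])       # hypothetical decision_function outputs
print(platt_probability(f, A, B))    # ~[0.043, 0.475, 0.988]
```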
Mind you: the B parameter, the "intercept" or "bias" or whatever you like to call it, can cause predictions based on probability estimates from this model to be inconsistent with the ones you get from the SVM decision function f. E.g. suppose that f(X) = 10; then the prediction for X is positive. But if B = -9.9 and A = 1, then P(y|X) = .475, which implies the negative class. These numbers are pulled out of thin air, but this inconsistency can occur in practice.
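Checking those made-up numbers:

```python
# f(X) = 10 predicts the positive class, yet the Platt sigmoid with
# A = 1 and B = -9.9 gives a probability below 0.5.
import math

A, B, f = 1.0, -9.9, 10.0
p = 1.0 / (1.0 + math.exp(A * f + B))
print(round(p, 3))                   # 0.475 -> predict() and predict_proba() disagree
```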
Effectively, Platt scaling trains a probability model on top of the SVM's outputs under a cross-entropy loss function. To prevent this model from overfitting, it uses an internal five-fold cross-validation, which means that training SVMs with probability=True can be quite a lot more expensive than training a vanilla, non-probabilistic SVM.
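A rough sketch of that cost difference; the dataset and the absolute timings are illustrative only, and will vary by machine:

```python
# Comparing fit time with and without probability=True on synthetic data.
import time
from sklearn import datasets, svm

X, y = datasets.make_classification(n_samples=2000, random_state=0)

for prob in (False, True):
    clf = svm.SVC(probability=prob)
    t0 = time.perf_counter()
    clf.fit(X, y)
    print(f"probability={prob}: {time.perf_counter() - t0:.3f}s")
```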