I'm trying to understand the difference between RidgeClassifier and LogisticRegression in sklearn.linear_model. I couldn't find it in the documentation.
I think I understand quite well what LogisticRegression does: it computes the coefficients and intercept that minimise half the sum of squares of the coefficients plus C times the binary cross-entropy loss, where C is the regularisation parameter. I checked against a naive implementation from scratch, and the results coincide.
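For what it's worth, here is a minimal sketch of that kind of check, assuming a toy dataset from make_classification (the dataset and starting point are illustrative; the objective follows the formulation above, with the intercept left unpenalised as sklearn does):

```python
import numpy as np
from scipy.optimize import minimize
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, n_features=5, random_state=0)
y_pm = 2 * y - 1  # recode labels from {0, 1} to {-1, +1}
C = 1.0           # regularisation parameter, as in LogisticRegression(C=...)

def objective(params):
    # half the sum of squares of the coefficients + C * cross-entropy loss;
    # the intercept b is deliberately not penalised
    w, b = params[:-1], params[-1]
    margins = y_pm * (X @ w + b)
    return 0.5 * w @ w + C * np.logaddexp(0, -margins).sum()

res = minimize(objective, np.zeros(X.shape[1] + 1))

clf = LogisticRegression(C=C).fit(X, y)
print(np.round(res.x[:-1], 3), np.round(clf.coef_.ravel(), 3))  # should roughly coincide
print(np.round(res.x[-1], 3), np.round(clf.intercept_, 3))
```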
The results of RidgeClassifier differ, and I couldn't figure out how the coefficients and intercept are computed there. I looked at the GitHub code, but I'm not experienced enough to untangle it.
The reason I'm asking is that I like the RidgeClassifier results -- it generalises a bit better on my problem. But before I use it, I would like to at least have an idea of where it comes from.
Thanks for any help.
From the scikit-learn documentation for RidgeClassifier: "Classifier using Ridge regression. This classifier first converts the target values into {-1, 1} and then treats the problem as a regression task (multi-output regression in the multiclass case)."
RidgeClassifier() works differently from LogisticRegression() with an l2 penalty: the loss function for RidgeClassifier() is not cross-entropy. RidgeClassifier() uses the Ridge() regression model in the following way to create a classifier. Let us consider binary classification for simplicity:
1. Convert the target variable into +1 or -1 based on the class it belongs to.
2. Build a Ridge() model (which is a regression model) to predict that target variable. The loss function is MSE + l2 penalty.
3. If the Ridge() regression's prediction value (computed by decision_function()) is greater than 0, predict the positive class, else the negative class (see the sketch after these steps).
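A minimal sketch of steps 1-3, with an illustrative dataset (alpha is Ridge's regularisation strength; the equivalence follows the documented behaviour quoted above):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import Ridge, RidgeClassifier

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

clf = RidgeClassifier(alpha=1.0).fit(X, y)

# Steps 1 + 2: plain Ridge regression on {-1, +1} targets
reg = Ridge(alpha=1.0).fit(X, 2 * y - 1)

# Step 3: threshold the regression output at 0
pred = np.where(reg.predict(X) > 0, 1, 0)

print(np.allclose(clf.coef_, reg.coef_))     # same coefficients
print(np.array_equal(clf.predict(X), pred))  # same class predictions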
For multi-class classification:
1. Use LabelBinarizer() to create a multi-output regression scenario, and then train independent Ridge() regression models, one for each class (One-vs-Rest modelling).
2. Get a prediction from each class's Ridge() regression model (a real number per class) and then use argmax to predict the class, as the sketch below shows.
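A sketch of the multi-class case (again illustrative: LabelBinarizer with neg_label=-1 produces the {-1, +1} coding, and a single multi-output Ridge() fit is equivalent to independent per-class fits):

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.linear_model import Ridge, RidgeClassifier
from sklearn.preprocessing import LabelBinarizer

X, y = load_iris(return_X_y=True)

clf = RidgeClassifier(alpha=1.0).fit(X, y)

# One-vs-Rest by hand: {-1, +1} indicator targets, one regression per class
Y = LabelBinarizer(neg_label=-1, pos_label=1).fit_transform(y)
reg = Ridge(alpha=1.0).fit(X, Y)  # multi-output regression

# argmax over the per-class real-valued scores picks the predicted class
pred = reg.predict(X).argmax(axis=1)

print(np.array_equal(clf.predict(X), pred))  # should print True
```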