Can we use the Normal Equation for Logistic Regression?

Just like we use the Normal Equation to find the optimum theta value in Linear Regression, can we use a similar formula for Logistic Regression? If not, why not? I'd be grateful if someone could explain the reasoning behind it. Thank you.

asked Jun 23 '16 16:06

user2125722

1 Answer

Unfortunately, no. Only two methods in classification theory have closed-form solutions: linear regression and linear discriminant analysis (the Fisher discriminant).

In general, it is considered something of a miracle that it "works" even for linear regression. As far as I know, it is nearly impossible to prove that "you cannot solve logistic regression in closed form"; however, the general understanding is that it will never be the case. You can do it if your features are binary only and you have very few of them (as the solution is exponential in the number of features), which was shown a few years ago, but in the general case it is believed to be impossible.

So why did it work so well for linear regression? Because once you compute your derivatives, you will notice that the resulting problem is a set of linear equations, m equations in m variables, which we know can be solved directly through matrix inversion (and other techniques). When you differentiate the logistic regression cost, the resulting problem is no longer linear. It is convex (thus any optimum found is global), but not linear, and consequently current mathematics does not provide us with tools strong enough to find the optimum in closed form.
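To make the contrast concrete, here is a minimal NumPy sketch (my own illustration, not part of the original answer). It assumes a squared-error cost for linear regression and the usual negative log-likelihood for logistic regression; the function names and the synthetic data are made up for the example. The linear case is solved in one step from the normal equation, while the logistic case falls back on Newton's method, an iterative solver, because theta sits inside the sigmoid.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def linear_normal_equation(X, y):
    # Setting the gradient of the squared error to zero gives
    # X^T X theta = X^T y, a linear system we can solve directly.
    return np.linalg.solve(X.T @ X, X.T @ y)

def logistic_gradient(X, y, theta):
    # Gradient of the negative log-likelihood: X^T (sigmoid(X theta) - y).
    # theta appears inside the sigmoid, so "gradient = 0" is NOT linear in theta.
    return X.T @ (sigmoid(X @ theta) - y)

def logistic_newton(X, y, n_iter=25):
    # No closed form, so iterate: each Newton step solves a *linear* system
    # built from the current theta, then repeats.
    theta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = sigmoid(X @ theta)
        hessian = X.T @ (X * (p * (1.0 - p))[:, None])
        theta -= np.linalg.solve(hessian, X.T @ (p - y))
    return theta

# Synthetic data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
X = np.column_stack([np.ones(200), rng.normal(size=(200, 2))])
true_theta = np.array([0.5, -1.0, 2.0])
y_lin = X @ true_theta + 0.1 * rng.normal(size=200)
y_log = (rng.uniform(size=200) < sigmoid(X @ true_theta)).astype(float)

theta_lin = linear_normal_equation(X, y_lin)  # one direct solve
theta_log = logistic_newton(X, y_log)         # many solves, one per iteration
```

Because the logistic cost is convex, the iteration heads toward the single global optimum on data like this, but it still has to iterate; there is no one-shot formula playing the role of `linear_normal_equation`.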

That being said, there exists a closed-form solution (absolutely impractical computationally) if all your input variables are categorical, meaning they can only take finitely many values that you can enumerate: https://www.tandfonline.com/doi/abs/10.1080/02664763.2014.932760?journalCode=cjas20
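As a rough illustration of why categorical inputs are special, consider the simplest case I can think of: an intercept plus a single binary feature. This is my own toy example, not the construction from the linked paper. Here the maximum-likelihood fit just reproduces the observed proportion of positives in each group, so the coefficients can be read off from the log-odds (assuming both proportions are strictly between 0 and 1). The helper name `closed_form_single_binary` and the data are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logit(p):
    return np.log(p / (1.0 - p))

def closed_form_single_binary(x, y):
    # With one binary feature the model has one free probability per group,
    # so the optimum matches the observed proportions exactly.
    p0 = y[x == 0].mean()   # fraction of positives when x == 0
    p1 = y[x == 1].mean()   # fraction of positives when x == 1
    b0 = logit(p0)          # intercept
    b1 = logit(p1) - b0     # coefficient of x
    return np.array([b0, b1])

# Tiny hypothetical dataset.
x = np.array([0, 0, 0, 0, 1, 1, 1, 1, 1, 1])
y = np.array([0, 0, 0, 1, 1, 1, 1, 0, 1, 0], dtype=float)

beta = closed_form_single_binary(x, y)

# Sanity check: the gradient of the negative log-likelihood vanishes at beta.
X = np.column_stack([np.ones_like(y), x])
grad = X.T @ (sigmoid(X @ beta) - y)
```

With k binary features there are 2^k possible input combinations to enumerate, which gives some intuition for why enumeration-based approaches become impractical quickly.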

answered Sep 29 '22 12:09

lejlot

Tags:

machine-learning

logistic-regression

linear-regression
