How to calculate the regularization parameter in linear regression

Tags:

When we have a high degree linear polynomial that is used to fit a set of points in a linear regression setup, to prevent overfitting, we use regularization, and we include a lambda parameter in the cost function. This lambda is then used to update the theta parameters in the gradient descent algorithm.

My question is how do we calculate this lambda regularization parameter?

236

asked Aug 29 '12 16:08

London guy

1 Answers

The regularization parameter (lambda) is an input to your model so what you probably want to know is how do you select the value of lambda. The regularization parameter reduces overfitting, which reduces the variance of your estimated regression parameters; however, it does this at the expense of adding bias to your estimate. Increasing lambda results in less overfitting but also greater bias. So the real question is "How much bias are you willing to tolerate in your estimate?"

One approach you can take is to randomly subsample your data a number of times and look at the variation in your estimate. Then repeat the process for a slightly larger value of lambda to see how it affects the variability of your estimate. Keep in mind that whatever value of lambda you decide is appropriate for your subsampled data, you can likely use a smaller value to achieve comparable regularization on the full data set.

171

answered Sep 22 '22 14:09

bogatron

Related questions
                            
                                What is the difference between an Embedding Layer and a Dense Layer?
                            
                                Tensorflow Precision / Recall / F1 score and Confusion matrix
                            
                                Why feature scaling in SVM?
                            
                                How to calculate prediction uncertainty using Keras?
                            
                                How to predict time series in scikit-learn?
                            
                                Options for deploying R models in production
                            
                                scikit-learn: how to scale back the 'y' predicted result
                            
                                How can I classify data with the nearest-neighbor algorithm using Python?
                            
                                Evaluate multiple scores on sklearn cross_val_score
                            
                                How to tell which Keras model is better?
                            
                                What is the use of train_on_batch() in keras?
                            
                                What is the correct way to change image channel ordering between channels first and channels last?
                            
                                PCA For categorical features?
                            
                                Machine Learning and Natural Language Processing [closed]
                            
                                What is the difference between Keras model.evaluate() and model.predict()?
                            
                                Different decision tree algorithms with comparison of complexity or performance
                            
                                Received a label value of 1 which is outside the valid range of [0, 1) - Python, Keras
                            
                                How to calculate the number of parameters of convolutional neural networks?
                            
                                Can I use CountVectorizer in scikit-learn to count frequency of documents that were not used to extract the tokens?
                            
                                Labels for clustermap in seaborn?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to calculate the regularization parameter in linear regression

Tags:

machine-learning

data-mining

regression

London guy

People also ask

1 Answers

bogatron

Recent Activity

Donate For Us