 

Can someone explain to me the difference between a cost function and the gradient descent equation in logistic regression?

I'm going through the ML class on Coursera on logistic regression and also the Manning book Machine Learning in Action. I'm trying to learn by implementing everything in Python.

I'm not able to understand the difference between the cost function and the gradient. There are examples on the net where people compute the cost function, and there are other places where they don't and just apply the gradient descent update w := w − α∇w f(w).
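To make the question concrete, here is a toy sketch (my own, not taken from either course) of the kind of bare update loop I mean, minimizing f(w) = (w − 3)² without ever computing a cost:

    # Only the update rule w := w - alpha * f'(w), here for f(w) = (w - 3)**2,
    # so f'(w) = 2 * (w - 3). No cost function is computed anywhere.
    w, alpha = 0.0, 0.1
    for _ in range(100):
        w = w - alpha * 2 * (w - 3)
    print(w)  # ends up very close to 3, the minimizer of f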

What is the difference between the two if any?

asked Nov 29 '12 by oktapodi


1 Answer

Whenever you train a model with your data, you produce predicted values for the target variable. The dataset already contains the real values for that target, and we know that the closer the predicted values are to their corresponding real values, the better the model.

The cost function is what we use to measure how close the predicted values are to their corresponding real values.

The weights of the trained model are what determine how accurately it predicts. Imagine our model is y = 0.9*X + 0.1: the predicted value is simply 0.9*X + 0.1 for each X. (0.9 and 0.1 are just arbitrary values chosen for illustration.)

So, taking Y as the real value corresponding to each X, the cost function measures how close 0.9*X + 0.1 is to Y.

Our job is to find better weights (to replace 0.9 and 0.1) so that the model ends up with the lowest cost, i.e. predicted values as close as possible to the real ones.
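For instance, a minimal sketch of that idea in Python (assuming mean squared error as the cost and made-up data, just for illustration):

    import numpy as np

    # Made-up data: real values Y for a few inputs X
    X = np.array([1.0, 2.0, 3.0, 4.0])
    Y = np.array([1.1, 1.9, 2.8, 3.9])

    # Predictions of the example model y = 0.9*X + 0.1
    Y_pred = 0.9 * X + 0.1

    # Mean squared error: a single number saying how close Y_pred is to Y
    cost = np.mean((Y_pred - Y) ** 2)
    print(cost)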

Gradient descent is an optimization algorithm (there are others) whose responsibility is to find the weights that minimize the cost, by repeatedly trying the model with different weights, i.e. updating the weights.

We first run the model with some initial weights; over many (often thousands of) iterations, gradient descent updates the weights and evaluates the cost of the model with each new set of weights, moving towards the minimum cost.

One point to note: gradient descent does not minimize the weights, it only updates them. What the algorithm minimizes is the cost.
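To tie the two together, here is a minimal NumPy sketch for logistic regression (toy data; the learning rate and iteration count are just assumptions for the example). The cost is only computed to watch it fall, while gradient descent is the loop that actually updates the weights:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def cost(w, X, y):
        # Cross-entropy cost: how close the predictions are to the real labels
        h = sigmoid(X @ w)
        return -np.mean(y * np.log(h) + (1 - y) * np.log(1 - h))

    def gradient_descent(X, y, alpha=0.1, iters=1000):
        w = np.zeros(X.shape[1])
        for i in range(iters):
            h = sigmoid(X @ w)
            grad = X.T @ (h - y) / len(y)   # gradient of the cost w.r.t. the weights
            w -= alpha * grad               # the update step: w := w - alpha * grad
            if i % 100 == 0:
                print(i, cost(w, X, y))     # the cost should keep decreasing
        return w

    # Toy data: a bias column plus one feature; the label is 1 when the feature > 2
    X = np.array([[1.0, 0.5], [1.0, 1.5], [1.0, 2.5], [1.0, 3.5]])
    y = np.array([0.0, 0.0, 1.0, 1.0])
    w = gradient_descent(X, y)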

answered Sep 29 '22 by Reihan_amn