I am coding gradient descent in MATLAB. For two features, my update step is:
temp0 = theta(1,1) - (alpha/m)*sum((X*theta-y).*X(:,1));
temp1 = theta(2,1) - (alpha/m)*sum((X*theta-y).*X(:,2));
theta(1,1) = temp0;
theta(2,1) = temp1;
However, I want to vectorize this code and be able to apply it to any number of features. For the vectorization part, it turns out that what I am trying to do is a matrix multiplication:
theta = theta - (alpha/m) * (X' * (X*theta-y));
This looks right, but when I tried it, I thought it would not work for gradient descent, because the parameters are not updated simultaneously.
So how can I vectorize this code and make sure the parameters are updated at the same time?
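For reference, here is a minimal setup I am testing with (the data, sizes, and learning rate below are made up purely for illustration):

% Made-up example: m training examples, intercept column plus one feature
m = 5;
X = [ones(m,1), (1:m)'];   % design matrix with a column of ones
y = [2; 4; 6; 8; 10];      % target values
theta = zeros(2,1);        % initial parameters
alpha = 0.01;              % learning rate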
The gradient descent algorithm iteratively calculates the next point using the gradient at the current position, scales it by a learning rate, and subtracts the obtained value from the current position (takes a step). It subtracts because we want to minimise the function (to maximise it, we would add).
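As a one-line sketch in MATLAB, a single step might look like this, assuming gradf is a function handle that returns the gradient at x and alpha is the learning rate:

% One descent step: move against the gradient, scaled by the learning rate
% (gradf and alpha are assumed to be defined elsewhere)
x = x - alpha * gradf(x);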
Vectorization is the process of converting an algorithm from operating on a single value at a time to operating on a set of values (a vector) at one time. Modern CPUs provide direct support for vector operations, where a single instruction is applied to multiple data (SIMD).
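As a simple illustration (the vectors here are made up), the loop form and the vectorized form below compute the same result, but the second expresses the whole operation at once:

% Loop form: square each element one at a time
v = [1 2 3 4];
w = zeros(size(v));
for i = 1:numel(v)
    w(i) = v(i)^2;
end

% Vectorized form: one elementwise operation on the whole vector
w = v.^2;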
In machine learning, "vectorization" also has a second meaning, as a step in feature extraction: converting text into numerical vectors so that the model has distinct features to train on. That sense is unrelated to the code vectorization discussed here.
The gradient descent procedure is an algorithm for finding the minimum of a function. Suppose we have a function f(x), where x is a tuple of several variables, i.e., x = (x_1, x_2, …, x_n), and suppose that the gradient of f(x) is given by ∇f(x). We want to find the values of the variables (x_1, x_2, …, x_n) at which f(x) attains its minimum.
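A minimal sketch, assuming the one-variable function f(x) = x^2, whose gradient is 2*x (so the minimum is at x = 0):

% Gradient descent on f(x) = x^2; the gradient is 2*x
x = 10;           % assumed starting point
alpha = 0.1;      % assumed learning rate
for iter = 1:100
    x = x - alpha * 2*x;   % step against the gradient
end
disp(x)           % ends up very close to the minimum at 0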
For the vectorized version, try the following (two steps, to make the simultaneous update explicit):
gradient = (alpha/m) * X' * (X*theta - y);
theta = theta - gradient;
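Wrapped in a full descent loop, this might look like the following sketch (num_iters and the starting theta are assumed):

% Hypothetical outer loop around the vectorized update
num_iters = 1500;                  % assumed number of iterations
for iter = 1:num_iters
    gradient = (alpha/m) * X' * (X*theta - y);
    theta = theta - gradient;      % every parameter updated at once
end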
Your vectorization is correct. I also tried both versions of your code, and they gave me the same theta. Just remember not to use the already-updated theta in your second implementation.
This also works, but it is less compact than your second implementation:
Error = X * theta - y;             % residuals for the current theta
for i = 1:2
    S(i) = sum(Error .* X(:,i));   % gradient component for feature i
end
theta = theta - alpha * (1/m) * S';
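To apply the same loop to any number of features, the bound can be taken from the width of X instead of being hard-coded; a small sketch of that generalisation:

% Generalised to n features: iterate over every column of X
Error = X * theta - y;
n = size(X, 2);
S = zeros(1, n);                 % preallocate the gradient accumulator
for i = 1:n
    S(i) = sum(Error .* X(:,i));
end
theta = theta - alpha * (1/m) * S';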