I want to calculate cosine similarity between different rows of a matrix in matlab. I wrote the following code in matlab: <pre class="prettyprint"><code>for i = 1:n_row for j = i:n_row S2(i,j) = dot(S1(i,:), S1(j,:)) / (norm_r(i) * norm_r(j)); S2(j,i) = S2(i,j); </code></pre> matrix S1 is 11000*11000 and the code execution is very time consuming. So, I want to know Is there any function in matlab to calculate the cosine similarity between matrix rows faster than the above code?

Short version by calculating the similarity with <code>pdist</code>: <pre class="prettyprint"><code>S2 = squareform(1-pdist(S1,'cosine')) + eye(size(S1,1)); </code></pre> <h3>Explanation:</h3> <code>pdist(S1,'cosine')</code> calculates the cosine distance between all combinations of rows in <code>S1</code>. Therefore the similarity between all combinations is <code>1 - pdist(S1,'cosine')</code> . We can turn that into a square matrix where element <code>(i,j)</code> corresponds to the similarity between rows <code>i</code> and <code>j</code> with <code>squareform(1-pdist(S1,'cosine'))</code>. Finally we have to set the main diagonal to 1 because the similaritiy of a row with itself is obviously 1 but that is not explicitly calculated by <code>pdist</code>.

cosine similarity built-in function in matlab

Tags:

matrix

matlab

cosine-similarity

I want to calculate cosine similarity between different rows of a matrix in matlab. I wrote the following code in matlab:

Click to copy

for i = 1:n_row
    for j = i:n_row
        S2(i,j) = dot(S1(i,:), S1(j,:)) / (norm_r(i) * norm_r(j));
        S2(j,i) = S2(i,j);

matrix S1 is 11000*11000 and the code execution is very time consuming. So, I want to know Is there any function in matlab to calculate the cosine similarity between matrix rows faster than the above code?

808

asked Jan 04 '18 18:01

Mehdi

2 Answers

Short version by calculating the similarity with pdist:

Click to copy

S2 = squareform(1-pdist(S1,'cosine')) + eye(size(S1,1));

Explanation:

pdist(S1,'cosine') calculates the cosine distance between all combinations of rows in S1. Therefore the similarity between all combinations is 1 - pdist(S1,'cosine') .

We can turn that into a square matrix where element (i,j) corresponds to the similarity between rows i and j with squareform(1-pdist(S1,'cosine')).

Finally we have to set the main diagonal to 1 because the similaritiy of a row with itself is obviously 1 but that is not explicitly calculated by pdist.

answered Oct 15 '22 19:10

Leander Moesinger

Your code loops over all rows, and for each row loops over (about) half the rows, computing the dot product for each unique combination of rows:

Click to copy

n_row = size(S1,1);
norm_r = sqrt(sum(abs(S1).^2,2)); % same as norm(S1,2,'rows')
S2 = zeros(n_row,n_row);
for i = 1:n_row
  for j = i:n_row
    S2(i,j) = dot(S1(i,:), S1(j,:)) / (norm_r(i) * norm_r(j));
    S2(j,i) = S2(i,j);
  end
end

(I've taken the liberty to complete your code so it actually runs. Note the initialization of S2 before the loop, this saves a lot of time!)

If you note that the dot product is a matrix product of a row vector with a column vector, you can see that the above, without the normalization step, is identical to

Click to copy

S2 = S1 * S1.';

This runs much faster than the explicit loop, even if it is (maybe?) not able to use the symmetry. The normalization is simply dividing each row by norm_r and each column by norm_r. Here I multiply the two vectors to produce a square matrix to normalize with:

Click to copy

S2 = (S1 * S1.') ./ (norm_r * norm_r.');

answered Oct 15 '22 20:10

Cris Luengo

Related questions
                            
                                How can I customize the positions of legend elements?
                            
                                watershed algorithm in matlab
                            
                                Matlab how to change contourf plot's location on z axis
                            
                                Writing a cell (matlab) to a CSV file
                            
                                Using a MATLAB code on Scilab
                            
                                How to pass multiple output from function into cell array
                            
                                Assign rank to numbers in a vector
                            
                                How to auto-remove trailing whitespaces on save in Matlab?
                            
                                What is the equivalent of Matlab's surf(x,y,z,c) in matplotlib?
                            
                                Correct use of tilde operator for input arguments
                            
                                Draw rectangles on an image in Matlab
                            
                                Major and minor graticule for maps?
                            
                                double(<character>) gives different result in MATLAB and Octave
                            
                                How do I save a plotted image and maintain the original image size in MATLAB?
                            
                                MATLAB black hole variable
                            
                                pdist2 equivalent in MATLAB version 7
                            
                                Is there an equivalent of R's dput() for Matlab?
                            
                                Matlab - save(int2str(i), x) doesn't work - Argument must contain a string
                            
                                How to show several images in the same figue - Matlab
                            
                                Is it safe to delete a MATLAB script from within itself?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

cosine similarity built-in function in matlab

Tags:

matrix

matlab

cosine-similarity

Mehdi

People also ask

2 Answers

Explanation:

Leander Moesinger

Cris Luengo

Recent Activity

Donate For Us