pandas - apply function to current row against all other rows

Tags:

matrix

I am utilizing pandas to create a dataframe that appears as follows:

ratings = pandas.DataFrame({
    'article_a':[1,1,0,0],
    'article_b':[1,0,0,0],
    'article_c':[1,0,0,0],
    'article_d':[0,0,0,1],
    'article_e':[0,0,0,1]
},index=['Alice','Bob','Carol','Dave'])

I would like to compute another matrix from this input one that will compare each row against all other rows. Let's assume for example the computation was a function to find the length of the intersection set, I'd like an output DataFrame with the len(intersection(Alice,Bob)), len(intersection(Alice,Carol)), len(intersection(Alice,Dave)) in the first row, with each row following that format against the others. Using this example input, the output matrix would be 4x3:

len(intersection(Alice,Bob)),len(intersection(Alice,Carol)),len(intersection(Alice,Dave))
len(intersection(Bob,Alice)),len(intersection(Bob,Carol)),len(intersection(Bob,Dave))
len(intersection(Carol,Alice)),len(intersection(Carol,Bob)),len(intersection(Carol,Dave))
len(intersection(Dave,Alice)),len(intersection(Dave,Bob)),len(intersection(Dave,Carol))

Is there a named method for this kind of function based computation in pandas? What would be the most efficient way to accomplish this?

688

asked Jun 04 '13 17:06

DeaconDesperado

2 Answers

I am not aware of a named method, but I have a one-liner.

In [21]: ratings.apply(lambda row: ratings.apply(
... lambda x: np.equal(row, x), 1).sum(1), 1)
Out[21]: 
       Alice  Bob  Carol  Dave
Alice      5    3      2     0
Bob        3    5      4     2
Carol      2    4      5     3
Dave       0    2      3     5

137

answered Oct 14 '22 07:10

Dan Allan

@Dan Allan solution is 'right', here's a slightly different way of approaching the problem

In [26]: ratings
Out[26]: 
       article_a  article_b  article_c  article_d  article_e
Alice          1          1          1          0          0
Bob            1          0          0          0          0
Carol          0          0          0          0          0
Dave           0          0          0          1          1

In [27]: ratings.apply(lambda x: (ratings.T.sub(x,'index')).sum(),1)
Out[27]: 
       Alice  Bob  Carol  Dave
Alice      0   -2     -3    -1
Bob        2    0     -1     1
Carol      3    1      0     2
Dave       1   -1     -2     0

answered Oct 14 '22 08:10

Jeff

Related questions
                            
                                Inverse of a matrix 3x3 using symbols
                            
                                How to quickly determine if a matrix is a permutation matrix
                            
                                Solving a simple matrix in row-reduced form in C++
                            
                                Adding "hold on" after "figure" causes the plot to be different
                            
                                Convert upper triangular part of a matrix to 3-column long format
                            
                                Transform Pandas dataframe into frequency matrix
                            
                                Python: Non diagonal elements of a matrix to 0
                            
                                How can I accelerate a sparse matrix by dense vector product, currently implemented via scipy.sparse.csc_matrix.dot, using CUDA?
                            
                                Randomly sample contiguous rows from a data frame or matrix
                            
                                Applying a matrix to a function [duplicate]
                            
                                Julia: delete rows and columns from an array or matix
                            
                                How to select entire matrix except certain rows and columns?
                            
                                Is it possible to have a rotationally invariant identifier of a boolean matrix?
                            
                                How can I divide a matrix into unequally-sized submatrices?
                            
                                Find number of areas in a matrix
                            
                                SSRS How to move a row group?
                            
                                Rotating a Open GL camera correctly using GLM
                            
                                How to change matrix column type in R
                            
                                Calculating distance of all the points in a region with each other
                            
                                Product of two Toeplitz matrices?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With