Get Rankings of Column Names in Pandas Dataframe

Tags:

I have pivoted the Customer ID against their most frequently purchased genres of performances:

Genre            Jazz     Dance     Music  Theatre
Customer                                        
100000000001           0      3         1        2
100000000002           0      1         6        2
100000000003           0      3        13        4
100000000004           0      5         4        1
100000000005           1     10        16       14

My desired result is to append the column names according to the rankings:

Click to copy

Genre            Jazz     Dance     Music  Theatre          Rank1          Rank2          Rank3          Rank4
Customer                                         
100000000001           0      3         1        2          Dance        Theatre          Music           Jazz
100000000002           0      1         6        2          Music        Theatre          Dance           Jazz
100000000003           0      3        13        4          Music        Theatre          Dance           Jazz
100000000004           0      5         4        1          Dance          Music        Theatre           Jazz
100000000005           1     10        16       14          Music        Theatre          Dance           Jazz

I have looked up some threads but the closest thing I can find is idxmax. However that only gives me Rank1.

Could anyone help me to get the result I need?

Thanks a lot!

Dennis

490

asked Aug 10 '20 15:08

dendoniseden

Video Answer

3 Answers

Use:

Click to copy

i = np.argsort(df.to_numpy() * -1, axis=1)
r = pd.DataFrame(df.columns[i], index=df.index, columns=range(1, i.shape[1] + 1)) 
df = df.join(r.add_prefix('Rank'))

Details:

Use np.argsort along axis=1 to get the indices i that would sort the genres in descending order.

Click to copy

print(i)
array([[1, 3, 2, 0],
       [2, 3, 1, 0],
       [2, 3, 1, 0],
       [1, 2, 3, 0],
       [2, 3, 1, 0]])

Create a new dataframe r from the columns of dataframe df taken along the indices i (i.e df.columns[i]), then use DataFrame.join to join the dataframe r with df:

Click to copy

print(df)
              Jazz  Dance  Music  Theatre  Rank1    Rank2    Rank3 Rank4
Customer                                                                
100000000001     0      3      1        2  Dance  Theatre    Music  Jazz
100000000002     0      1      6        2  Music  Theatre    Dance  Jazz
100000000003     0      3     13        4  Music  Theatre    Dance  Jazz
100000000004     0      5      4        1  Dance    Music  Theatre  Jazz
100000000005     1     10     16       14  Music  Theatre    Dance  Jazz

198

answered Oct 13 '22 14:10

Shubham Sharma

Try this:

Click to copy

dfp = (df.rank(ascending=False, axis=1).stack()
         .astype(int).rename('rank').reset_index(level=1))
df.assign(**dfp.set_index('rank', append=True)['Genre'].unstack().add_prefix('Rank'))

Output:

Click to copy

Genre         Jazz  Dance  Music  Theatre  Rank1    Rank2    Rank3 Rank4
Customer                                                                
100000000001     0      3      1        2  Dance  Theatre    Music  Jazz
100000000002     0      1      6        2  Music  Theatre    Dance  Jazz
100000000003     0      3     13        4  Music  Theatre    Dance  Jazz
100000000004     0      5      4        1  Dance    Music  Theatre  Jazz
100000000005     1     10     16       14  Music  Theatre    Dance  Jazz

Use rank and reshape dataframe, then join back to original dataframe using assign.

answered Oct 13 '22 14:10

Scott Boston

Lets try stack, cumcount and sort_values

Click to copy

s = df.stack().sort_values(ascending=False).groupby(level=0).cumcount() + 1
s1 = (s.reset_index(1)
    .set_index(0, append=True)
    .unstack(1)
    .add_prefix("Rank")
    
    )
s1.columns = s1.columns.get_level_values(1)

then join back on your customer genre index.

Click to copy

df.join(s1)

Click to copy

                 Jazz  Dance  Music  Theatre  Rank1    Rank2    Rank3 Rank4
Customer_Genre                                                            
100000000001       0      3      1        2  Dance  Theatre    Music  Jazz
100000000002       0      1      6        2  Music  Theatre    Dance  Jazz
100000000003       0      3     13        4  Music  Theatre    Dance  Jazz
100000000004       0      5      4        1  Dance    Music  Theatre  Jazz
100000000005       1     10     16       14  Music  Theatre    Dance  Jazz

answered Oct 13 '22 14:10

Umar.H

Related questions
                            
                                How to change the time of a Pandas datetime column to midnight?
                            
                                AttributeError: 'NoneType' object has no attribute 'time' paramiko
                            
                                How to download a file from Google Drive using Python and the Drive API v3
                            
                                SignatureDoesNotMatch - Boto3 Django-storages
                            
                                Anaconda won't update spyder 4
                            
                                pytorch conv2d value cannot be converted to type uint8_t without overflow
                            
                                How to get the latest release version in Github only use python-requests?
                            
                                Most pythonic way to provide defaults for class constructor
                            
                                Explosion in loss function, LSTM autoencoder
                            
                                Tensorflow 2.0: Cannot Import tf.keras.utils.conv_utils
                            
                                ModuleNotFoundError: No module named 'tf'
                            
                                No module named 'sklearn.svm._classes' when loading model from colab
                            
                                Fastest Way To Filter A Pandas Dataframe Using A List
                            
                                How to convert selected column with index to a list of tuples in pandas
                            
                                Jupyter Notebook : 'head' is not recognized as an internal or external command, operable program or batch file
                            
                                Get a decision tree in a dictionary
                            
                                Adding claims to DRF simple JWT payload
                            
                                Error when loading scipy: OSError: [WinError 126] The specified module could not be found
                            
                                sum DataFrame rows and columns
                            
                                What is the correct syntax for Walrus operator with ternary operator?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Get Rankings of Column Names in Pandas Dataframe

Tags:

python

pandas

dataframe

dendoniseden

People also ask

Video Answer

3 Answers

Shubham Sharma

Scott Boston

Umar.H

Recent Activity

Donate For Us