Find names of top-n highest-value columns in each pandas dataframe row

Tags:

I have the following dataframe:

  id     p1 p2 p3 p4
  1      0  9  1  4
  2      0  2  3  4
  3      1  3 10  7
  4      1  5  3  1
  5      2  3  7 10

I need to reshape the data frame in a way that for each id it will have the top 3 columns with the highest values. The result would be like this:

 id top1 top2 top3
  1  p2   p4   p3
  2  p4   p3   p2
  3  p3   p4   p2
  4  p2   p3   p4/p1
  5  p4   p3   p2

It shows the top 3 best sellers for every user_id. I have already done it using the dplyr package in R, but I am looking for the pandas equivalent.

770

asked Aug 15 '16 12:08

chessosapiens

1 Answers

You could use np.argsort to find the indices of the n largest items for each row:

import numpy as np
import pandas as pd

df = pd.DataFrame({'id': [1, 2, 3, 4, 5],
 'p1': [0, 0, 1, 1, 2],
 'p2': [9, 2, 3, 5, 3],
 'p3': [1, 3, 10, 3, 7],
 'p4': [4, 4, 7, 1, 10]})
df = df.set_index('id')

nlargest = 3
order = np.argsort(-df.values, axis=1)[:, :nlargest]
result = pd.DataFrame(df.columns[order], 
                      columns=['top{}'.format(i) for i in range(1, nlargest+1)],
                      index=df.index)

print(result)

yields

   top1 top2 top3
id               
1    p2   p4   p3
2    p4   p3   p2
3    p3   p4   p2
4    p2   p3   p1
5    p4   p3   p2

answered Oct 06 '22 22:10

unutbu

Related questions
                            
                                Python code works, but eclipse shows error - Syntax error while detecting tuple
                            
                                Remove attribute from all MongoDB documents using Python and PyMongo
                            
                                How to compare two classes/types in python?
                            
                                PyQt: app.exec_() stops all following code from running
                            
                                Django how to make form fields optional
                            
                                request.args.get('key') gives NULL - Flask
                            
                                Possible to add newline to .format() method?
                            
                                Django File Upload and Rename
                            
                                Python idiom to get same result as calling os.path.dirname multiple times?
                            
                                Python / Remove special character from string
                            
                                Python and Pandas - Moving Average Crossover
                            
                                extract dictionary from Counter object
                            
                                Python: Apply function to values in nested dictionary
                            
                                How to create standalone executable file from python 3.5 scripts?
                            
                                set legend for plot with several lines (in python)
                            
                                SQLAlchemy - condition on join fails with AttributeError: Neither 'BinaryExpression' object nor 'Comparator' object has an attribute 'selectable'
                            
                                Python/Pandas Dataframe replace 0 with median value
                            
                                numpy.argmin for elements greater than a threshold
                            
                                Find First Non-zero Value in Each Row of Pandas DataFrame
                            
                                Python generator to read large CSV file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Find names of top-n highest-value columns in each pandas dataframe row

Tags:

python

pandas

dataframe

top-n

chessosapiens

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us