I want to find the highest 3 values of each column in a dataframe, and return the index names, ordered by value. The dataframe looks like this: <pre class="prettyprint"><code>df = pd.DataFrame({"u1":[1,2,-3,4,5], "u2":[8,-4,5,6,7], "u3":[np.NaN,np.NaN,np.NaN,np.NaN,np.NaN]}, index=["q1","q2","q3","q4","q5"]) </code></pre> The result would look like this: <pre class="prettyprint"><code>u1 u2 u3 q5 q1 NaN q4 q5 NaN q2 q4 NaN </code></pre>

You can use <code>apply</code> with <code>pandas.Series.nlargest</code> function. <pre class="prettyprint"><code>df.apply(lambda x: pd.Series(x.nlargest(3).index)) u1 u2 u3 0 q5 q1 NaN 1 q4 q5 NaN 2 q2 q4 NaN </code></pre>

Finding highest n values of every column in dataframe [duplicate]

Tags:

python

python-3.x

pandas

I want to find the highest 3 values of each column in a dataframe, and return the index names, ordered by value. The dataframe looks like this:

df = pd.DataFrame({"u1":[1,2,-3,4,5],
                   "u2":[8,-4,5,6,7],
                   "u3":[np.NaN,np.NaN,np.NaN,np.NaN,np.NaN]},
                   index=["q1","q2","q3","q4","q5"])

The result would look like this:

u1   u2   u3
q5   q1   NaN
q4   q5   NaN
q2   q4   NaN

613

asked May 20 '20 16:05

d_gnz

1 Answers

You can use apply with pandas.Series.nlargest function.

df.apply(lambda x: pd.Series(x.nlargest(3).index))
   u1  u2   u3
0  q5  q1  NaN
1  q4  q5  NaN
2  q2  q4  NaN

answered Sep 22 '22 14:09

Dishin H Goyani

Related questions
                            
                                TypeError: module() takes at most 2 arguments (3 given) code taken from pluralsight course [duplicate]
                            
                                Error: too many values to unpack (expected 2) when raise error with serializers.ValidationError
                            
                                How to print docstring for class attribute/element?
                            
                                Accessing Microsoft Sharepoint files and data using Python
                            
                                Python how to combine two columns of a dataframe into a single list?
                            
                                AWS Lambda, Python, Numpy and others as Layers
                            
                                Gradcam with guided backprop for transfer learning in Tensorflow 2.0
                            
                                ValueError: 2 columns passed, passed data had 1 columns
                            
                                Get indices of elements in tensor a that are present in tensor b
                            
                                Invoking Google Cloud Function from python using service account for authentication
                            
                                Is it possible to test a while True loop with pytest (I try with a timeout)?
                            
                                Simple Python question: Why can't I assign a variable to a sorted list (in place)? [duplicate]
                            
                                Airflow: Unable to access the AWS providers
                            
                                Difference between Numpy and Tensorflow? [closed]
                            
                                How to handle exception and exit?
                            
                                How to sort a tensor by first dimension in pytorch?
                            
                                Machine learning regression model predicts same value for every image
                            
                                Convert cx_Oracle.LOB data to string in python
                            
                                Why does time.sleep(...) not get affected by the GIL?
                            
                                Groupby a part of the string in pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With