DataFrame sorting based on a function of multiple column values

Tags: python, sorting, pandas, dataframe

This builds on the question "python, sort descending dataframe with pandas".

Given:

from pandas import DataFrame
import pandas as pd

d = {'x':[2,3,1,4,5],
     'y':[5,4,3,2,1],
     'letter':['a','a','b','b','c']}

df = DataFrame(d)

df then looks like this:

df:
      letter    x    y
    0      a    2    5
    1      a    3    4
    2      b    1    3
    3      b    4    2
    4      c    5    1

I would like to have something like:

f = lambda x,y: x**2 + y**2
test = df.sort(f('x', 'y'))

This should order the complete dataframe with respect to the sum of the squared values of column 'x' and 'y' and give me:

test:
      letter    x    y
    2      b    1    3
    3      b    4    2
    1      a    3    4
    4      c    5    1
    0      a    2    5

Ascending or descending order does not matter. Is there a nice and simple way to do that? I could not yet find a solution.

asked Jul 29 '16 15:07

Ohumeronen

1 Answer

You can create a temporary column, sort by it, and then drop it:

df.assign(f = df['x']**2 + df['y']**2).sort_values('f').drop('f', axis=1)
Out: 
  letter  x  y
2      b  1  3
3      b  4  2
1      a  3  4
4      c  5  1
0      a  2  5
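
If you would rather not add and drop a column, one possible alternative is to compute the sort key as a separate Series and reorder the rows by position. The sketch below assumes the column names 'x' and 'y' from the question; sort_by_function is a hypothetical helper written for illustration, not a pandas method.

```python
import pandas as pd

# Sample data from the question.
df = pd.DataFrame({
    'x': [2, 3, 1, 4, 5],
    'y': [5, 4, 3, 2, 1],
    'letter': ['a', 'a', 'b', 'b', 'c'],
})


def sort_by_function(frame, func, columns):
    """Hypothetical helper: sort rows by func applied to the given columns."""
    # Evaluate the function on whole columns (vectorised), giving one key per row.
    key = func(*(frame[c] for c in columns))
    # Positions that would sort the key in ascending order; 'stable' keeps
    # the original relative order of rows whose keys are equal.
    order = key.to_numpy().argsort(kind='stable')
    # Reorder by position, so this also works when index labels repeat.
    return frame.iloc[order]


f = lambda x, y: x**2 + y**2
test = sort_by_function(df, f, ['x', 'y'])
print(test)
```

For the sample data the keys are 29, 25, 10, 20 and 26, so the rows come back with index order 2, 3, 1, 4, 0, the same as with the temporary column. Reverse the positions (order[::-1]) if you want descending order.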

answered Oct 08 '22 12:10

ayhan
