How to group by one column and sort the values of another column?

Tags: python, sorting, pandas, group-by

Here is my dataframe:

import pandas as pd
df = pd.DataFrame({'A': ['one', 'one', 'two', 'two', 'one'],
                   'B': ['Ar', 'Br', 'Cr', 'Ar', 'Ar'],
                   'C': ['12/15/2011', '11/11/2001', '08/30/2015', '07/3/1999', '03/03/2000'],
                   'D': [1, 7, 3, 4, 5]})

My goal is to group by column A and sort the rows within each group by column B.

Here is what I came up with:

sort_group = df.sort_values('B').groupby('A')

I was hoping that the grouping operation would preserve the sort order, but it does not work. It also returns a groupby object rather than a dataframe:

<pandas.core.groupby.DataFrameGroupBy object at 0x0000000008B190B8>

Any suggestions?

264

asked Nov 17 '16 22:11

user1700890

1 Answer

You cannot call sort_values directly on a groupby object; you need to go through apply instead:

df.groupby('A').apply(lambda x: x.sort_values('B'))

This gives you the desired output:

         A   B           C  D
A                            
one 0  one  Ar  12/15/2011  1
    4  one  Ar  03/03/2000  5
    1  one  Br  11/11/2001  7
two 3  two  Ar   07/3/1999  4
    2  two  Cr  08/30/2015  3
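
If you only need the rows ordered by B inside each value of A, and not the extra index level, a plain multi-column sort may be enough. The sketch below is an alternative to the apply call above, not part of the original answer; the names sorted_df and groups are just illustrative. It also shows that iterating over a groupby object yields one sub-frame per key, and that groupby keeps the rows of each group in the order they had before grouping.

```python
import pandas as pd

df = pd.DataFrame({'A': ['one', 'one', 'two', 'two', 'one'],
                   'B': ['Ar', 'Br', 'Cr', 'Ar', 'Ar'],
                   'C': ['12/15/2011', '11/11/2001', '08/30/2015', '07/3/1999', '03/03/2000'],
                   'D': [1, 7, 3, 4, 5]})

# Sort by the grouping column first, then by B. Rows sharing the same A
# end up next to each other and ordered by B; the flat index is kept.
sorted_df = df.sort_values(['A', 'B'])
print(sorted_df)

# A groupby object is lazy: nothing is displayed until you aggregate,
# apply a function, or iterate over it. Iterating yields (key, sub-frame)
# pairs, and each sub-frame keeps the row order it had before grouping.
groups = {key: group for key, group in df.sort_values('B').groupby('A')}
for key, group in groups.items():
    print(key)
    print(group)
```

Use the apply version when you want A as an outer index level in the result, and the multi-column sort when a flat, ordered dataframe is sufficient.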

177

answered Oct 12 '22 20:10

Cleb
