How to reset a DataFrame's indexes for all groups in one step?

Tags: python, pandas, group-by

I've tried to split my DataFrame into groups:

    import pandas as pd

    df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar',
                             'foo', 'bar', 'foo', 'foo'],
                       'B': ['1', '2', '3', '4',
                             '5', '6', '7', '8']})

    grouped = df.groupby('A')

I get two groups:

         A  B
    0  foo  1
    2  foo  3
    4  foo  5
    6  foo  7
    7  foo  8

         A  B
    1  bar  2
    3  bar  4
    5  bar  6

Now I want to reset the index of each group separately:

    # drop=True discards the old index instead of adding it as a column
    print(grouped.get_group('foo').reset_index(drop=True))
    print(grouped.get_group('bar').reset_index(drop=True))

Finally I get the result:

         A  B
    0  foo  1
    1  foo  3
    2  foo  5
    3  foo  7
    4  foo  8

         A  B
    0  bar  2
    1  bar  4
    2  bar  6

Is there a better way to do this (for example, automatically calling some method on each group)?

546

asked Mar 14 '14 14:03

Meloun

2 Answers

Pass as_index=False to the groupby; then you don't need to call reset_index to turn the grouped-by columns back into regular columns:

    In [11]: grouped = df.groupby('A', as_index=False)

    In [12]: grouped.get_group('foo')
    Out[12]:
         A  B
    0  foo  1
    2  foo  3
    4  foo  5
    6  foo  7
    7  foo  8

Note: as pointed out (and as the example above shows), the index is not [0, 1, 2, ...]. I claim that this will never matter in practice. If it does, you're going to have to jump through some strange hoops, and the result will be more verbose, less readable and less efficient.
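If a zero-based index per group is needed after all, one possible way to apply the reset to every group automatically is to iterate over the groupby object. This is only a sketch using the df from the question; the dictionary name `groups` is made up for illustration:

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar',
                         'foo', 'bar', 'foo', 'foo'],
                   'B': ['1', '2', '3', '4',
                         '5', '6', '7', '8']})

# Iterating over a GroupBy yields (key, sub-DataFrame) pairs, so the reset
# can be applied to every group in a single comprehension.
# `groups` is a hypothetical name, not something from the question.
groups = {name: group.reset_index(drop=True)
          for name, group in df.groupby('A')}

print(groups['foo'])
print(groups['bar'])
```

Each value in `groups` is an independent DataFrame with its own 0-based index, which is roughly what the two get_group(...).reset_index() calls in the question produce.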

122

answered Oct 02 '22 13:10

Andy Hayden

    df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar',
                             'foo', 'bar', 'foo', 'foo'],
                       'B': ['1', '2', '3', '4',
                             '5', '6', '7', '8']})
    grouped = df.groupby('A', as_index=False)

This gives two groups. Next, reset the index within each group and then flatten the result:

    grouped_index = grouped.apply(lambda x: x.reset_index(drop=True)).reset_index()

This adds two new columns, level_0 and level_1, and resets the index:

       level_0  level_1    A  B
    0        0        0  bar  2
    1        0        1  bar  4
    2        0        2  bar  6
    3        1        0  foo  1
    4        1        1  foo  3
    5        1        2  foo  5
    6        1        3  foo  7
    7        1        4  foo  8

    result = grouped_index.drop('level_0', axis=1).set_index('level_1')

This creates an index that restarts at 0 within each group of "A":

               A  B
    level_1
    0        bar  2
    1        bar  4
    2        bar  6
    0        foo  1
    1        foo  3
    2        foo  5
    3        foo  7
    4        foo  8
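A similar per-group index can probably be built more directly with GroupBy.cumcount(), which numbers the rows of each group from 0. This is a sketch of an alternative, not the method used above; the names `within` and `result` are illustrative, and the stable sort is only there to list the groups one after another as in the output above:

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar',
                         'foo', 'bar', 'foo', 'foo'],
                   'B': ['1', '2', '3', '4',
                         '5', '6', '7', '8']})

# cumcount() numbers the rows of each group 0, 1, 2, ... in original order.
within = df.groupby('A').cumcount()

# Use that counter as the index, then sort by group with a stable sort so
# the order of rows inside each group is preserved.
result = df.set_index(within).sort_values('A', kind='stable')

print(result)
```

Unlike the apply-based version, the resulting index is unnamed rather than called level_1.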

answered Oct 02 '22 11:10

yogitha jaya reddy gari
