pandas reset_index after groupby.value_counts()

Tags:

I am trying to groupby a column and compute value counts on another column.

import pandas as pd dftest = pd.DataFrame({'A':[1,1,1,1,1,1,1,1,1,2,2,2,2,2],                 'Amt':[20,20,20,30,30,30,30,40, 40,10, 10, 40,40,40]})  print(dftest)

dftest looks like

    A  Amt 0   1   20 1   1   20 2   1   20 3   1   30 4   1   30 5   1   30 6   1   30 7   1   40 8   1   40 9   2   10 10  2   10 11  2   40 12  2   40 13  2   40

perform grouping

grouper = dftest.groupby('A') df_grouped = grouper['Amt'].value_counts()

which gives

   A  Amt 1  30     4    20     3    40     2 2  40     3    10     2 Name: Amt, dtype: int64

what I want is to keep top two rows of each group

Also, I was perplexed by an error when I tried to reset_index

df_grouped.reset_index()

which gives following error

df_grouped.reset_index() ValueError: cannot insert Amt, already exists

999

asked Sep 29 '16 19:09

muon

1 Answers

You need parameter name in reset_index, because Series name is same as name of one of levels of MultiIndex:

df_grouped.reset_index(name='count')

Another solution is rename Series name:

print (df_grouped.rename('count').reset_index())     A  Amt  count 0  1   30      4 1  1   20      3 2  1   40      2 3  2   40      3 4  2   10      2

More common solution instead value_counts is aggregate size:

df_grouped1 =  dftest.groupby(['A','Amt']).size().reset_index(name='count')  print (df_grouped1)    A  Amt  count 0  1   20      3 1  1   30      4 2  1   40      2 3  2   10      2 4  2   40      3

answered Sep 22 '22 15:09

jezrael

Related questions
                            
                                PyCharm: Forcing Django Template Syntax Highligting
                            
                                How do I let my matplotlib plot go beyond the axes?
                            
                                AttributeError: 'Manager' object has no attribute 'get_by_natural_key' error in Django?
                            
                                What's the working directory when using IDLE?
                            
                                Python `map` and arguments unpacking
                            
                                python requests - POST Multipart/form-data without filename in HTTP request
                            
                                pyzmq missing when running ipython notebook
                            
                                Tensor with unspecified dimension in tensorflow
                            
                                Is it possible to use argparse to capture an arbitrary set of optional arguments?
                            
                                Numpy is installed but still getting error
                            
                                Way to have compiled python files in a separate folder?
                            
                                How to check if a variable is empty in python?
                            
                                Installing a django site on GoDaddy [closed]
                            
                                will using list comprehension to read a file automagically call close()
                            
                                Django Rest Framework - How to test ViewSet?
                            
                                Does enumerate() produce a generator object?
                            
                                Removing \u2018 and \u2019 character
                            
                                How can I add N milliseconds to a datetime in Python
                            
                                Why isn't PyCharm's autocomplete working for libraries I install?
                            
                                how to "source" file into python script

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas reset_index after groupby.value_counts()

Tags:

python

pandas

dataframe

data-manipulation

data-science

muon

People also ask

1 Answers

jezrael

Recent Activity

Donate For Us