Pandas Groupby and Sum Only One Column

Tags:

pandas

So I have a dataframe, df1, that looks like the following:

       A      B      C 1     foo    12    California 2     foo    22    California 3     bar    8     Rhode Island 4     bar    32    Rhode Island 5     baz    15    Ohio 6     baz    26    Ohio

I want to group by column A and then sum column B while keeping the value in column C. Something like this:

      A       B      C 1    foo     34    California 2    bar     40    Rhode Island 3    baz     41    Ohio

The issue is, when I say df.groupby('A').sum() column C gets removed returning

      B A bar  40 baz  41 foo  34

How can I get around this and keep column C when I group and sum?

340

asked Aug 16 '16 21:08

1 Answers

The only way to do this would be to include C in your groupby (the groupby function can accept a list).

Give this a try:

df.groupby(['A','C'])['B'].sum()

One other thing to note, if you need to work with df after the aggregation you can also use the as_index=False option to return a dataframe object. This one gave me problems when I was first working with Pandas. Example:

df.groupby(['A','C'], as_index=False)['B'].sum()

185

answered Sep 19 '22 23:09

Sevyns

Related questions
                            
                                Easier way to enable verbose logging
                            
                                MySQL parameterized queries
                            
                                PDF Parsing Using Python - extracting formatted and plain texts [closed]
                            
                                Python packages and egg-info directories
                            
                                What's the difference between Python's subprocess.call and subprocess.run
                            
                                Virtual environment in R?
                            
                                How do I count the letters in Llanfairpwllgwyngyllgogerychwyrndrobwllllantysiliogogogoch?
                            
                                Python time.sleep() vs event.wait()
                            
                                How do I debug efficiently with Spyder in Python?
                            
                                regex error - nothing to repeat
                            
                                Python functools.wraps equivalent for classes
                            
                                Why does next raise a 'StopIteration', but 'for' do a normal return?
                            
                                Efficient thresholding filter of an array with numpy
                            
                                set environment variable in python script
                            
                                What is the difference between pickle and shelve?
                            
                                Opposite of melt in python pandas
                            
                                Running Python from Atom
                            
                                How to access a field of a namedtuple using a variable for the field name?
                            
                                Django Model Mixins: inherit from models.Model or from object?
                            
                                c++11 regex slower than python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas Groupby and Sum Only One Column

Tags:

python

pandas

JSolomonCulp

People also ask

1 Answers

Sevyns

Recent Activity

Donate For Us