Sum up column values in Pandas DataFrame

Tags:

In a pandas DataFrame, is it possible to collapse columns which have identical values, and sum up the values in another column?

Code

data = {"score":{"0":9.397,"1":9.397,"2":9.397995,"3":9.397996,"4":9.3999},"type":{"0":"advanced","1":"advanced","2":"advanced","3":"newbie","4":"expert"},"count":{"0":394.18930604,"1":143.14226729,"2":9.64172783,"3":0.1,"4":19.65413734}}
df = pd.DataFrame(data)
df

Output

     count       score       type
0    394.189306  9.397000    advanced
1    143.142267  9.397000    advanced
2    9.641728    9.397995    advanced
3    0.100000    9.397996    newbie
4    19.654137   9.399900    expert

In the example above, the first two rows have the same score and type , so these rows should be merged together and their scores added up.

Desired Output

     count       score       type
0    537.331573  9.397000    advanced
1    9.641728    9.397995    advanced
2    0.100000    9.397996    newbie
3    19.654137   9.399900    expert

514

asked Nov 24 '13 21:11

Nyxynyx

1 Answers

This is a job for groupby:

>>> df.groupby(["score", "type"]).sum()
                        count
score    type                
9.397000 advanced  537.331573
9.397995 advanced    9.641728
9.397996 newbie      0.100000
9.399900 expert     19.6541374
>>> df.groupby(["score", "type"], as_index=False).sum()
      score      type       count
0  9.397000  advanced  537.331573
1  9.397995  advanced    9.641728
2  9.397996    newbie    0.100000
3  9.399900    expert   19.654137

186

answered Oct 03 '22 00:10

DSM

Related questions
                            
                                Python multiprocessing process vs. standalone Python VM
                            
                                Is there a multithreaded map() function? [closed]
                            
                                Subsetting data in Python
                            
                                python 3: how to check if an object is a function? [duplicate]
                            
                                Can a python program be run on a computer without Python? What about C/C++?
                            
                                How to use pipe in IPython
                            
                                Jinja2 ignore UndefinedErrors for objects that aren't found
                            
                                How to monkey patch Django?
                            
                                django querysets + memcached: best practices
                            
                                slices to immutable strings by reference and not copy
                            
                                UUID field added after data already in database. Is there any way to populate the UUID field for existing data?
                            
                                Python Opencv SolvePnP yields wrong translation vector
                            
                                Why are uncompiled, repeatedly used regexes so much slower in Python 3?
                            
                                Find closest row of DataFrame to given time in Pandas
                            
                                web scraping google news with python
                            
                                How to disable cookie handling with the Python requests library?
                            
                                Using Python to Remove All Lines Matching Regex
                            
                                pandas group by year, rank by sales column, in a dataframe with duplicate data
                            
                                pymongo method of getting statistics for collection byte usage?
                            
                                Can I use 'eval' to define a function in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Sum up column values in Pandas DataFrame

Tags:

python

pandas

python-2.7

Nyxynyx

People also ask

1 Answers

DSM

Recent Activity

Donate For Us