python pandas simple pivot table sum count

Tags: python, pandas, dataframe, group-by, pivot-table

I'm trying to identify the best way to make a simple pivot on my data:

import pandas    
dfn = pandas.DataFrame({
    "A" : [ 'aaa', 'bbb', 'aaa', 'bbb' ],
    "B" : [     1,    10,     2,   30  ],
    "C" : [     2,     0,     3,   20  ] })

The output I would like is a dataframe, grouped by A, that sums and counts the values of B and C, with the columns named exactly Sum_B, Sum_C and Count, as follows:

A   Sum_B  Sum_C  Count
aaa    3      5       2
bbb   40     20       2

What is the fastest way to do this?

860

asked Jun 22 '16 10:06

DPColombotto

1 Answer

You can use the .agg() function:

In [227]: dfn.groupby('A').agg({'B':sum, 'C':sum, 'A':'count'}).rename(columns={'A':'count'})
Out[227]:
      B  count   C
A
aaa   3      2   5
bbb  40      2  20

Or, with reset_index() to turn A back into a regular column:

In [239]: dfn.groupby('A').agg({'B':sum, 'C':sum, 'A':'count'}).rename(columns={'A':'count'}).reset_index()
Out[239]:
     A   B  count   C
0  aaa   3      2   5
1  bbb  40      2  20
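If the column names have to be exactly Sum_B, Sum_C and Count, named aggregation is one way to get them in a single step. This is only a sketch, not part of the session above: it assumes pandas 0.25 or later (where keyword-style named aggregation is available), and it counts the non-null values of B to get the group size, which is my own choice of column.

```python
import pandas as pd

dfn = pd.DataFrame({
    "A": ["aaa", "bbb", "aaa", "bbb"],
    "B": [1, 10, 2, 30],
    "C": [2, 0, 3, 20],
})

# Named aggregation: output_name=(input_column, aggregation_name).
# Assumption: "Count" is taken as the number of non-null B values per group.
result = (
    dfn.groupby("A")
       .agg(
           Sum_B=("B", "sum"),
           Sum_C=("C", "sum"),
           Count=("B", "count"),
       )
       .reset_index()  # turn the group key A back into a regular column
)

print(result)
```

The aggregations are passed as strings ("sum", "count") rather than the builtin sum, and the output columns come out in the order the keywords are written, so no rename() is needed afterwards.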

PS Here is a link to examples provided by @evan54

165

answered Sep 28 '22 07:09

MaxU - stop WAR against UA
