Pandas Pivot Table List of Aggfunc

Pandas Pivot Table Dictionary of Agg function

I am trying to calculate three aggregate functions while pivoting:

Count
Mean
StDev

This is the code:

n_page = (pd.pivot_table(Main_DF, 
                         values='SPC_RAW_VALUE',  
                         index=['ALIAS', 'SPC_PRODUCT', 'LABLE', 'RAW_PARAMETER_NAME'], 
                         columns=['LOT_VIRTUAL_LINE'],
                         aggfunc={'N': 'count', 'Mean': np.mean, 'Sigma': np.std})
          .reset_index()
         )

The error I am getting is: KeyError: 'Mean'

How can I calculate those 3 functions?

asked Dec 10 '15 04:12

Felix

3 Answers

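The accepted answer by @Happy001 says that aggfunc cannot take a dict, but that does not hold in current pandas: we can pass a dict to aggfunc. The dict keys must be names of columns listed in values, which is why the code in the question fails with KeyError: 'Mean'.

A really handy feature is the ability to pass a dictionary to aggfunc so you can apply different functions to each of the values you select. For example: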

import pandas as pd
import numpy as np

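df = pd.read_excel('sales-funnel.xlsx')  # load the xlsx file

# Each dict key is a column listed in `values`; each dict value is the
# function (or list of functions) applied to that column.
table = pd.pivot_table(df,
                       index=['Manager', 'Status'],
                       columns=['Product'],
                       values=['Quantity', 'Price'],
                       aggfunc={'Quantity': len, 'Price': [np.sum, np.mean]},
                       fill_value=0)
table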

In the code above, I pass a dictionary to aggfunc, applying len to Quantity and both sum and mean to Price.
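Applied to the question, the same idea might look like the sketch below. The `toy` frame is hypothetical stand-in data (only three of the question's columns, with made-up values), and string function names are used instead of the NumPy functions; treat it as an illustration under those assumptions rather than a drop-in fix.

```python
import pandas as pd

# Hypothetical toy data standing in for Main_DF from the question.
toy = pd.DataFrame({
    "ALIAS": ["a", "a", "a", "b", "b", "b"],
    "LOT_VIRTUAL_LINE": ["L1", "L1", "L2", "L1", "L2", "L2"],
    "SPC_RAW_VALUE": [1.0, 3.0, 5.0, 2.0, 4.0, 8.0],
})

# The dict key names the column being aggregated (not the output label);
# the list holds the functions to apply to that column.
stats = pd.pivot_table(
    toy,
    values="SPC_RAW_VALUE",
    index="ALIAS",
    columns="LOT_VIRTUAL_LINE",
    aggfunc={"SPC_RAW_VALUE": ["count", "mean", "std"]},
)
print(stats)
```

The labels N, Mean and Sigma from the question are not dict keys here; they would be applied by renaming the columns after the pivot.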

[Image: screenshot of the resulting pivot table]

The example is taken from the article "pivot table explained".

answered Oct 17 '22 17:10

Ganesh_

According to the documentation at the time of writing, the aggfunc argument of pivot_table takes a function or a list of functions, but not a dict:

aggfunc : function, default numpy.mean, or list of functions. If list of functions passed, the resulting pivot table will have hierarchical columns whose top level are the function names (inferred from the function objects themselves).

So try

n_page = (pd.pivot_table(Main_DF, 
                         values='SPC_RAW_VALUE',  
                         index=['ALIAS', 'SPC_PRODUCT', 'LABLE', 'RAW_PARAMETER_NAME'], 
                         columns=['LOT_VIRTUAL_LINE'],
                         aggfunc=[len, np.mean, np.std])
          .reset_index()
         )

You may want to rename the hierarchical columns afterwards.
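One possible way to do that renaming is sketched here. The `toy` frame and the `labels` mapping are hypothetical, and the string name "count" is used in place of len (note that len also counts rows whose value is NaN, while "count" does not).

```python
import pandas as pd

# Hypothetical toy data standing in for Main_DF from the question.
toy = pd.DataFrame({
    "ALIAS": ["a", "a", "a", "b", "b", "b"],
    "LOT_VIRTUAL_LINE": ["L1", "L1", "L2", "L1", "L2", "L2"],
    "SPC_RAW_VALUE": [1.0, 3.0, 5.0, 2.0, 4.0, 8.0],
})

# A list of functions gives two column levels: function name, then line.
table = pd.pivot_table(
    toy,
    values="SPC_RAW_VALUE",
    index="ALIAS",
    columns="LOT_VIRTUAL_LINE",
    aggfunc=["count", "mean", "std"],
)

# Map the inferred function names to the labels wanted in the question.
labels = {"count": "N", "mean": "Mean", "std": "Sigma"}
table = table.rename(columns=labels, level=0)

# Optionally flatten both levels into single names such as "Mean_L1".
table.columns = [f"{stat}_{line}" for stat, line in table.columns]
flat = table.reset_index()
print(flat)
```

Flattening is a design choice: keeping the two levels makes it easy to select all means at once with table["Mean"], while flat names are simpler to export.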

answered Oct 17 '22 17:10

Happy001

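Try using groupby:

df = (Main_DF
      .groupby(['ALIAS', 'SPC_PRODUCT', 'LABLE', 'RAW_PARAMETER_NAME'], as_index=False)
      .LOT_VIRTUAL_LINE
      # Named aggregation (pandas >= 0.25). Passing a dict of new names here
      # raises SpecificationError in pandas >= 1.0.
      .agg(N='count', Mean='mean', Sigma='std')
     )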

Setting as_index=False just leaves these as columns in your dataframe so you don't have to reset the index afterwards.
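As a variation closer to the pivot in the question, the sketch below aggregates SPC_RAW_VALUE and adds LOT_VIRTUAL_LINE to the grouping keys. This is an assumption about the intended result, not what the snippet above does; the `toy` frame is hypothetical and the named-aggregation syntax needs pandas 0.25 or later.

```python
import pandas as pd

# Hypothetical toy data standing in for Main_DF from the question.
toy = pd.DataFrame({
    "ALIAS": ["a", "a", "a", "b", "b", "b"],
    "LOT_VIRTUAL_LINE": ["L1", "L1", "L2", "L1", "L2", "L2"],
    "SPC_RAW_VALUE": [1.0, 3.0, 5.0, 2.0, 4.0, 8.0],
})

# Named aggregation: output_name=(input_column, function).
summary = (
    toy.groupby(["ALIAS", "LOT_VIRTUAL_LINE"], as_index=False)
       .agg(
           N=("SPC_RAW_VALUE", "count"),
           Mean=("SPC_RAW_VALUE", "mean"),
           Sigma=("SPC_RAW_VALUE", "std"),
       )
)
print(summary)
```

This yields a long table with one row per ALIAS and line, already carrying the N, Mean and Sigma names, so no renaming step is needed.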

answered Oct 17 '22 17:10

Alexander

Tags: python, pandas, pivot-table
