Summing over a multiindex level in a pandas series

Tags:

Using the Pandas package in python, I would like to sum (marginalize) over one level in a series with a 3-level multiindex to produce a series with a 2 level multiindex. For example, if I have the following:

ind = [tuple(x) for x in ['ABC', 'ABc', 'AbC', 'Abc', 'aBC', 'aBc', 'abC', 'abc']] mi = pd.MultiIndex.from_tuples(ind) data = pd.Series([264, 13, 29, 8, 152, 7, 15, 1], index=mi)  A  B  C    264       c     13    b  C     29       c      8 a  B  C    152       c      7    b  C     15       c      1

I would like to sum over the variable C to produce the following output:

A  B    277    b     37 a  B    159    b     16

What is the best way in Pandas to do this?

466

asked Jul 18 '14 13:07

dylkot

1 Answers

If you know you always want to aggregate over the first two levels, then this is pretty easy:

In [27]: data.groupby(level=[0, 1]).sum() Out[27]: A  B    277    b     37 a  B    159    b     16 dtype: int64

130

answered Sep 23 '22 06:09

chrisaycock

Related questions
                            
                                Control the size TextArea widget look in django admin
                            
                                Running pytest test functions inside a jupyter notebook
                            
                                Why are single type constraints disallowed in Python?
                            
                                Quicker to os.walk or glob?
                            
                                AWS Cognito as Django authentication back-end for web site
                            
                                Comparing XML in a unit test in Python
                            
                                does close() imply flush() in Python?
                            
                                ConfigParser vs. import config
                            
                                Django Debug Toolbar: understanding the time panel
                            
                                Python: intersection indices numpy array
                            
                                When to use or not use iterator() in the django ORM
                            
                                Difference between using requests.get() and requests.session().get()?
                            
                                Feature Importance Chart in neural network using Keras in Python
                            
                                Credentials in pip.conf for private PyPI
                            
                                ValueError: shape mismatch: objects cannot be broadcast to a single shape
                            
                                How to Bootstrap numpy installation in setup.py
                            
                                Is there an "ungroup by" operation opposite to .groupby in pandas?
                            
                                Difference between render_template and redirect?
                            
                                How does it work, the naming convention for Django INSTALLED_APPS?
                            
                                How do you debug Mako templates?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Summing over a multiindex level in a pandas series

Tags:

python

pandas

multi-index

statistics

dylkot

People also ask

1 Answers

chrisaycock

Recent Activity

Donate For Us