Python pandas: groupby one level of MultiIndex but remain other levels instead

Tags:

python

pandas

Suppose that I have a DataFrame:

import numpy as np
import pandas as pd

df = pd.DataFrame(np.arange(0, 24).reshape((3, 8)))
df.columns = pd.MultiIndex.from_arrays([
    ['a1', 'a1', 'a2', 'a2', 'b1', 'b1', 'b2', 'b2'],
    ['4th', '5th', '4th', '5th', '4th', '5th', '4th', '5th']
])
print(df)

output:

       a1      a2      b1      b2    
  4th 5th 4th 5th 4th 5th 4th 5th
0   0   1   2   3   4   5   6   7
1   8   9  10  11  12  13  14  15
2  16  17  18  19  20  21  22  23

I wanna group by a dict:

label_dict = {'a1': 'A', 'a2': 'A', 'b1': 'B', 'b2': 'B'}
res = df.groupby(label_dict, axis=1, level=0).sum()
print(res)

output:

but what I want is:

    A   A   B   B
  4th 5th 4th 5th
0   2   4  10  12
1  18  21  26  28
2  34  36  42  44

Is there any idea? Thanks!

938

asked May 31 '18 12:05

Alvin Liu

1 Answers

Use rename with sum by both levels in MultiIndex in columns:

label_dict = {'a1': 'A', 'a2': 'A', 'b1': 'B', 'b2': 'B'}

res = df.rename(columns=label_dict, level=0).sum(level=[0,1], axis=1)
#alternative with groupby
#res = df.rename(columns=label_dict, level=0).groupby(level=[0,1], axis=1).sum()
print(res)
    A       B    
  4th 5th 4th 5th
0   2   4  10  12
1  18  20  26  28
2  34  36  42  44

102

answered Nov 10 '22 00:11

jezrael

Related questions
                            
                                Vectorized 2-D moving window in numpy including edges
                            
                                LSTM Initial state from Dense layer
                            
                                Python - How can I completely uninstall Anaconda on Windows 10?
                            
                                Django UpdateView, get the current object being edit id?
                            
                                Keras : AttributeError: 'int' object has no attribute 'ndim' when using model.fit
                            
                                Convert varargin and nargin to from Matlab to Python
                            
                                Python: Possible to unpack tuple and append to multiple lists in one line?
                            
                                How to use Exponential Moving Average in Tensorflow
                            
                                AWS Lambda: call function from another AWS lambda using boto3 invoke
                            
                                POST document with Django RequestFactory instead of form data
                            
                                Transform a datetime column to YYYYQx with quarter number
                            
                                How does PyTorch module do the back prop
                            
                                Boto3 AWS S3 bucket creation error
                            
                                How to get the list of all built in functions in Python
                            
                                Convert boolean numpy array to pillow image
                            
                                SQLAlchemy select from two tables with null LEFT JOIN returns empty result
                            
                                python sorting list of dictionary by custom order [duplicate]
                            
                                Python datetime to epoch
                            
                                Unable to invoke firefox headless
                            
                                Export a variable from bash and use it in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With