Pandas Multiindex Groupby on Columns

Tags:

Is there anyway to use groupby on the columns in a Multiindex. I know you can on the rows and there is good documentation in that regard. However I cannot seem to groupby on columns. The only solution I have is transposing the dataframe.

#generate data (copied from pandas example)
arrays=[['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two']]
tuples = list(zip(*arrays))
index = pd.MultiIndex.from_tuples(tuples, names=['first', 'second'])
df = pd.DataFrame(np.random.randn(3, 8), index=['A', 'B', 'C'], columns=index)

Now I will try to groupby columns which fails

df.groupby(level=1)
df.groupby(level='first')

However transposing with rows works

df.T.groupby(level=1)
df.T.groupby(level='first')

So is there a way to do this without transposing?

524

asked Nov 22 '16 15:11

Bobe Kryant

1 Answers

You need to specify the axis in the groupby method:

df.groupby(level = 1, axis = 1).sum()

enter image description here

Or if you mean groupby level 0:

df.groupby(level = 0, axis = 1).sum()

enter image description here

answered Oct 21 '22 20:10

Psidom

Related questions
                            
                                Check if a pandas.Timestamp is in a pandas.Period
                            
                                Overflow / math range error for log or exp
                            
                                How to conditionally skip a test in python
                            
                                PyInstaller doesn't import Queue
                            
                                TypeError: unsupported operand type(s) for &: 'float' and 'numpy.float64' [duplicate]
                            
                                TemplateNotFound when using Airflow's PostgresOperator with Jinja templating and SQL
                            
                                initialize pandas DataFrame with defined dtypes
                            
                                Get column data by Column name and sheet name
                            
                                Is it possible to see tensorboard over ssh?
                            
                                How to pass argument to scoring function in scikit-learn's LogisticRegressionCV call
                            
                                Pandas: how to increment a column's cell value based on a list of ids
                            
                                Python Break Inside Function [duplicate]
                            
                                parsing a dictionary in a pandas dataframe cell into new row cells (new columns)
                            
                                Implementing skip gram with scikit-learn?
                            
                                Speckle ( Lee Filter) in Python
                            
                                numpy.savetxt- Save one column as int and the rest as floats?
                            
                                Random Forest with bootstrap = False in scikit-learn python
                            
                                Pandas read_csv() 1.2GB file out of memory on VM with 140GB RAM
                            
                                How to scrape all the content of each link with scrapy?
                            
                                pandas - number of unique rows occurrences in dataframe

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas Multiindex Groupby on Columns

Tags:

python

pandas

group-by

multi-index

Bobe Kryant

People also ask

1 Answers

Psidom

Recent Activity

Donate For Us