Pandas groupby and Multiindex

Tags:

python

pandas

Is there any opportunity in pandas to groupby data by MultiIndex? By this i mean passing to groupby function not only keys but keys and values to predefine dataframe columns?

a = np.array(['foo', 'foo', 'foo', 'bar', 'bar', 'foo', 'foo'], dtype=object)
b = np.array(['one', 'one', 'two', 'one', 'two', 'two', 'two'], dtype=object)
c = np.array(['dull', 'shiny', 'dull', 'dull', 'dull', 'shiny', 'shiny'], dtype=object)
df = pd.DataFrame([a, b, c]).T
df.columns = ['a', 'b', 'c']
df.groupby(['a', 'b', 'c']).apply(len)

a    b    c    
bar  one  dull     1
     two  dull     1
foo  one  dull     1
          shiny    1
     two  dull     1
          shiny    2

But what I actually want is the following:

mi = pd.MultiIndex(levels=[['foo', 'bar'], ['one', 'two'], ['dull', 'shiny']],
                   labels=[[0, 0, 0, 0, 1, 1, 1, 1], [0, 0, 1, 1, 0, 0, 1, 1], [0, 1, 0, 1, 0, 1, 0, 1]])
#pseudocode
df.groupby(['a', 'b', 'c'], multi_index = mi).apply(len)
a    b    c    
bar  one  dull     1
          shiny    0
     two  dull     1
          shiny    0
foo  one  dull     1
          shiny    1
     two  dull     1
          shiny    2

The way i see it is in creation of additional wrapper on groupby object. Or maybe this feature feets well to pandas philosophy and it can be included in the pandas lib?

236

asked Jun 10 '13 15:06

norecces

1 Answers

just reindex and fillna!

In [14]: df.groupby(['a', 'b', 'c']).size().reindex(index=mi).fillna(0)
Out[14]: 
foo  one  dull     1
          shiny    1
     two  dull     1
          shiny    2
bar  one  dull     1
          shiny    0
     two  dull     1
          shiny    0
dtype: float64

answered Sep 22 '22 03:09

Jeff

Related questions
                            
                                Representing a ragged array in numpy by padding
                            
                                First 8 byes of my encrypted data corrupting using 3DES and CBC
                            
                                Screen displays only in top left corner of window
                            
                                Python signal don't work even on Cygwin?
                            
                                Django template tag to insert or replace URL parameter
                            
                                Issue with Pandas boxplot within a subplot
                            
                                Why won't Python return my mysql-connector cursor from a function?
                            
                                How can change or override sorl-thumbnail cache path and add image with absolute path?
                            
                                How to speed up iteration over part of a numpy array
                            
                                xlrd crashes when reading .xls file modified by PHPExcel
                            
                                Nose test single setup function called once
                            
                                Closing database connection from pipeline and middleware in Scrapy
                            
                                How to search for Chinese characters and short words in documentation generated by Sphinx?
                            
                                Can I use the slice method to return a list that excludes ranges in the middle of the original list?
                            
                                module object has no attribute 'create_frame'
                            
                                Return PostgreSQL UUID array as list with psycopg2
                            
                                Convert Mac Timestamps with python
                            
                                How sys.exc_info() works?
                            
                                How do I use Scrapy to crawl within pages?
                            
                                Update pandas DataFrame in stored in a Pytable with another pandas DataFrame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With