I have the following code: <pre class="prettyprint"><code>import numpy as np import pandas as pd obs = pd.DataFrame({ 'storm': [1, 1, 1, 1, 0, 0, 0, 0], 'lightning': [1, 1, 0, 0, 1, 1, 0, 0], 'thunder': [1, 0, 1, 0, 1, 0, 1, 0], 'p': [0.20, 0.05, 0.04, 0.36, 0.04, 0.01, 0.03, 0.27] }) g1=obs.groupby(['lightning','thunder']).agg({'p':'sum'}) g2=obs.groupby(['lightning','thunder','storm']).agg({'p':'sum'}) </code></pre> which gives <img src="https://i.stack.imgur.com/6SAvM.png" alt="enter image description here"> Now how to divide more detailed groupby by less detailed (to calculate percentage)? I have read this Pandas percentage of total with groupby but was unable to derive how to rewrite for my case.

<code>g2.unstack()</code> to get last level into columns. Then divide, broadcasting over columns. Then <code>stack</code> again. <pre class="prettyprint"><code>g2.unstack().div(g1.p, axis=0).stack() </code></pre> <img src="https://i.stack.imgur.com/Omc0L.png" alt="enter image description here">

How to divide two groupby objects in pandas?

Tags:

python

pandas

group-by

I have the following code:

import numpy as np
import pandas as pd
obs = pd.DataFrame({
        'storm': [1, 1, 1, 1, 0, 0, 0, 0], 
        'lightning': [1, 1, 0, 0, 1, 1, 0, 0], 
        'thunder': [1, 0, 1, 0, 1, 0, 1, 0],
        'p': [0.20, 0.05, 0.04, 0.36, 0.04, 0.01, 0.03, 0.27]
    })
g1=obs.groupby(['lightning','thunder']).agg({'p':'sum'})
g2=obs.groupby(['lightning','thunder','storm']).agg({'p':'sum'})

which gives

enter image description here

Now how to divide more detailed groupby by less detailed (to calculate percentage)?

I have read this Pandas percentage of total with groupby but was unable to derive how to rewrite for my case.

482

asked Jun 28 '16 19:06

Dims

1 Answers

g2.unstack() to get last level into columns. Then divide, broadcasting over columns. Then stack again.

g2.unstack().div(g1.p, axis=0).stack()

enter image description here

200

answered Sep 18 '22 11:09

piRSquared

Related questions
                            
                                Add text annotation to matplotlib plot from a pandas dataframe
                            
                                Python - Speed up for converting a categorical variable to it's numerical index
                            
                                Is there a function to return all single letter colors in Matplotlib?
                            
                                Numpy einsum broadcasting
                            
                                Upgrading from Django 1.6 to 1.9: python manage.py migrate failure
                            
                                How can I merge two dataframes with 'wildcards'?
                            
                                Tox can't copy non-python file while installing the module
                            
                                takes 1 positional argument but 2 were given
                            
                                AttributeError: 'module' object has no attribute 'webdriver'
                            
                                Plot a Correlation Circle in Python
                            
                                How use `unaccent` with full text search in django 1.10?
                            
                                Pandas MultiIndex groupby retaining index levels
                            
                                Python intersection with custom equality
                            
                                'ManyToManyDescriptor' object has no attribute 'add', why?
                            
                                Crash when calling PyArg_ParseTuple on a Numpy array
                            
                                merging recurrent layers with dense layer in Keras
                            
                                How to get __init__() to raise a more useful exception instead of TypeError when incorrect # of arguments?
                            
                                Faster Lemmatization techniques in Python
                            
                                How do I read a multi-line list from a file in Python?
                            
                                Marshmallow: Dict of nested Schema

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With