Pandas MultiIndex: Divide all columns by one column

Tags:

python

pandas

I have a data frame results of the form

                  TOTEXPPQ      TOTEXPCQ     FINLWT21
year quarter                                         
13   1        9.183392e+09  5.459961e+09  1271559.398
     2        2.907887e+09  1.834126e+09   481169.672

and I was trying to divide all (the first two) columns by the last one. My attempt was

weights = results.pop('FINLWT21')
results/weights

But I get

ValueError: cannot join with no level specified and no overlapping names

Which I don't get: There are overlapping names in the index:

weights.head()
year  quarter
13    1          1271559.398
      2           481169.672

Is there perhaps a better way to do this division? Do I need to reset the index?

227

asked Mar 30 '15 19:03

FooBar

1 Answers

You have to specify the axis for the divide (with the div method):

In [11]: results.div(weights, axis=0)
Out[11]:
                 TOTEXPPQ     TOTEXPCQ
year quarter
13   1        7222.149445  4293.909517
     2        6043.371329  3811.807158

The default is axis=1 and the result columns and weights' index names do not overlap, hence the error message.

194

answered Sep 19 '22 06:09

Andy Hayden

Related questions
                            
                                How does PyArg_ParseTupleAndKeywords work?
                            
                                Django-tastypie: Any example on file upload in POST?
                            
                                Strange behaviour with floats and string conversion
                            
                                Generate python bindings, what methods/programs to use [closed]
                            
                                Python - Find text using beautifulSoup then replace in original soup variable
                            
                                how to check which compiler was used to build Python
                            
                                Stopping Supervisor doesn't stop Celery workers
                            
                                What is the use of __kwdefaults__ which is a function object attribute?
                            
                                Sharing a lock between gunicorn workers
                            
                                Setting DataFrame column headers to a MultiIndex
                            
                                Django: list all reverse relations of a model
                            
                                remove italics in latex subscript in matplotlib
                            
                                fixing words with spaces using a dictionary look up in python?
                            
                                How to place minor ticks on symlog scale?
                            
                                Where in flask/gunicorn to initialize application
                            
                                Use cases for property vs. descriptor vs. __getattribute__
                            
                                Get count of related model efficiently in Django
                            
                                How to eliminate the extra minus sign when rounding negative numbers towards zero in numpy?
                            
                                Find out which font matplotlib uses
                            
                                Why does PyMongo throw AutoReconnect?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With