Pandas sum multiple dataframes

Tags:

I have multiple dataframes each with a multi-level-index and a value column. I want to add up all the dataframes on the value columns.

df1 + df2

Not all the indexes are complete in each dataframe, hence I am getting nan on a row which is not present in all the dataframes.

How can I overcome this and treat rows which are not present in any dataframe as having a value of 0?

Eg. I want to get

   val a    2 b    4 c    3 d    3

from pd.DataFrame({'val':{'a': 1, 'b':2, 'c':3}}) + pd.DataFrame({'val':{'a': 1, 'b':2, 'd':3}}) instead of

   val a    2 b    4 c  NaN d  NaN

293

asked Jul 20 '16 04:07

hangc

1 Answers

use the add method with fill_value=0 parameter.

df1 = pd.DataFrame({'val':{'a': 1, 'b':2, 'c':3}}) df2 = pd.DataFrame({'val':{'a': 1, 'b':2, 'd':3}})  df1.add(df2, fill_value=0)     val a  2.0 b  4.0 c  3.0 d  3.0

MultiIndex example

idx1 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'B'), ('b', 'A'), ('b', 'D')]) idx2 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'C'), ('b', 'A'), ('b', 'C')])  np.random.seed([3,1415]) df1 = pd.DataFrame(np.random.randn(4, 1), idx1, ['val']) df2 = pd.DataFrame(np.random.randn(4, 1), idx2, ['val'])  df1            val a A -2.129724   B -1.268466 b A -1.970500   D -2.259055  df2            val a A -0.349286   C -0.026955 b A  0.316236   C  0.348782  df1.add(df2, fill_value=0)            val a A -2.479011   B -1.268466   C -0.026955 b A -1.654264   C  0.348782   D -2.259055

More than 2 dataframes

from functools import reduce  df1 = pd.DataFrame({'val':{'a': 1, 'b':2, 'c':3}}) df2 = pd.DataFrame({'val':{'a': 1, 'b':2, 'd':3}}) df3 = pd.DataFrame({'val':{'e': 1, 'c':2, 'd':3}}) df4 = pd.DataFrame({'val':{'f': 1, 'a':2, 'd':3}}) df5 = pd.DataFrame({'val':{'g': 1, 'f':2, 'd':3}})  reduce(lambda a, b: a.add(b, fill_value=0), [df1, df2, df3, df4, df5])      val a   4.0 b   4.0 c   5.0 d  12.0 e   1.0 f   3.0 g   1.0

169

answered Oct 24 '22 23:10

piRSquared

Related questions
                            
                                Gradle - Exclude file from being packaged with jar
                            
                                How to manage database connection pool in spring jpa?
                            
                                SafariViewController: How to grab OAuth token from URL?
                            
                                How to skip the first item(s) of an iterator in Rust?
                            
                                Android 23+ - Exclude GCM Registration ID from backup
                            
                                How to Use MVC Controller and WebAPI Controller in same project
                            
                                Angular 2 - Use same resolver for parent and child routes
                            
                                Does a std::string always require heap memory?
                            
                                MySQL - Select only non - numeric values from varchar column
                            
                                Is Keras thread safe?
                            
                                What is com.google.android.gm?
                            
                                Getting "At least one JPA metamodel must be present" with @WebMvcTest

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With