Sum a list of Pandas DataFrames

Tags: python, pandas, dataframe

Is there a way to sum multiple pandas DataFrames using syntax similar to pd.concat([df1, df2, df3, df4])? I understand from the documentation that I can do df1.add(df2, fill_value=0) for a pair, but I have a long list of DataFrames to sum and was wondering whether I could do it without writing a loop.
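For two frames, the pairwise operation is DataFrame.add with fill_value=0. A minimal sketch, using made-up frames `left` and `right` (not taken from the question), of how it differs from the plain + operator when the indexes only partly overlap:

```python
import pandas as pd

# Hypothetical frames whose indexes overlap only on 'y'.
left = pd.DataFrame({'val': [1, 2]}, index=['x', 'y'])
right = pd.DataFrame({'val': [10, 20]}, index=['y', 'z'])

# Plain + aligns on the index; labels present in only one frame become NaN.
plain = left + right

# add(fill_value=0) treats a label missing from one frame as 0 before adding.
filled = left.add(right, fill_value=0)

print(plain)
print(filled)
```

Only the second form keeps the rows for 'x' and 'z', which is why it is the building block for summing a longer list.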

Somewhat related question/answer: Pandas sum multiple dataframes (Stack Overflow)

Example of what the result should look like:

import numpy as np
import pandas as pd

idx1 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'B'), ('b', 'A'), ('b', 'D')])
idx2 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'C'), ('b', 'A'), ('b', 'C')])
idx3 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'D'), ('b', 'A'), ('b', 'C')])

np.random.seed([3,1415])
df1 = pd.DataFrame(np.random.randn(4, 1), idx1, ['val'])
df2 = pd.DataFrame(np.random.randn(4, 1), idx2, ['val'])
df3 = pd.DataFrame(np.random.randn(4, 1), idx3, ['val'])

df1, df2, df3

[Images: rendered df1, df2, and df3]

The result should look like:

[Image: expected summed DataFrame]

451

asked Aug 31 '17 13:08

blahblahblah

1 Answer

Use functools.reduce with DataFrame.add and fill_value=0, so that a label missing from one frame is treated as 0 instead of producing NaN:

import numpy as np
import pandas as pd

np.random.seed(12)

a = pd.DataFrame(np.random.randint(3, size=(5,3)), columns=list('abc'))
b = pd.DataFrame(np.random.randint(3, size=(5,2)), columns=list('ab'))
c = pd.DataFrame(np.random.randint(3, size=(5,2)), columns=list('ac'))
print(a)
   a  b  c
0  2  1  1
1  2  0  0
2  2  1  0
3  1  1  1
4  2  2  2

print(b)
   a  b
0  0  1
1  0  0
2  1  2
3  1  2
4  0  1

print(c)
   a  c
0  2  0
1  2  2
2  2  0
3  0  2
4  1  1

from functools import reduce

dfs = [a, b, c]
# Fold the list pairwise; fill_value=0 treats a column missing from one frame as 0
d = reduce(lambda x, y: x.add(y, fill_value=0), dfs)
print(d)
   a    b    c
0  4  2.0  1.0
1  4  0.0  2.0
2  5  3.0  0.0
3  2  3.0  3.0
4  3  3.0  3.0
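The same pattern should carry over to the MultiIndex frames from the question. The sketch below wraps the reduce call in a helper, `sum_frames` (a name introduced here for illustration), and compares it with a different technique, pd.concat followed by a groupby-sum over the index levels, which is closer to the pd.concat-style syntax the question asks about. The random values come from an arbitrarily seeded generator rather than the question's seed, and the concat route is an alternative, not the method from the answer above.

```python
from functools import reduce

import numpy as np
import pandas as pd


def sum_frames(frames):
    """Sum an iterable of DataFrames, treating labels missing from a frame as 0."""
    return reduce(lambda x, y: x.add(y, fill_value=0), frames)


idx1 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'B'), ('b', 'A'), ('b', 'D')])
idx2 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'C'), ('b', 'A'), ('b', 'C')])
idx3 = pd.MultiIndex.from_tuples([('a', 'A'), ('a', 'D'), ('b', 'A'), ('b', 'C')])

rng = np.random.default_rng(0)  # seed chosen arbitrarily for this sketch
df1 = pd.DataFrame(rng.standard_normal((4, 1)), idx1, ['val'])
df2 = pd.DataFrame(rng.standard_normal((4, 1)), idx2, ['val'])
df3 = pd.DataFrame(rng.standard_normal((4, 1)), idx3, ['val'])

dfs = [df1, df2, df3]

# Approach from the answer: fold the list with add(fill_value=0).
via_reduce = sum_frames(dfs)

# Alternative: stack all rows, then sum the rows that share an index tuple.
via_concat = pd.concat(dfs).groupby(level=[0, 1]).sum()

print(via_reduce)
print(np.allclose(via_reduce.sort_index().to_numpy(),
                  via_concat.sort_index().to_numpy()))
```

For this data the two results agree. Whether they agree in every edge case (for example, frames that already contain NaN) is not checked here.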

137

answered Oct 16 '22 18:10

jezrael
