Pandas sum two columns, skipping NaN

Tags:

If I add two columns to create a third, any columns containing NaN (representing missing data in my world) cause the resulting output column to be NaN as well. Is there a way to skip NaNs without explicitly setting the values to 0 (which would lose the notion that those values are "missing")?

In [42]: frame = pd.DataFrame({'a': [1, 2, np.nan], 'b': [3, np.nan, 4]})  In [44]: frame['c'] = frame['a'] + frame['b']  In [45]: frame Out[45]:      a   b   c 0   1   3   4 1   2 NaN NaN 2 NaN   4 NaN

In the above, I would like column c to be [4, 2, 4].

Thanks...

336

asked Jun 24 '14 12:06

smontanaro

2 Answers

with fillna()

frame['c'] = frame.fillna(0)['a'] + frame.fillna(0)['b']

or as suggested :

frame['c'] = frame.a.fillna(0) + frame.b.fillna(0)

giving :

    a   b  c 0   1   3  4 1   2 NaN  2 2 NaN   4  4

answered Oct 01 '22 12:10

jrjc

Another approach:

>>> frame["c"] = frame[["a", "b"]].sum(axis=1) >>> frame     a   b  c 0   1   3  4 1   2 NaN  2 2 NaN   4  4

answered Oct 01 '22 12:10

DSM

Related questions
                            
                                Calling variable defined inside one function from another function
                            
                                How to concatenate two integers in Python?
                            
                                How to log IPython history to text file?
                            
                                Writing a CSV from Flask framework [duplicate]
                            
                                How to specify date and time in python?
                            
                                Python regex to get everything until the first dot in a string
                            
                                pandas xlsxwriter, format header
                            
                                How can I select only one column using SQLAlchemy?
                            
                                converting string to tuple
                            
                                Map each list value to its corresponding percentile
                            
                                python enums with attributes
                            
                                python OpenCV - add alpha channel to RGB image
                            
                                How to get unique values with respective occurrence count from a list in Python?
                            
                                Changing plot scale by a factor in matplotlib
                            
                                Do you use Python mostly for its functional or object-oriented features? [closed]
                            
                                initialize dict with keys,values from two list [duplicate]
                            
                                How To Get Latitude & Longitude with python
                            
                                Why is subtraction faster than addition in Python?
                            
                                How to run " ps cax | grep something " in Python?
                            
                                Get an object attribute [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas sum two columns, skipping NaN

Tags:

python

pandas

smontanaro

People also ask

2 Answers

jrjc

DSM

Recent Activity

Donate For Us