Assume I have the following two <code>DataFrames</code>: <pre class="prettyprint"><code> X Y Z 1 0.0 0.0 0.0 2 1.0 2.0 3.0 3 4.0 2.0 0.0 4 NaN NaN NaN 5 NaN NaN NaN 6 NaN NaN NaN 7 NaN NaN NaN 8 NaN NaN NaN </code></pre> and <pre class="prettyprint"><code> X.2 Y.2 Z.2 1 NaN NaN NaN 2 NaN NaN NaN 3 NaN NaN NaN 4 NaN NaN NaN 5 NaN NaN NaN 6 9.0 3.0 6.0 7 7.0 4.0 3.0 8 3.0 6.0 8.0 </code></pre> I would like to fill the missing data in the first <code>DataFrame</code> with the values from the second. Result should look like this: <pre class="prettyprint"><code> X Y Z 1 0.0 0.0 0.0 2 1.0 2.0 3.0 3 4.0 2.0 0.0 4 NaN NaN NaN 5 NaN NaN NaN 6 9.0 3.0 6.0 7 7.0 4.0 3.0 8 3.0 6.0 8.0 </code></pre> If possible I'd like to avoid creating a new <code>DataFrame</code> but fill up the first <code>DataFrame</code> in place. How do I do this?

You can proceed simply with <code>update</code> which fills up the first dataframe <code>df1</code> based on the value of <code>df2</code>: <pre class="prettyprint"><code>df2.columns = df1.columns df1.update(df2) In [118]: df1 Out[118]: X Y Z 1 0 0 0 2 1 2 3 3 4 2 0 4 NaN NaN NaN 5 NaN NaN NaN 6 9 3 6 7 7 4 3 8 3 6 8 </code></pre>

If you line the columns up, then fillna() will do this: <pre class="prettyprint"><code>df2.columns = df1.column df1.fillna(df2, inplace=True) df1 X Y Z 1 0 0 0 2 1 2 3 3 4 2 0 4 NaN NaN NaN 5 NaN NaN NaN 6 9 3 6 7 7 4 3 8 3 6 8 </code></pre>

Pandas: merge two dataframes ignoring NaN

Tags:

python

pandas

merging-data

Assume I have the following two DataFrames:

Click to copy

  X    Y    Z
1 0.0  0.0  0.0
2 1.0  2.0  3.0
3 4.0  2.0  0.0
4 NaN  NaN  NaN
5 NaN  NaN  NaN
6 NaN  NaN  NaN
7 NaN  NaN  NaN
8 NaN  NaN  NaN

and

Click to copy

  X.2  Y.2  Z.2
1 NaN  NaN  NaN
2 NaN  NaN  NaN
3 NaN  NaN  NaN
4 NaN  NaN  NaN
5 NaN  NaN  NaN
6 9.0  3.0  6.0
7 7.0  4.0  3.0
8 3.0  6.0  8.0

I would like to fill the missing data in the first DataFrame with the values from the second. Result should look like this:

Click to copy

  X    Y    Z
1 0.0  0.0  0.0
2 1.0  2.0  3.0
3 4.0  2.0  0.0
4 NaN  NaN  NaN
5 NaN  NaN  NaN
6 9.0  3.0  6.0
7 7.0  4.0  3.0
8 3.0  6.0  8.0

If possible I'd like to avoid creating a new DataFrame but fill up the first DataFrame in place.

How do I do this?

355

asked Sep 30 '15 14:09

Hendrik Wiese

2 Answers

You can proceed simply with update which fills up the first dataframe df1 based on the value of df2:

Click to copy

df2.columns = df1.columns

df1.update(df2)

In [118]: df1
Out[118]:
    X   Y   Z
1   0   0   0
2   1   2   3
3   4   2   0
4 NaN NaN NaN
5 NaN NaN NaN
6   9   3   6
7   7   4   3
8   3   6   8

140

answered Sep 20 '22 13:09

Colonel Beauvel

If you line the columns up, then fillna() will do this:

Click to copy

df2.columns = df1.column
df1.fillna(df2, inplace=True)
df1

    X   Y   Z
1   0   0   0
2   1   2   3
3   4   2   0
4 NaN NaN NaN
5 NaN NaN NaN
6   9   3   6
7   7   4   3
8   3   6   8

answered Sep 19 '22 13:09

iayork

Related questions
                            
                                How does HttpResponse(status=<code>) work in django?
                            
                                Getting the maximum accuracy for a binary probabilistic classifier in scikit-learn
                            
                                How to get the current behave step with Python?
                            
                                why can't numpy compute long objects?
                            
                                pytesseract don't work with one digit image
                            
                                Python multiprocessing seems near impossible to do within classes/using any class instances. What is its intended use?
                            
                                Is there a reason why append and insert are both there?
                            
                                Export constants from header with Cython
                            
                                Pygame sceen.fill() not filling up the color properly
                            
                                What sort of Python array would this be? Does it already exist in Python?
                            
                                Python unittests, statement before test cases
                            
                                OpenCV: fit the detected edges
                            
                                Plot Red Channel from 3D Numpy Array
                            
                                ImportError: No module named 'version'
                            
                                Pandas dataframe Cartesian join
                            
                                Audio file to text file python
                            
                                Basic linear algebra on spark matrices
                            
                                NLTK: why does nltk not recognize the CLASSPATH variable for stanford-ner?
                            
                                Why matplotlib replace a right parenthesis with "!" in latex expression?
                            
                                Passing Python3 to virtualenvwrapper throws up ImportError

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas: merge two dataframes ignoring NaN

Tags:

python

pandas

merging-data

Hendrik Wiese

People also ask

2 Answers

Colonel Beauvel

iayork

Recent Activity

Donate For Us