I have two DataFrames with the same indexing and want to append the second to the first. Lets say I have: <pre class="prettyprint"><code>df1 = pd.DataFrame([1,2,3], index = [2,3,4]) df2 = pd.DataFrame([3,5,3], index = [2,3,4]) df1 = df1.append(df2) </code></pre> which returns <pre class="prettyprint"><code> 0 2 1 3 2 4 3 2 3 3 5 4 3 </code></pre> But I want it to append a new column where the indexes match: <pre class="prettyprint"><code>2 1 3 3 2 5 4 3 3 </code></pre> Is there a way to do this?

Use <code>concat</code> and pass param <code>axis=1</code> to concatenate the list of dfs column-wise: <pre class="prettyprint"><code>In [3]: df1 = pd.DataFrame([1,2,3], index = [2,3,4]) df2 = pd.DataFrame([3,5,3], index = [2,3,4]) pd.concat([df1,df2], axis=1) Out[3]: 0 0 2 1 3 3 2 5 4 3 3 </code></pre> You can also use <code>join</code> but you have to rename the column first: <pre class="prettyprint"><code>In [6]: df1.join(df2.rename(columns={0:'x'})) Out[6]: 0 x 2 1 3 3 2 5 4 3 3 </code></pre> Or <code>merge</code> specifying that you wish to match on indices: <pre class="prettyprint"><code>In [8]: df1.merge(df2.rename(columns={0:'x'}), left_index=True, right_index=True ) Out[8]: 0 x 2 1 3 3 2 5 4 3 3 </code></pre>

If the indexes match exactly and there's only one column in the other DataFrame (like your question has), then you could even just add the other DataFrame as a new column. <pre class="prettyprint"><code>>>> df1['new_column'] = df2 >>> df1 0 new_column 2 1 3 3 2 5 4 3 3 </code></pre> In general, the <code>concat</code> approach is better. If you have different indexes, you can choose to do an <code>inner join</code> or <code>outer join</code>. <pre class="prettyprint"><code>>>> df2 = pd.DataFrame([3,5,3], index = [2,3,5]) >>> df2 0 2 3 3 5 5 3 >>> pd.concat([df1, df2], axis=1, join='inner') 0 0 2 1 3 3 2 5 >>> pd.concat([df1, df2], axis=1, join='outer') 0 0 2 1 3 3 2 5 4 3 NaN 5 NaN 3 </code></pre>

Append to a DataFrame in Pandas as new column

Tags:

python

pandas

I have two DataFrames with the same indexing and want to append the second to the first. Lets say I have:

df1 = pd.DataFrame([1,2,3], index = [2,3,4])
df2 = pd.DataFrame([3,5,3], index = [2,3,4])
df1 = df1.append(df2)

which returns

But I want it to append a new column where the indexes match:

2  1  3
3  2  5
4  3  3

Is there a way to do this?

745

asked Aug 06 '15 17:08

TheStrangeQuark

2 Answers

Use concat and pass param axis=1 to concatenate the list of dfs column-wise:

In [3]:

df1 = pd.DataFrame([1,2,3], index = [2,3,4])
df2 = pd.DataFrame([3,5,3], index = [2,3,4])
pd.concat([df1,df2], axis=1)
Out[3]:
   0  0
2  1  3
3  2  5
4  3  3

You can also use join but you have to rename the column first:

In [6]:

df1.join(df2.rename(columns={0:'x'}))
Out[6]:
   0  x
2  1  3
3  2  5
4  3  3

Or merge specifying that you wish to match on indices:

In [8]:

df1.merge(df2.rename(columns={0:'x'}), left_index=True, right_index=True )
Out[8]:
   0  x
2  1  3
3  2  5
4  3  3

answered Sep 20 '22 17:09

EdChum

If the indexes match exactly and there's only one column in the other DataFrame (like your question has), then you could even just add the other DataFrame as a new column.

>>> df1['new_column'] = df2
>>> df1
   0  new_column
2  1           3
3  2           5
4  3           3

In general, the concat approach is better. If you have different indexes, you can choose to do an inner join or outer join.

>>> df2 = pd.DataFrame([3,5,3], index = [2,3,5])
>>> df2
   0
2  3
3  5
5  3

>>> pd.concat([df1, df2], axis=1, join='inner')
   0  0
2  1  3
3  2  5

>>> pd.concat([df1, df2], axis=1, join='outer')
    0   0
2   1   3
3   2   5
4   3 NaN
5 NaN   3

answered Sep 20 '22 17:09

vk1011

Related questions
                            
                                Wrong Python version when using Virtualenv in PythonAnywhere
                            
                                Summing Python Objects with MPI's Allreduce
                            
                                Infinite Summation in Python
                            
                                Difference between os.execl() and os.execv() in python
                            
                                How do I output an XML file using ElementTree in python?
                            
                                Multiprocessing speed up vs number of cores
                            
                                Spark mllib predicting weird number or NaN
                            
                                odoo one2many default not set
                            
                                Merge two lists of tuples with timestamps and queue lengths
                            
                                Different result upon shuffling a list
                            
                                Save and close an Excel file after adding data?
                            
                                a Pythonic way to draw a bump chart
                            
                                Append to dictionary file in json, python
                            
                                Checking a python string for escaped characters
                            
                                TypeError: list indices must be integers, not str with xmltodict:
                            
                                Python Plot: How to remove grid lines not within the circle?
                            
                                How to use threading to get user input realtime while main still running in python
                            
                                Why does selenium return an empty text field?
                            
                                cProfile with imports
                            
                                Python nested defaultdict with mix data types

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With