I have a dataframe <code>df</code> that looks like: <pre class="prettyprint"><code> one three two 0 1.0 10.0 4.0 1 2.0 3.0 3.0 2 3.0 22.0 2.0 3 4.0 1.0 1.0 </code></pre> I have another single row dataframe <code>df2</code> that looks like: <pre class="prettyprint"><code> a b m u 0 1.0 2.0 1.0 4.0 </code></pre> I want to concatenate the two to end up with: <pre class="prettyprint"><code> one three two a b m u 0 1.0 10.0 4.0 1.0 2.0 1.0 4.0 1 2.0 3.0 3.0 1.0 2.0 1.0 4.0 2 3.0 22.0 2.0 1.0 2.0 1.0 4.0 3 4.0 1.0 1.0 1.0 2.0 1.0 4.0 </code></pre> I tried: <pre class="prettyprint"><code>df3 = pd.concat([df, df2], axis=1, ignore_index=True) 0 1 2 3 4 5 6 0 1.0 10.0 4.0 1.0 2.0 1.0 4.0 1 2.0 3.0 3.0 NaN NaN NaN NaN 2 3.0 22.0 2.0 NaN NaN NaN NaN 3 4.0 1.0 1.0 NaN NaN NaN NaN </code></pre> Err Wrong answer... How can I sort this out? Many thanks.

Use <code>merge</code> with assigning a dummy key. <pre class="prettyprint"><code>df.assign(key=1).merge(df2.assign(key=1), on='key').drop('key',axis=1) </code></pre> Output: <pre class="prettyprint"><code> one three two a b m u 0 1.0 10.0 4.0 1.0 2.0 1.0 4.0 1 2.0 3.0 3.0 1.0 2.0 1.0 4.0 2 3.0 22.0 2.0 1.0 2.0 1.0 4.0 3 4.0 1.0 1.0 1.0 2.0 1.0 4.0 </code></pre>

I think you can use <code>numpy.tile</code> for repeat data: <pre class="prettyprint"><code>df2 = pd.DataFrame(np.tile(df2.values, len(df.index)).reshape(-1,len(df2.columns)), columns=df2.columns) print (df2) a b m u 0 1.0 2.0 1.0 4.0 1 1.0 2.0 1.0 4.0 2 1.0 2.0 1.0 4.0 3 1.0 2.0 1.0 4.0 df3 = df.join(df2) print (df3) one three two a b m u 0 1.0 10.0 4.0 1.0 2.0 1.0 4.0 1 2.0 3.0 3.0 1.0 2.0 1.0 4.0 2 3.0 22.0 2.0 1.0 2.0 1.0 4.0 3 4.0 1.0 1.0 1.0 2.0 1.0 4.0 </code></pre> Or improved John Galt solution - only replaced <code>NaN</code>s of columns from <code>df2</code>: <pre class="prettyprint"><code>df3 = df.join(df2) df3[df2.columns] = df3[df2.columns].ffill() print (df3) one three two a b m u 0 1.0 10.0 4.0 1.0 2.0 1.0 4.0 1 2.0 3.0 3.0 1.0 2.0 1.0 4.0 2 3.0 22.0 2.0 1.0 2.0 1.0 4.0 3 4.0 1.0 1.0 1.0 2.0 1.0 4.0 </code></pre> Another solution with <code>assign</code> by <code>Series</code> created by <code>iloc</code>, but columns names has to be strings: <pre class="prettyprint"><code>df3 = df.assign(**df2.iloc[0]) print (df3) one three two a b m u 0 1.0 10.0 4.0 1.0 2.0 1.0 4.0 1 2.0 3.0 3.0 1.0 2.0 1.0 4.0 2 3.0 22.0 2.0 1.0 2.0 1.0 4.0 3 4.0 1.0 1.0 1.0 2.0 1.0 4.0 </code></pre> Timings: <pre class="prettyprint"><code>np.random.seed(44) N = 1000000 df = pd.DataFrame(np.random.random((N,5)), columns=list('ABCDE')) df2 = pd.DataFrame(np.random.random((1, 50))) df2.columns = 'a' + df2.columns.astype(str) In [369]: %timeit df.join(pd.DataFrame(np.tile(df2.values, len(df.index)).reshape(-1,len(df2.columns)), columns=df2.columns)) 1 loop, best of 3: 897 ms per loop In [370]: %timeit df.assign(**df2.iloc[0]) 1 loop, best of 3: 467 ms per loop In [371]: %timeit df.assign(key=1).merge(df2.assign(key=1), on='key').drop('key',axis=1) 1 loop, best of 3: 1.55 s per loop In [372]: %%timeit ...: df3 = df.join(df2) ...: df3[df2.columns] = df3[df2.columns].ffill() ...: 1 loop, best of 3: 1.9 s per loop </code></pre>

Concatenate Two Dataframes Pandas with single Row

   one  three  two
0  1.0   10.0  4.0
1  2.0    3.0  3.0
2  3.0   22.0  2.0
3  4.0    1.0  1.0

I have another single row dataframe df2 that looks like:

Click to copy

     a    b    m    u
0  1.0  2.0  1.0  4.0

I want to concatenate the two to end up with:

Click to copy

   one  three  two    a    b    m    u
0  1.0   10.0  4.0  1.0  2.0  1.0  4.0
1  2.0    3.0  3.0  1.0  2.0  1.0  4.0
2  3.0   22.0  2.0  1.0  2.0  1.0  4.0
3  4.0    1.0  1.0  1.0  2.0  1.0  4.0

I tried:

Click to copy

df3 = pd.concat([df, df2], axis=1, ignore_index=True)

     0     1    2    3    4    5    6
0  1.0  10.0  4.0  1.0  2.0  1.0  4.0
1  2.0   3.0  3.0  NaN  NaN  NaN  NaN
2  3.0  22.0  2.0  NaN  NaN  NaN  NaN
3  4.0   1.0  1.0  NaN  NaN  NaN  NaN

Err Wrong answer...

How can I sort this out?

Many thanks.

982

asked Aug 16 '17 12:08

Chuck

Video Answer

2 Answers

Use merge with assigning a dummy key.

Click to copy

df.assign(key=1).merge(df2.assign(key=1), on='key').drop('key',axis=1)

Output:

Click to copy

   one  three  two    a    b    m    u
0  1.0   10.0  4.0  1.0  2.0  1.0  4.0
1  2.0    3.0  3.0  1.0  2.0  1.0  4.0
2  3.0   22.0  2.0  1.0  2.0  1.0  4.0
3  4.0    1.0  1.0  1.0  2.0  1.0  4.0

111

answered Nov 15 '22 21:11

Scott Boston

I think you can use numpy.tile for repeat data:

Click to copy

df2 = pd.DataFrame(np.tile(df2.values, len(df.index)).reshape(-1,len(df2.columns)), 
                   columns=df2.columns)
print (df2)
     a    b    m    u
0  1.0  2.0  1.0  4.0
1  1.0  2.0  1.0  4.0
2  1.0  2.0  1.0  4.0
3  1.0  2.0  1.0  4.0

df3 = df.join(df2)
print (df3)
   one  three  two    a    b    m    u
0  1.0   10.0  4.0  1.0  2.0  1.0  4.0
1  2.0    3.0  3.0  1.0  2.0  1.0  4.0
2  3.0   22.0  2.0  1.0  2.0  1.0  4.0
3  4.0    1.0  1.0  1.0  2.0  1.0  4.0

Or improved John Galt solution - only replaced NaNs of columns from df2:

Click to copy

df3 = df.join(df2)
df3[df2.columns] = df3[df2.columns].ffill()
print (df3)
   one  three  two    a    b    m    u
0  1.0   10.0  4.0  1.0  2.0  1.0  4.0
1  2.0    3.0  3.0  1.0  2.0  1.0  4.0
2  3.0   22.0  2.0  1.0  2.0  1.0  4.0
3  4.0    1.0  1.0  1.0  2.0  1.0  4.0

Another solution with assign by Series created by iloc, but columns names has to be strings:

Click to copy

df3 = df.assign(**df2.iloc[0])
print (df3)
   one  three  two    a    b    m    u
0  1.0   10.0  4.0  1.0  2.0  1.0  4.0
1  2.0    3.0  3.0  1.0  2.0  1.0  4.0
2  3.0   22.0  2.0  1.0  2.0  1.0  4.0
3  4.0    1.0  1.0  1.0  2.0  1.0  4.0

Timings:

Click to copy

np.random.seed(44)
N = 1000000

df = pd.DataFrame(np.random.random((N,5)), columns=list('ABCDE'))

df2 = pd.DataFrame(np.random.random((1, 50)))
df2.columns = 'a' + df2.columns.astype(str)


In [369]: %timeit df.join(pd.DataFrame(np.tile(df2.values, len(df.index)).reshape(-1,len(df2.columns)), columns=df2.columns))
1 loop, best of 3: 897 ms per loop

In [370]: %timeit df.assign(**df2.iloc[0])
1 loop, best of 3: 467 ms per loop

In [371]: %timeit df.assign(key=1).merge(df2.assign(key=1), on='key').drop('key',axis=1)
1 loop, best of 3: 1.55 s per loop

In [372]: %%timeit
     ...: df3 = df.join(df2)
     ...: df3[df2.columns] = df3[df2.columns].ffill()
     ...: 
1 loop, best of 3: 1.9 s per loop

answered Nov 15 '22 20:11

jezrael

Related questions
                            
                                Groupby Aggregate method is returning NaN always
                            
                                Panda rolling window percentile rank
                            
                                The requested address is not valid in its context error
                            
                                Can't load Flask config from parent directory
                            
                                How to add x-axis labels to every plot in a seaborn figure-level plot
                            
                                Impute missing data, while forcing correlation coefficient to remain the same
                            
                                Parallel version of t-SNE
                            
                                Python Jupyter Notebook: Specify cell execution order
                            
                                Get only certain fields of related object in Django
                            
                                Pandas adding Time column to Date index
                            
                                How to shift several rows in a pandas DataFrame?
                            
                                What is the point of the permission infrastructure in Pyramid?
                            
                                Using __prepare__ for an Enum ... what's the catch?
                            
                                regex string and substring
                            
                                What does "MEIPASS" stand for?
                            
                                Adding common parameters to groups with Click
                            
                                asyncio as_yielded from async generators
                            
                                Understanding free OPC/UA code in python
                            
                                How to round to specific values in Python
                            
                                Calculate distance to shore or coastline for a vessel

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Concatenate Two Dataframes Pandas with single Row

Tags:

python

concatenation

pandas

Chuck

People also ask

Video Answer

2 Answers

Scott Boston

jezrael

Recent Activity

Donate For Us