Merging multiple columns into one column in pandas

Tags:

stack

concat

python-3.x

pandas

I have a dataframe called ref (first dataframe) with columns c1, c2, c3 and c4.

import pandas as pd

ref = pd.DataFrame([[1, 3, .3, 7], [0, 4, .5, 4.5], [2, 5, .6, 3]], columns=['c1', 'c2', 'c3', 'c4'])
print(ref)
   c1  c2   c3   c4
0   1   3  0.3  7.0
1   0   4  0.5  4.5
2   2   5  0.6  3.0

I want to create a new column, c5 (second dataframe), that holds all the values from columns c1, c2, c3 and c4.

I tried concat and merging the columns, but I cannot get it to work.

Please let me know if you have a solution.

asked Jan 13 '17 05:01

Sasihci

2 Answers

You can use unstack to create a Series from the DataFrame and then concat it to the original:

print(pd.concat([ref, ref.unstack().reset_index(drop=True).rename('c5')], axis=1))
     c1   c2   c3   c4   c5
0   1.0  3.0  0.3  7.0  1.0
1   0.0  4.0  0.5  4.5  0.0
2   2.0  5.0  0.6  3.0  2.0
3   NaN  NaN  NaN  NaN  3.0
4   NaN  NaN  NaN  NaN  4.0
5   NaN  NaN  NaN  NaN  5.0
6   NaN  NaN  NaN  NaN  0.3
7   NaN  NaN  NaN  NaN  0.5
8   NaN  NaN  NaN  NaN  0.6
9   NaN  NaN  NaN  NaN  7.0
10  NaN  NaN  NaN  NaN  4.5
11  NaN  NaN  NaN  NaN  3.0
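
To see why this lines up, it can help to look at the intermediate Series before it is concatenated. The following is only a sketch of the same steps split apart; the names stacked, c5 and result are mine, not part of the answer.

```python
import pandas as pd

ref = pd.DataFrame([[1, 3, .3, 7], [0, 4, .5, 4.5], [2, 5, .6, 3]],
                   columns=['c1', 'c2', 'c3', 'c4'])

# unstack() on a frame with a flat index returns a Series keyed by
# (column label, row label), walking the frame one column at a time.
stacked = ref.unstack()
print(stacked.index[:4].tolist())

# Dropping that MultiIndex leaves a plain 0..11 index, which is what
# concat aligns on when the Series is placed next to ref.
c5 = stacked.reset_index(drop=True).rename('c5')

# ref only has rows 0..2, so rows 3..11 are filled with NaN in c1..c4.
result = pd.concat([ref, c5], axis=1)
print(result)
```

The NaN rows are a consequence of index alignment: c5 has 12 entries and the original columns only have 3.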

An alternative way to create the Series is to convert the DataFrame to a NumPy array with values and then flatten it column by column with ravel('F'):

print(pd.concat([ref, pd.Series(ref.values.ravel('F'), name='c5')], axis=1))
     c1   c2   c3   c4   c5
0   1.0  3.0  0.3  7.0  1.0
1   0.0  4.0  0.5  4.5  0.0
2   2.0  5.0  0.6  3.0  2.0
3   NaN  NaN  NaN  NaN  3.0
4   NaN  NaN  NaN  NaN  4.0
5   NaN  NaN  NaN  NaN  5.0
6   NaN  NaN  NaN  NaN  0.3
7   NaN  NaN  NaN  NaN  0.5
8   NaN  NaN  NaN  NaN  0.6
9   NaN  NaN  NaN  NaN  7.0
10  NaN  NaN  NaN  NaN  4.5
11  NaN  NaN  NaN  NaN  3.0
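
The 'F' argument is what makes the values come out column by column. A small sketch of the difference, assuming the same ref as in the question (the variable names values, row_major and col_major are just for illustration):

```python
import numpy as np
import pandas as pd

ref = pd.DataFrame([[1, 3, .3, 7], [0, 4, .5, 4.5], [2, 5, .6, 3]],
                   columns=['c1', 'c2', 'c3', 'c4'])

# Same data as ref.values: one 3x4 float array.
values = ref.to_numpy()

# Default order ('C') walks the array row by row: 1, 3, 0.3, 7, 0, ...
row_major = values.ravel()

# Order 'F' walks it column by column: 1, 0, 2, 3, 4, 5, ...
col_major = values.ravel('F')

print(row_major)
print(col_major)

# Transposing first and then flattening row by row gives the same order.
print(np.array_equal(values.T.ravel(), col_major))
```

With the default order the new column would interleave c1 to c4 row by row, which is not the order shown in the output above.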

answered Nov 09 '22 21:11

jezrael

using join + ravel('F')

ref.join(pd.Series(ref.values.ravel('F')).to_frame('c5'), how='right')

using join + T.ravel()

ref.join(pd.Series(ref.values.T.ravel()).to_frame('c5'), how='right')

pd.concat + T.stack() + rename

pd.concat([ref, ref.T.stack().reset_index(drop=True).rename('c5')], axis=1)

way too many transposes + append (DataFrame.append was removed in pandas 2.0, so this one only runs on older versions)

ref.T.append(ref.T.stack().reset_index(drop=True).rename('c5')).T

combine_first + ravel('F') <--- my favorite

ref.combine_first(pd.Series(ref.values.ravel('F')).to_frame('c5'))

All yield

     c1   c2   c3   c4   c5
0   1.0  3.0  0.3  7.0  1.0
1   0.0  4.0  0.5  4.5  0.0
2   2.0  5.0  0.6  3.0  2.0
3   NaN  NaN  NaN  NaN  3.0
4   NaN  NaN  NaN  NaN  4.0
5   NaN  NaN  NaN  NaN  5.0
6   NaN  NaN  NaN  NaN  0.3
7   NaN  NaN  NaN  NaN  0.5
8   NaN  NaN  NaN  NaN  0.6
9   NaN  NaN  NaN  NaN  7.0
10  NaN  NaN  NaN  NaN  4.5
11  NaN  NaN  NaN  NaN  3.0
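
DataFrame.append was removed in pandas 2.0, so the transpose + append variant will not run on current versions. Below is a rough sketch that swaps append for pd.concat (my substitution, not part of the original answer) and checks that the variants agree; names such as via_join and via_transpose are made up for the example.

```python
import numpy as np
import pandas as pd

ref = pd.DataFrame([[1, 3, .3, 7], [0, 4, .5, 4.5], [2, 5, .6, 3]],
                   columns=['c1', 'c2', 'c3', 'c4'])

# The flattened column, built once and reused by every variant.
c5 = pd.Series(ref.to_numpy().ravel('F'), name='c5')

via_concat = pd.concat([ref, c5], axis=1)
via_join = ref.join(c5.to_frame(), how='right')
via_combine = ref.combine_first(c5.to_frame())

# Stand-in for the append variant: add c5 as one extra row of the
# transposed frame with pd.concat, then transpose back.
via_transpose = pd.concat([ref.T, c5.to_frame().T]).T

variants = {'join': via_join,
            'combine_first': via_combine,
            'transpose + concat': via_transpose}

for name, frame in variants.items():
    same = (list(frame.columns) == list(via_concat.columns)
            and np.allclose(frame.to_numpy(), via_concat.to_numpy(),
                            equal_nan=True))
    print(name, same)
```

equal_nan=True is needed in the comparison because NaN does not compare equal to itself by default.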

answered Nov 09 '22 22:11

piRSquared
