This is my original DataFrame (with multiindex column): <pre class="prettyprint"><code>In [72]:df Out[72]: a b x y x y 0 1.545293 -0.459270 0.899254 -1.010453 1 0.458760 0.275400 -0.190951 0.169195 2 -0.941817 1.109823 0.077953 -0.247074 3 1.790101 -1.643470 0.979625 -1.704657 4 -2.044814 -0.243726 -0.039724 0.600066 </code></pre> and I have another DataFrame: <pre class="prettyprint"><code>In [77]:df2 Out[77]: x y 0 -1.085869 -0.952949 1 0.601585 0.570050 2 0.328601 0.802610 3 -0.415952 -0.090088 4 0.757545 -0.736933 </code></pre> how can I add <code>df2</code>'s columns to <code>df</code> to get a new DataFrame like this: <pre class="prettyprint"><code>In [83]:df3 Out[83]: a b c x y x y x y 0 1.545293 -0.459270 0.899254 -1.010453 -1.085869 -0.952949 1 0.458760 0.275400 -0.190951 0.169195 0.601585 0.570050 2 -0.941817 1.109823 0.077953 -0.247074 0.328601 0.802610 3 1.790101 -1.643470 0.979625 -1.704657 -0.415952 -0.090088 4 -2.044814 -0.243726 -0.039724 0.600066 0.757545 -0.736933 </code></pre> My current approach is to use a for loop: <pre class="prettyprint"><code>for col in df2.columns: df['c', col] = df2[col] </code></pre> is there any method to avoid the loop?

Try <code>pd.concat</code>: <pre class="prettyprint"><code>pieces = {'a' : df1['a'], 'b' : df1['b'], 'c' : df2} df3 = pd.concat(pieces, axis=1) </code></pre>

pandas, how to add columns to a multiindex column DataFrame

Tags:

pandas

This is my original DataFrame (with multiindex column):

In [72]:df
Out[72]: 
          a                   b          
          x         y         x         y
0  1.545293 -0.459270  0.899254 -1.010453
1  0.458760  0.275400 -0.190951  0.169195
2 -0.941817  1.109823  0.077953 -0.247074
3  1.790101 -1.643470  0.979625 -1.704657
4 -2.044814 -0.243726 -0.039724  0.600066

and I have another DataFrame:

In [77]:df2
Out[77]: 
          x         y
0 -1.085869 -0.952949
1  0.601585  0.570050
2  0.328601  0.802610
3 -0.415952 -0.090088
4  0.757545 -0.736933

how can I add df2's columns to df to get a new DataFrame like this:

In [83]:df3
Out[83]: 
          a                   b                   c          
          x         y         x         y         x         y
0  1.545293 -0.459270  0.899254 -1.010453 -1.085869 -0.952949
1  0.458760  0.275400 -0.190951  0.169195  0.601585  0.570050
2 -0.941817  1.109823  0.077953 -0.247074  0.328601  0.802610
3  1.790101 -1.643470  0.979625 -1.704657 -0.415952 -0.090088
4 -2.044814 -0.243726 -0.039724  0.600066  0.757545 -0.736933

My current approach is to use a for loop:

for col in df2.columns:
    df['c', col] = df2[col]

is there any method to avoid the loop?

254

asked Jan 27 '16 03:01

2 Answers

Try pd.concat:

pieces = {'a' : df1['a'],
          'b' : df1['b'],
          'c' : df2}
df3 = pd.concat(pieces, axis=1)

171

answered Oct 23 '22 04:10

Kartik

I discovered another way to do this in the general case (running Python 3.6), without having to explicitly deconstruct the DataFrame. You can use pd.concat with the dictionary argument,

df3 = pd.concat({**df1, **{('c',nm):val for nm,val in df2.items()})

** expansion on DataFrame objects seems to return a dictionary of Series objects with "names" equal to the column name string/value, or if the columns are MultiIndexed, the tuple containing the hieararchy of column string/values. Then, when read back into pd.concat as a dictionary, Pandas re-constructs the MultiIndexed columns from the tuples.

Note this is much less efficient than the direct assignment you were doing! Since it has to deconstruct each column and MultiIndex of the dataframe, then re-combine.

answered Oct 23 '22 03:10

Luke Davis

Related questions
                            
                                matplotlib colorbar formatting
                            
                                Sqlalchemy multiple insert fails with percentage symbol (%) in column name
                            
                                NumPy precision when doing dot product
                            
                                How to write python code that can be self-updated without need to quit application?
                            
                                How to plot the outlines of specific squares within a 2D grid using pcolormesh?
                            
                                How to get coverage report from a given package using nose2
                            
                                scipy.interp2d warning and large errors off the grid
                            
                                Using multiprocessing in Python, what is the correct approach for import statements?
                            
                                Combinatorial product of regex substitutions
                            
                                django : loading fixtures with natural foreignkey fails with 'ValueError: invalid literal for int() with base 10'
                            
                                load graph data from files on button click with bokeh
                            
                                Selenium hangs when using actionchain().move then actionchain.click or mouse_up
                            
                                Why this difference between the local response norm paper equation and tensorflow implementation?
                            
                                Nohup and Python -u : it still doesn't log data in realtime
                            
                                Foreign Key to Same Table in sqlalchemy
                            
                                Embedding Bokeh plot in Django website results in blank page with no error message
                            
                                Pygame lags when two players are implemented
                            
                                InvalidCiphertextException when calling kms.decrypt with S3 metadata
                            
                                The right way to define a function in theano?
                            
                                Python - Recommended way to dynamically add methods within a class

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas, how to add columns to a multiindex column DataFrame

Tags:

python

pandas

cncggvg

People also ask

2 Answers

Kartik

Luke Davis

Recent Activity

Donate For Us