Concatenate Two DataFrames With Hierarchical Columns

Tags:

python

pandas

I would like to merge two DataFrames while creating a multilevel column naming scheme denoting which dataframe the rows came from. For example:

In [98]: A=pd.DataFrame(np.arange(9.).reshape(3,3),columns=list('abc'))
In [99]: A
Out[99]: 
   a  b  c
0  0  1  2
1  3  4  5
2  6  7  8

In [100]: B=A.copy()

If I use pd.merge(), then I get

In [104]: pd.merge(A,B,left_index=True,right_index=True)
Out[104]: 
   a_x  b_x  c_x  a_y  b_y  c_y
0    0    1    2    0    1    2
1    3    4    5    3    4    5
2    6    7    8    6    7    8

Which is what I expect with that statement, what I would like (but I don't know how to get!) is:

In [104]: <<one or more statements>>
Out[104]: 
     A              B
     a    b    c    a    b    c
0    0    1    2    0    1    2
1    3    4    5    3    4    5
2    6    7    8    6    7    8

Can this be done without changing the original pd.DataFrame calls? I am reading the data in the dataframes in from .csv files and that might be my problem.

746

asked Sep 23 '13 17:09

YourEconProf

1 Answers

first case can be ordered arbitrarily among A,B (not the columns, just the order A or B) 2nd should preserve ordering

IMHO this is pandonic!

In [5]: concat(dict(A = A, B = B),axis=1)
Out[5]: 
   A        B      
   a  b  c  a  b  c
0  0  1  2  0  1  2
1  3  4  5  3  4  5
2  6  7  8  6  7  8

In [6]: concat([ A, B ], keys=['A','B'],axis=1)
Out[6]: 
   A        B      
   a  b  c  a  b  c
0  0  1  2  0  1  2
1  3  4  5  3  4  5
2  6  7  8  6  7  8

107

answered Sep 24 '22 06:09

Jeff

Related questions
                            
                                pythonic way to delete elements from a numpy array [duplicate]
                            
                                Reading piano notes on Python
                            
                                How to write inline latex code in IPython notebook
                            
                                Synchronous/Asynchronous behaviour of python Pipes
                            
                                equivalent of raw_input in Ipython notebook
                            
                                Legend using PathCollections in matplotlib
                            
                                neural networks regression using pybrain
                            
                                How To: Python Pandas get current stock data
                            
                                Why is matplotlib plot produced from ipython notebook slightly different from terminal version?
                            
                                Error when trying to apply log method to pandas data frame column in Python
                            
                                Property method without class [duplicate]
                            
                                calculate 95 percentile of the list values in python [duplicate]
                            
                                Flask-SQLALchemy: No such table
                            
                                executing a while loop between defined time
                            
                                Which python SOAP libraries are still maintained?
                            
                                Python grequests with custom header
                            
                                Python regex slow when whitespace in string
                            
                                XMLHttpRequest multipart/form-data: Invalid boundary in multipart
                            
                                Installing newest Python on openSUSE
                            
                                Asynchronous file downloads in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With