how to split and concat pandas dataframe

Tags:

I have a datetime index DataFrame of pandas like this:

                         A   B  C  A_1  B_1
2017-07-01 00:00:00  1  34  e    9    0
2017-07-01 00:05:00  2  34  e   92    2
2017-07-01 00:10:00  3  34  e   23    3
2017-07-01 00:15:00  4  34  e    2    5
2017-07-01 00:20:00  5  34  e    4    3

I want to split it and concat with axis=0, the result like this

Click to copy

                     C  REQ  _1
2017-07-01 00:00:00  e  1    9
2017-07-01 00:05:00  e  2   92
2017-07-01 00:10:00  e  3   23
2017-07-01 00:15:00  e  4    2
2017-07-01 00:20:00  e  5    4
2017-07-01 00:00:00  e  34    0
2017-07-01 00:05:00  e  34    2
2017-07-01 00:10:00  e  34    3
2017-07-01 00:15:00  e  34    5
2017-07-01 00:20:00  e  34    3

So, I have to do it like this: First, select df[['C','A','A_1']], df[['C','B', 'B_1']]. Then map the columns, and concat the result.

It's complicated，is there any built-in method in pandas to do this? Or any faster method? Because I have thousands of columns to concat with to get the final result.

390

asked Aug 18 '17 03:08

J.Dan

1 Answers

EDIT

After doing some research lreshape is not documented well and pd.wide_to_long, which is in the current API, does the same as lreshape with more flexibility.

https://github.com/pandas-dev/pandas/issues/2567

https://github.com/pandas-dev/pandas/issues/15003

Let's use the API documented method:

Click to copy

dict1 = {'A':'REQ_A1','B':'REQ_B1','A_1':'Value_A1','B_1':'Value_B1'}

df2 = df1.rename(columns=dict1)

(pd.wide_to_long(df2.reset_index(),['REQ','Value'],i='index',j='C',sep='_',suffix='.')
  .rename_axis(['index','dropme'])
  .reset_index()
  .drop('dropme', axis=1)
  .rename(columns={'Value':'_1'}))

Output:

Click to copy

                 index  C  REQ  _1
0  2017-07-01 00:00:00  e    1   9
1  2017-07-01 00:05:00  e    2  92
2  2017-07-01 00:10:00  e    3  23
3  2017-07-01 00:15:00  e    4   2
4  2017-07-01 00:20:00  e    5   4
5  2017-07-01 00:00:00  e   34   0
6  2017-07-01 00:05:00  e   34   2
7  2017-07-01 00:10:00  e   34   3
8  2017-07-01 00:15:00  e   34   5
9  2017-07-01 00:20:00  e   34   3

Use pd.lreshape:

Click to copy

d = {'REQ': ['A', 'B'], '_1': ['A_1', 'B_1']}
df_out = (pd.lreshape(df.reset_index(), d).set_index('index'))

Output:

Click to copy

                     C  REQ  _1
index                          
2017-07-01 00:00:00  e    1   9
2017-07-01 00:05:00  e    2  92
2017-07-01 00:10:00  e    3  23
2017-07-01 00:15:00  e    4   2
2017-07-01 00:20:00  e    5   4
2017-07-01 00:00:00  e   34   0
2017-07-01 00:05:00  e   34   2
2017-07-01 00:10:00  e   34   3
2017-07-01 00:15:00  e   34   5
2017-07-01 00:20:00  e   34   3

answered Oct 07 '22 01:10

Scott Boston

Related questions
                            
                                Python cx_Oracle Client Library Cannot Be Loaded
                            
                                how to enable/disable editing in TextInput using kivy in python
                            
                                SQLAlchemy update record with case clause and enum
                            
                                pyspark returns a no module named error for a custom module
                            
                                Robustly crop rotated bounding box on photos
                            
                                How PyQt5 keyPressEvent works
                            
                                Numpy match indexing dimensions
                            
                                C++ macro in scope of if statement not compiling
                            
                                Getting a scalar by integer location and column label (mixed indexing)
                            
                                Pandas: expanding DataFrame by number of observations in column
                            
                                How does the built-in function iter() convert a Python list to an iterator?
                            
                                Python JWT library PyJWT trouble signing with HS256 - HMAC using SHA-256 hash algorithm
                            
                                passing pandas dataframe into a python subprocess.Popen as an argument
                            
                                keras reshape input image to work with CNN
                            
                                Django .only() causing maximum recursion depth exceeded error?
                            
                                Python - Sqlite - OperationalError: near "s": syntax error [duplicate]
                            
                                Removing 'overlapping' dates from pandas dataframe
                            
                                How to calculate 3D distance (including altitude) between two points in GeoDjango
                            
                                How to allow non-admin users to authenticate via OAuth2.0 for tenants where users are not allowed to consent apps on their behalf?
                            
                                What's wrong with this implementation of quicksort?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to split and concat pandas dataframe

Tags:

python

pandas

time-series

J.Dan

People also ask

1 Answers

EDIT

Scott Boston

Recent Activity

Donate For Us