Is there a way to split a pandas data frame based on the column name? As an example consider the data frame has the following columns <code>df = ['A_x', 'B_x', 'C_x', 'A_y', 'B_y', 'C_y']</code> and I want to create two data frames <code>X = ['A_x', 'B_x', 'C_x']</code>and <code>Y = ['A_y', 'B_y', 'C_y']</code>. I know there is a possibility to do this: <pre class="prettyprint"><code>d = {'A': df.A_x, 'B': df.B_x, 'C': df.B_x} X = pd.DataFrame (data=d) </code></pre> but this would not be ideal as in my case I have 2200 columns in <code>df</code>. Is there a more elegant solution?

You could use <code>df.filter(regex=...)</code>: <pre class="prettyprint"><code>import numpy as np import pandas as pd df = pd.DataFrame(np.random.randn(2, 10), columns='Time A_x A_y A_z B_x B_y B_z C_x C_y C-Z'.split()) X = df.filter(regex='_x') Y = df.filter(regex='_y') </code></pre> yields <pre class="prettyprint"><code>In [15]: X Out[15]: A_x B_x C_x 0 -0.706589 1.031368 -0.950931 1 0.727826 0.879408 -0.049865 In [16]: Y Out[16]: A_y B_y C_y 0 -0.663647 0.635540 -0.532605 1 0.326718 0.189333 -0.803648 </code></pre>

Splitting pandas data frame based on column name

Tags:

python

pandas

Is there a way to split a pandas data frame based on the column name? As an example consider the data frame has the following columns df = ['A_x', 'B_x', 'C_x', 'A_y', 'B_y', 'C_y'] and I want to create two data frames X = ['A_x', 'B_x', 'C_x']and Y = ['A_y', 'B_y', 'C_y'].

I know there is a possibility to do this:

d = {'A': df.A_x, 'B': df.B_x, 'C': df.B_x}
X = pd.DataFrame (data=d)

but this would not be ideal as in my case I have 2200 columns in df. Is there a more elegant solution?

274

asked Sep 23 '15 12:09

Segmented

1 Answers

You could use df.filter(regex=...):

import numpy as np
import pandas as pd
df = pd.DataFrame(np.random.randn(2, 10),
                  columns='Time A_x A_y A_z B_x B_y B_z C_x C_y C-Z'.split())
X = df.filter(regex='_x')
Y = df.filter(regex='_y')

yields

In [15]: X
Out[15]: 
        A_x       B_x       C_x
0 -0.706589  1.031368 -0.950931
1  0.727826  0.879408 -0.049865

In [16]: Y
Out[16]: 
        A_y       B_y       C_y
0 -0.663647  0.635540 -0.532605
1  0.326718  0.189333 -0.803648

173

answered Oct 19 '22 01:10

unutbu

Related questions
                            
                                Is it possible to import class method without instantiating class?
                            
                                artifactory 404 artifact not found
                            
                                ValueError while trying to save a pixmap as a png file
                            
                                Writing an ASCII string as binary in python
                            
                                How to get the intersection of two dataframes?
                            
                                What is python equivalent of C#'s system.datetime.Ticks()? [closed]
                            
                                How is data from one Behave step passed to a later step?
                            
                                How to find the points of intersection of a line and multiple curves in Python?
                            
                                Python: weird "NameError: name ... is not defined" in an 'exec' environment
                            
                                Why are slice objects not hashable in python
                            
                                How to get image from video using opencv python
                            
                                pandas convert index values to lowercase
                            
                                Get last date in each month of a time series pandas
                            
                                python map function (+ lambda) involving conditionals (if)
                            
                                BeautifulSoup: RuntimeError: maximum recursion depth exceeded
                            
                                How to convert Object with Properties to JSON without "_" in Python 3?
                            
                                How to edit the style of a heading in Treeview (Python ttk)
                            
                                Error while loading Word2Vec model in gensim
                            
                                Django: How to set EDT timezone in settings for Florida
                            
                                Location of stored offline data for cartopy

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With