Split/Expand Dataframe based on column values

Tags:

python

pandas

I have a DataFrame like the one below, with an identifier column (id) alongside an existing DatetimeIndex, so each date appears once per id.

import pandas as pd

df = pd.DataFrame(
    index=pd.to_datetime(['2021-01-01', '2021-01-01', '2021-01-02',
                          '2021-01-02', '2021-01-03', '2021-01-03']),
    columns=['id', 'A', 'B'],
    data=[['foo', 1, 5], ['bar', 8, 12], ['foo', 7, 1],
          ['bar', 5, 1], ['foo', 4, 3], ['bar', 7, 1]],
)

Out[6]: 
             id  A   B
2021-01-01  foo  1   5
2021-01-01  bar  8  12
2021-01-02  foo  7   1
2021-01-02  bar  5   1
2021-01-03  foo  4   3
2021-01-03  bar  7   1

My goal is to create a separate sub-DataFrame for each column other than id (here A and B), with the dates as a single index and the id values (foo, bar) as column names. The expected output is shown below:

A
Out[9]: 
            foo  bar
2021-01-01    1    8
2021-01-02    7    5
2021-01-03    4    7

B
Out[11]: 
            foo  bar
2021-01-01    5   12
2021-01-02    1    1
2021-01-03    3    1

asked Mar 17 '21 15:03

ylnor

2 Answers

A, B = map(df.set_index('id', append=True).unstack().get, ['A', 'B'])

print(A)

id          bar  foo
2021-01-01    8    1
2021-01-02    5    7
2021-01-03    7    4

print(B)

id          bar  foo
2021-01-01   12    5
2021-01-02    1    1
2021-01-03    1    3
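
The one-liner above is compact, so here is a rough step-by-step sketch of what it appears to do, rebuilt on the sample data from the question. The intermediate names (indexed, wide) are made up for illustration, and the final reordering of the columns to foo, bar is an addition that is not part of the answer.

```python
import pandas as pd

# Sample frame from the question.
df = pd.DataFrame(
    index=pd.to_datetime(['2021-01-01', '2021-01-01', '2021-01-02',
                          '2021-01-02', '2021-01-03', '2021-01-03']),
    columns=['id', 'A', 'B'],
    data=[['foo', 1, 5], ['bar', 8, 12], ['foo', 7, 1],
          ['bar', 5, 1], ['foo', 4, 3], ['bar', 7, 1]],
)

# Step 1: append 'id' to the index, giving a (date, id) MultiIndex.
indexed = df.set_index('id', append=True)

# Step 2: move the innermost index level ('id') into the columns.
# The columns become a MultiIndex of (value column, id) pairs.
wide = indexed.unstack()

# Step 3: selecting a top-level label yields one frame per value column.
A, B = wide['A'], wide['B']

# Optional (not in the answer): unstack sorts the new column level,
# so reorder explicitly to match the foo, bar order in the question.
A = A[['foo', 'bar']]
B = B[['foo', 'bar']]

print(A)
print(B)
```

Note that unstack needs each (date, id) pair to be unique; with duplicate pairs it raises a ValueError.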

answered Oct 11 '22 22:10

piRSquared

This is simply:

out = df.set_index('id',append=True).unstack('id')
# if you have columns other than `A`,`B`:
# out = df.set_index('id',append=True)[['A','B']].unstack('id')

Then you can select one value column at a time:

out['A']

which gives:

id          bar  foo
2021-01-01    8    1
2021-01-02    5    7
2021-01-03    7    4

and similarly for out['B']. I find this much easier and less error-prone than hard-coding the variables A and B.
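
If the set of value columns is not known in advance, one possible extension is to collect the sub-frames in a dict keyed by column name. This is only a sketch on the question's sample data; the name frames is hypothetical and does not come from the answer.

```python
import pandas as pd

# Sample frame from the question.
df = pd.DataFrame(
    index=pd.to_datetime(['2021-01-01', '2021-01-01', '2021-01-02',
                          '2021-01-02', '2021-01-03', '2021-01-03']),
    columns=['id', 'A', 'B'],
    data=[['foo', 1, 5], ['bar', 8, 12], ['foo', 7, 1],
          ['bar', 5, 1], ['foo', 4, 3], ['bar', 7, 1]],
)

out = df.set_index('id', append=True).unstack('id')

# Level 0 of the column MultiIndex holds the original value columns (A, B).
# Build one sub-frame per label without naming them explicitly.
frames = {col: out[col] for col in out.columns.get_level_values(0).unique()}

for name, frame in frames.items():
    print(name)
    print(frame)
```

Each entry, for example frames['A'], is the same frame that out['A'] returns.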

answered Oct 11 '22 21:10

Quang Hoang
