I want to pivot a pandas dataframe without aggregation, and instead of presenting the pivot index column vertically I want to present it horizontally. I tried with <code>pd.pivot_table</code> but I'm not getting exactly what I wanted. <pre class="prettyprint"><code>data = {'year': [2011, 2011, 2012, 2013, 2013], 'A': [10, 21, 20, 10, 39], 'B': [12, 45, 19, 10, 39]} df = pd.DataFrame(data) print df A B year 0 10 12 2011 1 21 45 2011 2 20 19 2012 3 10 10 2013 4 39 39 2013 </code></pre> But I want to have: <pre class="prettyprint"><code>year 2011 2012 2013 cols A B A B A B 0 10 12 20 19 10 10 1 21 45 NaN NaN 39 39 </code></pre>

<code>groupby('year')</code> so I can <code>reset_index</code> to get index values of <code>0</code> and <code>1</code>. Then do a bunch of clean up. <pre class="prettyprint"><code>df.groupby('year')['A', 'B'] \ .apply(lambda df: df.reset_index(drop=True)) \ .unstack(0).swaplevel(0, 1, 1).sort_index(1) </code></pre> <img src="https://i.stack.imgur.com/Fmol8.png" alt="enter image description here">

Pandas pivot table arrangement no aggregation

Tags:

python

pandas

dataframe

pivot

I want to pivot a pandas dataframe without aggregation, and instead of presenting the pivot index column vertically I want to present it horizontally. I tried with pd.pivot_table but I'm not getting exactly what I wanted.

data = {'year': [2011, 2011, 2012, 2013, 2013],
        'A': [10, 21, 20, 10, 39],
        'B': [12, 45, 19, 10, 39]}

df = pd.DataFrame(data)
print df
    A   B  year
0  10  12  2011
1  21  45  2011
2  20  19  2012
3  10  10  2013
4  39  39  2013

But I want to have:

year      2011     2012      2013
cols     A    B   A    B    A    B
0       10    12  20   19   10   10
1       21    45  NaN  NaN  39   39

207

asked Jul 27 '16 07:07

DougKruger

2 Answers

groupby('year') so I can reset_index to get index values of 0 and 1. Then do a bunch of clean up.

df.groupby('year')['A', 'B'] \
    .apply(lambda df: df.reset_index(drop=True)) \
    .unstack(0).swaplevel(0, 1, 1).sort_index(1)

enter image description here

141

answered Oct 24 '22 16:10

piRSquared

You can first create column for new index by cumcount, then stack with unstack:

df['g'] = df.groupby('year')['year'].cumcount()
df1 = df.set_index(['g','year']).stack().unstack([1,2])
print (df1)

year  2011        2012        2013      
         A     B     A     B     A     B
g                                       
0     10.0  12.0  20.0  19.0  10.0  10.0
1     21.0  45.0   NaN   NaN  39.0  39.0

If need set columns names use rename_axis (new in pandas 0.18.0):

df['g'] = df.groupby('year')['year'].cumcount()
df1 = df.set_index(['g','year'])
        .stack()
        .unstack([1,2])
        .rename_axis(None)
        .rename_axis(('year','cols'), axis=1)
print (df1)
year  2011        2012        2013      
cols     A     B     A     B     A     B
0     10.0  12.0  20.0  19.0  10.0  10.0
1     21.0  45.0   NaN   NaN  39.0  39.0

Another solution with pivot, but you need swap first and second level of Multiindex in columns by swaplevel and then sort it by sort_index:

df['g'] = df.groupby('year')['year'].cumcount()
df1 = df.pivot(index='g', columns='year')
df1 = df1.swaplevel(0,1, axis=1).sort_index(axis=1)
print (df1)
year  2011        2012        2013      
         A     B     A     B     A     B
g                                       
0     10.0  12.0  20.0  19.0  10.0  10.0
1     21.0  45.0   NaN   NaN  39.0  39.0
print (df1)

year  2011        2012        2013      
         A     B     A     B     A     B
g                                       
0     10.0  12.0  20.0  19.0  10.0  10.0
1     21.0  45.0   NaN   NaN  39.0  39.0

answered Oct 24 '22 14:10

jezrael

Related questions
                            
                                Finding head of a noun phrase in NLTK and stanford parse according to the rules of finding head of a NP
                            
                                pyspark: TypeError: IntegerType can not accept object in type <type 'unicode'>
                            
                                Assigning to vs. from a slice
                            
                                Pass a parameter to Ansible's dynamic inventory
                            
                                How to continue a frame execution from last attempted instruction after handling an exception?
                            
                                unable to open jupyter(ipython) notebook on browser
                            
                                Python Traceback (most recent call last) [duplicate]
                            
                                matplotlib legend: How to specify font weight?
                            
                                How to read data into Tensorflow?
                            
                                PyCharm - how to suspend all threads
                            
                                Display cluster labels for a scipy dendrogram
                            
                                Static files not found with webpack and django
                            
                                Processing time gets longer and longer after each iteration (TensorFlow)
                            
                                MQTT Paho Python reliable reconnect
                            
                                What is the difference between these two ways of adding Neural Network layers in Keras?
                            
                                Thread starvation while locking in a loop in Python
                            
                                How to avoid brackets in SQL around Django custom database function call?
                            
                                Using py.test with compiled library code
                            
                                controlling the x ticks date values
                            
                                Python MD5 Cracker "TypeError: object supporting the buffer API required"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With