Multi-index pivoting in Pandas

Tags:

Consider the following dataframe:

         item_id  hour    when        date      quantity
110   0YrKNYeEoa     1  before  2015-01-26        247286
111   0UMNiXI7op     1  before  2015-01-26        602001
112   0QBtIMN3AH     1  before  2015-01-26        981630
113   0GuKXLiWyV     1  after   2015-01-26       2203913
114   0SoFbjvXTs     1  after   2015-01-26        660183
115   0UkT257SXj     1  before  2015-01-26        689332
116   0RPjXnkiGx     1  after   2015-01-26        283090
117   0FhJ9RGsLT     1  before  2015-01-26       2024256
118   0FhGJ4MFlg     1  before  2015-01-26         74524
119   0FQhHZRXhB     1  before  2015-01-26             0
120   0FsSdJQlTB     1  before  2015-01-26             0
121   0FrrAzTFHE     1  before  2015-01-26             0
122   0FfkgBdMHi     1  before  2015-01-26             0
123   0FOnJNexRn     1  before  2015-01-26             0
124   0FcWhIdBds     1  before  2015-01-26             0
125   0F2lr0cL9t     1  before  2015-01-26       1787659

I would like to pivot it to get the table arranged as:

Index                     before           after
(item_id, hour, date)   quantityB      quantityA

When I try with:

df.pivot(index=['item_id', 'hour', 'date'], columns='when', values='quanty')

I get:

ValueError: Wrong number of items passed 8143, placement implies 3

Why?

233

asked Jan 29 '15 22:01

Amelio Vazquez-Reina

1 Answers

If I understand what you are asking I think what you want is pandas.pivot_table(...) which you can use like so:

table = pd.pivot_table(df, index=['item_id', 'hour', 'date'], columns='when', values='quantity')

which with a sample data frame of

    item_id  hour  when      date     quantity
0       a     1  before  2015-01-26        25
1       b     1  before  2015-01-26        14
2       a     1   after  2015-01-26         4
3       d     1  before  2015-01-26        43
4       b     1   after  2015-01-26        30
5       d     1   after  2015-01-26        12

produces

when                     after  before
item_id hour date                     
a       1    2015-01-26      4      25
b       1    2015-01-26     30      14
d       1    2015-01-26     12      43

151

answered Sep 25 '22 12:09

alacy

Related questions
                            
                                Performance of row vs column operations in NumPy
                            
                                2.7 CSV module wants unicode, but doesn't want unicode
                            
                                Auto-creating related objects on model creation in Django
                            
                                How to use unicode characters with PIL?
                            
                                Kivy to Apk in Windows
                            
                                How do I concatenate many objects into one object using inheritance in python? (during runtime)
                            
                                How to disable Flask-Cache caching
                            
                                Python implementation of the laplacian of gaussian edge detection
                            
                                Python multiprocessing - watch a process and restart it when fails
                            
                                Choose at random from combinations
                            
                                Python Non negative Matrix Factorization that handles both zeros and missing data?
                            
                                What does PuLP LpStatus=Undefined actually mean?
                            
                                Using custom methods in filter with django-rest-framework
                            
                                Generating low discrepancy quasi-random sequences in python/numpy/scipy?
                            
                                How to test coverage properly with Django + Nose
                            
                                Python: strftime() UTC Offset Not working as Expected in Windows
                            
                                Installing Pylab/Matplotlib
                            
                                How does one print a Unicode character code in Python?
                            
                                how to directly import now() from datetime.datetime submodule
                            
                                SAML 2.0 Service Provider in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Multi-index pivoting in Pandas

Tags:

python

pandas

pivot

Amelio Vazquez-Reina

People also ask

1 Answers

alacy

Recent Activity

Donate For Us