Pivot Tables or Group By for Pandas?

Tags:

I have a hopefully straightforward question that has been giving me a lot of difficulty for the last 3 hours. It should be easy.

Here's the challenge.

I have a pandas dataframe:

+--------------------------+ |     Col 'X'    Col 'Y'  | +--------------------------+ |     class 1      cat 1  | |     class 2      cat 1  | |     class 3      cat 2  | |     class 2      cat 3  | +--------------------------+

What I am looking to transform the dataframe into:

+------------------------------------------+ |                  cat 1    cat 2    cat 3 | +------------------------------------------+ |     class 1         1        0        0  | |     class 2         1        0        1  | |     class 3         0        1        0  | +------------------------------------------+

Where the values are value counts. Anybody have any insight? Thanks!

518

asked Jun 06 '15 05:06

SteelyDanish

1 Answers

Here are couple of ways to reshape your data df

In [27]: df Out[27]:      Col X  Col Y 0  class 1  cat 1 1  class 2  cat 1 2  class 3  cat 2 3  class 2  cat 3

1) Using pd.crosstab()

In [28]: pd.crosstab(df['Col X'], df['Col Y']) Out[28]: Col Y    cat 1  cat 2  cat 3 Col X class 1      1      0      0 class 2      1      0      1 class 3      0      1      0

2) Or, use groupby on 'Col X','Col Y' with unstack over Col Y, then fill NaNs with zeros.

In [29]: df.groupby(['Col X','Col Y']).size().unstack('Col Y', fill_value=0) Out[29]: Col Y    cat 1  cat 2  cat 3 Col X class 1      1      0      0 class 2      1      0      1 class 3      0      1      0

3) Or, use pd.pivot_table() with index=Col X, columns=Col Y

In [30]: pd.pivot_table(df, index=['Col X'], columns=['Col Y'], aggfunc=len, fill_value=0) Out[30]: Col Y    cat 1  cat 2  cat 3 Col X class 1      1      0      0 class 2      1      0      1 class 3      0      1      0

4) Or, use set_index with unstack

In [492]: df.assign(v=1).set_index(['Col X', 'Col Y'])['v'].unstack(fill_value=0) Out[492]: Col Y    cat 1  cat 2  cat 3 Col X class 1      1      0      0 class 2      1      0      1 class 3      0      1      0

175

answered Sep 18 '22 17:09

Zero

Related questions
                            
                                In Django, how do I select 100 random records from the database? [duplicate]
                            
                                How can I infinitely loop an iterator in Python, via a generator or other?
                            
                                Breakpoint-induced interactive debugging of Python with IPython
                            
                                How can I save a list of dictionaries to a file?
                            
                                Error on amazon SES: SendEmail operation: Illegal addres
                            
                                How to use select_for_update to 'get' a Query in Django?
                            
                                Django - Cannot create migrations for ImageField with dynamic upload_to value
                            
                                Django SMTPAuthenticationError
                            
                                Matplotlib, horizontal bar chart (barh) is upside-down
                            
                                pandas datetime to unix timestamp seconds
                            
                                How do I get the client IP of a Tornado request?
                            
                                What is a "code object" mentioned in this TypeError message?
                            
                                Accessing Python dict values with the key start characters
                            
                                How do I use multiple conditions with pyspark.sql.functions.when()?
                            
                                Replace empty strings with None/null values in DataFrame
                            
                                virtualenv(python3.4), pip install mysqlclient error
                            
                                How to convert one-hot encodings into integers?
                            
                                What do I need to read Microsoft Access databases using Python?
                            
                                Connecting to a remote IPython instance
                            
                                How to pass a variable between Flask pages?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pivot Tables or Group By for Pandas?

Tags:

python

pandas

count

group-by

pivot-table

SteelyDanish

People also ask

1 Answers

Zero

Recent Activity

Donate For Us