This code: <pre class="prettyprint"><code>df2 = ( pd.DataFrame({ 'X' : ['X1', 'X1', 'X1', 'X1'], 'Y' : ['Y2', 'Y1', 'Y1', 'Y1'], 'Z' : ['Z3', 'Z1', 'Z1', 'Z2'] }) ) g = df2.groupby('X') pd.pivot_table(g, values='X', rows='Y', cols='Z', margins=False, aggfunc='count') </code></pre> returns the following error: <pre class="prettyprint"><code>Traceback (most recent call last): ... AttributeError: 'Index' object has no attribute 'index' </code></pre> How do I get a Pivot Table with counts of unique values of one DataFrame column for two other columns? Is there <code>aggfunc</code> for count unique? Should I be using <code>np.bincount()</code>? NB. I am aware of <code>pandas.Series.values_counts()</code> however I need a pivot table. <hr> EDIT: The output should be: <pre class="prettyprint"><code>Z Z1 Z2 Z3 Y Y1 1 1 NaN Y2 NaN NaN 1 </code></pre>

Do you mean something like this? <pre class="prettyprint"><code>>>> df2.pivot_table(values='X', index='Y', columns='Z', aggfunc=lambda x: len(x.unique())) Z Z1 Z2 Z3 Y Y1 1 1 NaN Y2 NaN NaN 1 </code></pre> Note that using <code>len</code> assumes you don't have <code>NA</code>s in your DataFrame. You can do <code>x.value_counts().count()</code> or <code>len(x.dropna().unique())</code> otherwise.

This is a good way of counting entries within <code>.pivot_table</code>: <pre class="prettyprint"><code>>>> df2.pivot_table(values='X', index=['Y','Z'], columns='X', aggfunc='count') X1 X2 Y Z Y1 Z1 1 1 Z2 1 NaN Y2 Z3 1 NaN </code></pre>

Python Pandas : pivot table with aggfunc = count unique distinct

Tags:

python

pandas

pivot-table

This code:

Click to copy

df2 = (     pd.DataFrame({         'X' : ['X1', 'X1', 'X1', 'X1'],          'Y' : ['Y2', 'Y1', 'Y1', 'Y1'],          'Z' : ['Z3', 'Z1', 'Z1', 'Z2']     }) ) g = df2.groupby('X') pd.pivot_table(g, values='X', rows='Y', cols='Z', margins=False, aggfunc='count')

returns the following error:

Click to copy

Traceback (most recent call last): ...  AttributeError: 'Index' object has no attribute 'index'

How do I get a Pivot Table with counts of unique values of one DataFrame column for two other columns?
Is there aggfunc for count unique? Should I be using np.bincount()?

NB. I am aware of pandas.Series.values_counts() however I need a pivot table.

EDIT: The output should be:

Click to copy

Z   Z1  Z2  Z3 Y              Y1   1   1 NaN Y2 NaN NaN   1

801

asked Oct 12 '12 13:10

dmi

2 Answers

Do you mean something like this?

Click to copy

>>> df2.pivot_table(values='X', index='Y', columns='Z', aggfunc=lambda x: len(x.unique()))  Z   Z1  Z2  Z3 Y              Y1   1   1 NaN Y2 NaN NaN   1

Note that using len assumes you don't have NAs in your DataFrame. You can do x.value_counts().count() or len(x.dropna().unique()) otherwise.

answered Sep 23 '22 22:09

Chang She

This is a good way of counting entries within .pivot_table:

Click to copy

>>> df2.pivot_table(values='X', index=['Y','Z'], columns='X', aggfunc='count')          X1  X2 Y   Z        Y1  Z1   1   1     Z2   1  NaN Y2  Z3   1  NaN

answered Sep 21 '22 22:09

julian peng

Related questions
                            
                                Change y range to start from 0 with matplotlib
                            
                                Python mock Patch os.environ and return value
                            
                                Matplotlib overlapping annotations / text
                            
                                How to export Keras .h5 to tensorflow .pb?
                            
                                How to print the sign + of a digit for positive numbers in Python
                            
                                How to implement server push in Flask framework?
                            
                                NameError: name 'List' is not defined
                            
                                How to join on multiple columns in Pyspark?
                            
                                Why is 'a' in ('abc') True while 'a' in ['abc'] is False?
                            
                                TextField missing in django.forms
                            
                                Can't open lib 'ODBC Driver 13 for SQL Server'? Sym linking issue?
                            
                                Docker-compose and pdb
                            
                                How to get more than 1000 objects from S3 by using list_objects_v2?
                            
                                Finding duplicate files and removing them
                            
                                How would you do the equivalent of preprocessor directives in Python?
                            
                                shuffle string in python
                            
                                TypeError: get() takes no keyword arguments
                            
                                How do I access (read, write) Google Sheets spreadsheets with Python?
                            
                                Python check if website exists
                            
                                Read from File, or STDIN

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Pandas : pivot table with aggfunc = count unique distinct

Tags:

python

pandas

pivot-table

dmi

People also ask

2 Answers

Chang She

julian peng

Recent Activity

Donate For Us