I'm working in pandas doing pivot tables and when doing the groupby (to count distinct observations) <code>aggfunc={"person":{lambda x: len(x.unique())}}</code> gives me the following error: <code>'DataFrame' object has no attribute 'unique'</code> any ideas how to fix it?

One very easy solution to get the unique combinations of >1 columns from a DF is the following: <pre class="prettyprint"><code>unique_A_B_combos = df[['A', 'B']].value_counts().index.values </code></pre>

Rather than removing duplicates during the pivot table process, use the <code>df.drop_duplicates()</code> function to selectively drop duplicates. For example if you are pivoting using these <code>index='c0'</code> and <code>columns='c1'</code> then this simple step yields the correct counts. In this example the 5th row is a duplicate of the 4th (ignoring the non-pivoted <code>c2</code> column <pre class="prettyprint"><code>import pandas as pd data = {'c0':[0,1,0,1,1], 'c1':[0,0,1,1,1], 'person':[0,0,1,1,1], 'c_other':[1,2,3,4,5]} df = pd.DataFrame(data) df2 = df.drop_duplicates(subset=['c0','c1','person']) pd.pivot_table(df2, index='c0',columns='c1',values='person', aggfunc='count') </code></pre> This correctly outputs <pre class="prettyprint"><code>c1 0 1 c0 0 1 1 1 1 1 </code></pre>

<pre class="prettyprint"><code>df[['col1', 'col2']].nunique() </code></pre> Try this instead of separate function

Pandas 'DataFrame' object has no attribute 'unique'

Tags:

python

pandas

pivot-table

I'm working in pandas doing pivot tables and when doing the groupby (to count distinct observations) aggfunc={"person":{lambda x: len(x.unique())}} gives me the following error: 'DataFrame' object has no attribute 'unique' any ideas how to fix it?

697

asked Mar 24 '15 23:03

jwzinserl

4 Answers

One very easy solution to get the unique combinations of >1 columns from a DF is the following:

unique_A_B_combos = df[['A', 'B']].value_counts().index.values

111

answered Sep 20 '22 15:09

emilaz

DataFrames do not have that method; columns in DataFrames do:

df['A'].unique()

Or, to get the names with the number of observations (using the DataFrame given by closedloop):

>>> df.groupby('person').person.count()
Out[80]: 
person
0         2
1         3
Name: person, dtype: int64

answered Sep 17 '22 15:09

Alexander

Rather than removing duplicates during the pivot table process, use the df.drop_duplicates() function to selectively drop duplicates.

For example if you are pivoting using these index='c0' and columns='c1' then this simple step yields the correct counts.

In this example the 5th row is a duplicate of the 4th (ignoring the non-pivoted c2 column

import pandas as pd
data = {'c0':[0,1,0,1,1], 'c1':[0,0,1,1,1], 'person':[0,0,1,1,1], 'c_other':[1,2,3,4,5]}
df = pd.DataFrame(data)
df2 = df.drop_duplicates(subset=['c0','c1','person'])
pd.pivot_table(df2, index='c0',columns='c1',values='person', aggfunc='count')

This correctly outputs

answered Sep 19 '22 15:09

closedloop

df[['col1', 'col2']].nunique()

Try this instead of separate function

answered Sep 16 '22 15:09

Амир Джанибеков

Related questions
                            
                                Find next lower item in a sorted list
                            
                                How to remove whitespace in BeautifulSoup
                            
                                How would I use django.forms to prepopulate a choice field with rows from a model?
                            
                                How to use the option skip-name-resolve when using MySQLdb for Python?
                            
                                Don't split double-quoted words with Python string split()?
                            
                                How to remove all items from many-to-many collection in SqlAlchemy?
                            
                                Why is python list comprehension sometimes frowned upon?
                            
                                In python, how to convert a hex ascii string to raw internal binary string?
                            
                                How to create Dict from array in python
                            
                                Python source code for built-in "in" operator
                            
                                all python windows service can not start{error 1053}
                            
                                Django: How to get the root path of a site in template?
                            
                                Setuptools platform specific dependencies
                            
                                Why does list.append() return None? [duplicate]
                            
                                Is possible to mapping view with class using mapper in SqlAlchemy?
                            
                                Python Change Master/Application Volume
                            
                                How to get the numbers of data rows from sqlite table in python
                            
                                how to retrieve the selected row of a QTableView?
                            
                                How to format cell with datetime object of the form 'yyyy-mm-dd hh:mm:ss' in Excel using openpyxl
                            
                                python reading in multi-column tsv file with row numbers

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With