How to find duplicate names using pandas?

I have a pandas.DataFrame with a column called name containing strings. I would like to get a list of the names which occur more than once in the column. How do I do that?

I tried:

funcs_groups = funcs.groupby(funcs.name)
funcs_groups[(funcs_groups.count().name>1)]

But it doesn't filter out the singleton names.

asked Mar 06 '13 12:03

Yariv

2 Answers

If you want to find the rows with a duplicated name (excluding the first occurrence of each name), you can try this:

In [16]: import pandas as pd
In [17]: p1 = {'name': 'willy', 'age': 10}
In [18]: p2 = {'name': 'willy', 'age': 11}
In [19]: p3 = {'name': 'zoe', 'age': 10}
In [20]: df = pd.DataFrame([p1, p2, p3])

In [21]: df
Out[21]:
   age   name
0   10  willy
1   11  willy
2   10    zoe

In [22]: df.duplicated('name')
Out[22]:
0    False
1     True
2    False
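
To turn that boolean mask into the list of names the question asks for, one possible sketch is shown here. The variable names `mask` and `dup_names` are illustrative and not part of the original answer.

```python
import pandas as pd

df = pd.DataFrame([
    {'name': 'willy', 'age': 10},
    {'name': 'willy', 'age': 11},
    {'name': 'zoe', 'age': 10},
])

# True for every occurrence of a name after its first appearance
mask = df['name'].duplicated()

# Select the flagged names, then collapse repeats in case a name occurs 3+ times
dup_names = df.loc[mask, 'name'].unique().tolist()
print(dup_names)  # ['willy']
```

Passing `keep=False` to `duplicated` marks every occurrence, including the first, which may be more convenient if you want all of the affected rows rather than just the names.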

answered Oct 21 '22 11:10

waitingkuo

A one-liner can be:

x.set_index('name').index.get_duplicates()

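The index has a method for finding duplicates; columns do not seem to have a similar method. Note that Index.get_duplicates() was deprecated in pandas 0.23 and removed in pandas 1.0, so this one-liner only works on older versions.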
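
For pandas versions without `get_duplicates()`, a roughly equivalent sketch uses `Index.duplicated()`, with `value_counts()` on the column as an alternative. The frame `x` below is made-up sample data, and `dup_from_index` and `dup_from_counts` are illustrative names, not part of the original answer.

```python
import pandas as pd

# Hypothetical sample data standing in for the frame called x above
x = pd.DataFrame({
    'name': ['willy', 'willy', 'zoe', 'amy', 'amy', 'amy'],
    'age': [10, 11, 10, 7, 8, 9],
})

idx = x.set_index('name').index

# duplicated() flags repeats after the first occurrence; unique() collapses them
dup_from_index = idx[idx.duplicated()].unique().tolist()

# Alternative: count each name and keep those seen more than once
counts = x['name'].value_counts()
dup_from_counts = sorted(counts[counts > 1].index.tolist())

print(dup_from_index)
print(dup_from_counts)
```

Both variants return each repeated name once, however many times it occurs; the `value_counts()` route also gives you the number of occurrences if you need it.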

answered Oct 21 '22 13:10

idoda
