If I want to drop duplicated index in a dataframe the following doesn't work for obvious reasons: <pre class="prettyprint"><code>myDF.drop_duplicates(cols=index) </code></pre> and <pre class="prettyprint"><code>myDF.drop_duplicates(cols='index') </code></pre> looks for a column named 'index' If I want to drop an index I have to do: <pre class="prettyprint"><code>myDF['index'] = myDF.index myDF= myDF.drop_duplicates(cols='index') myDF.set_index = myDF['index'] myDF= myDF.drop('index', axis =1) </code></pre> Is there a more efficient way?

Simply: <code>DF.groupby(DF.index).first()</code>

Fastest Way to Drop Duplicated Index in a Pandas DataFrame [duplicate]

Tags:

python

pandas

duplicate-removal

If I want to drop duplicated index in a dataframe the following doesn't work for obvious reasons:

myDF.drop_duplicates(cols=index)

and

myDF.drop_duplicates(cols='index')

looks for a column named 'index'

If I want to drop an index I have to do:

myDF['index'] = myDF.index myDF= myDF.drop_duplicates(cols='index') myDF.set_index = myDF['index'] myDF= myDF.drop('index', axis =1)

Is there a more efficient way?

344

asked Apr 07 '14 16:04

RukTech

2 Answers

Simply: DF.groupby(DF.index).first()

161

answered Sep 19 '22 16:09

CT Zhu

The 'duplicated' method works for dataframes and for series. Just select on those rows which aren't marked as having a duplicate index:

df[~df.index.duplicated()]

answered Sep 20 '22 16:09

danielstn

Related questions
                            
                                sort csv by column
                            
                                usleep in Python
                            
                                networkx add_node with specific position
                            
                                How to install SimpleJson Package for Python
                            
                                How do I subtract two dates in Django/Python?
                            
                                How do you set a conditional in python based on datatypes?
                            
                                Writing UTF-8 String to MySQL with Python
                            
                                Bottle framework and OOP, using method instead of function
                            
                                Python - Download Images from google Image search?
                            
                                Running a Python script outside of Django
                            
                                differences between "d = dict()" and "d = {}"
                            
                                Possible to append multiple lists at once? (Python)
                            
                                Convert percent string to float in pandas read_csv
                            
                                In Python, is it better to use list comprehensions or for-each loops?
                            
                                Find the root of the git repository where the file lives
                            
                                python requests module and connection reuse
                            
                                Count and Sort with Pandas
                            
                                install cx_oracle for python
                            
                                The pythonic way to generate pairs
                            
                                matplotlib label doesn't work

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With