I have a DataFrame with duplicated rows in its index. I'd like to get a DataFrame with a unique index and no duplicates; it's fine to discard the duplicated values. Is this possible? Would it be done with groupby?
For background: the pandas unique function returns the unique values of a Series (a single column of a DataFrame) as a NumPy array. It is hash-table based, returns values in order of appearance, and works on Series of strings, integers, tuples, or mixed elements. To select unique rows across specific columns of a DataFrame, use df = df.drop_duplicates(subset=['col1', 'col2', ...]).
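A minimal sketch of both calls on a throwaway frame (the column names and values here are illustrative, not from the question):

```python
import pandas as pd

# Hypothetical frame with one fully duplicated row
df = pd.DataFrame({'col1': [1, 1, 2], 'col2': ['a', 'a', 'b']})

# Unique values of a single column, in order of appearance
vals = df['col1'].unique()

# Drop rows that are duplicated across the listed columns
deduped = df.drop_duplicates(subset=['col1', 'col2'])
```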
In [29]: df.drop_duplicates()
Out[29]:
   b  c
1  2  3
3  4  0
7  5  9
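The session above can be reproduced with a small frame carrying a duplicated index label. Note that drop_duplicates compares row values, not the index; it removes the second row here only because its values repeat too:

```python
import pandas as pd

df = pd.DataFrame({'b': [2, 2, 4, 5], 'c': [3, 3, 0, 9]}, index=[1, 1, 3, 7])

# Keeps the first of each group of identical rows
result = df.drop_duplicates()
```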
Figured out one way to do it by reading the split-apply-combine documentation examples.

import pandas
df = pandas.DataFrame({'b': [2, 2, 4, 5], 'c': [3, 3, 0, 9]}, index=[1, 1, 3, 7])
df_unique = df.groupby(level=0).first()

df
   b  c
1  2  3
1  2  3
3  4  0
7  5  9

df_unique
   b  c
1  2  3
3  4  0
7  5  9
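A self-contained version of that groupby approach, plus an equivalent idiom using Index.duplicated that avoids the groupby entirely (the second variant is my addition, not from the answer above):

```python
import pandas as pd

df = pd.DataFrame({'b': [2, 2, 4, 5], 'c': [3, 3, 0, 9]}, index=[1, 1, 3, 7])

# Keep the first row for each index label via split-apply-combine
df_unique = df.groupby(level=0).first()

# Equivalent: keep rows whose index label has not been seen before
df_unique2 = df[~df.index.duplicated(keep='first')]
```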