Drop duplicates of one column based on value in another column, Python, Pandas

I have a dataframe like this:

Date                 PlumeO      Distance
2014-08-13 13:48:00  754.447905  5.844577
2014-08-13 13:48:00  754.447905  6.888653
2014-08-13 13:48:00  754.447905  6.938860
2014-08-13 13:48:00  754.447905  6.977284
2014-08-13 13:48:00  754.447905  6.946430
2014-08-13 13:48:00  754.447905  6.345506
2014-08-13 13:48:00  754.447905  6.133567
2014-08-13 13:48:00  754.447905  5.846046
2014-08-13 16:59:00  754.447905  6.345506
2014-08-13 16:59:00  754.447905  6.694847
2014-08-13 16:59:00  754.447905  5.846046
2014-08-13 16:59:00  754.447905  6.977284
2014-08-13 16:59:00  754.447905  6.938860
2014-08-13 16:59:00  754.447905  5.844577
2014-08-13 16:59:00  754.447905  6.888653
2014-08-13 16:59:00  754.447905  6.133567
2014-08-13 16:59:00  754.447905  6.946430

For each date, I want to keep only the row with the smallest distance. In other words, I want to drop the duplicate dates and keep the one with the smallest distance.

Is there a way to achieve this with pandas' df.drop_duplicates, or am I stuck using if statements to find the smallest distance?

asked Jul 12 '17 13:07

Ahmed

1 Answer

Sort by distance, then drop duplicate dates, keeping the first (smallest-distance) row for each date:

df.sort_values('Distance').drop_duplicates(subset='Date', keep='first')
Out: 
                   Date      PlumeO  Distance
0   2014-08-13 13:48:00  754.447905  5.844577
13  2014-08-13 16:59:00  754.447905  5.844577
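
For anyone who wants to try this end to end, here is a minimal, self-contained sketch. It rebuilds the question's data by hand (the variable names `closest` and `alt` are mine, not from the original post), runs the sort-then-drop approach above, and compares it with a `groupby`/`idxmin` alternative that should select the same rows. Treat it as an illustration under those assumptions rather than the only way to do it.

```python
import pandas as pd

# Rebuild the sample data from the question (17 rows, 2 distinct dates).
df = pd.DataFrame({
    "Date": pd.to_datetime(
        ["2014-08-13 13:48:00"] * 8 + ["2014-08-13 16:59:00"] * 9
    ),
    "PlumeO": [754.447905] * 17,
    "Distance": [
        5.844577, 6.888653, 6.938860, 6.977284,
        6.946430, 6.345506, 6.133567, 5.846046,
        6.345506, 6.694847, 5.846046, 6.977284, 6.938860,
        5.844577, 6.888653, 6.133567, 6.946430,
    ],
})

# Approach from the answer: sort ascending by Distance, then keep the
# first row seen for each Date. sort_index() only restores row order.
closest = (
    df.sort_values("Distance")
      .drop_duplicates(subset="Date", keep="first")
      .sort_index()
)

# Alternative sketch: look up the index label of the minimum Distance
# within each Date group and select those rows directly.
alt = df.loc[df.groupby("Date")["Distance"].idxmin()]

print(closest)
```

Both variants keep one row per date. If two rows for the same date tie on the minimum distance, which of them survives the sort-based version depends on the sort order, so pass `kind="stable"` to `sort_values` if that matters for your data.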

answered Nov 15 '22 22:11

ayhan

Tags:

python

pandas

dataframe

conditional-statements

duplicates
