Pandas group by on one column with max date on another column python

Tags:

pandas

python-2.7

i have a dataframe with following data :

invoice_no  dealer  billing_change_previous_month        date
       110       1                              0  2016-12-31
       100       1                         -41981  2017-01-30
      5505       2                              0  2017-01-30
      5635       2                          58730  2016-12-31

i want to have only one dealer with the maximum date . The desired output should be like this :

invoice_no  dealer  billing_change_previous_month        date
       100       1                         -41981  2017-01-30
      5505       2                              0  2017-01-30

each dealer should be distinct with maximum date, thanks in advance for your help.

596

asked Feb 12 '18 19:02

Anurag Rawat

1 Answers

You can use boolean indexing using groupby and transform

df_new = df[df.groupby('dealer').date.transform('max') == df['date']]

    invoice_no  dealer  billing_change_previous_month   date
1   100         1       -41981                          2017-01-30
2   5505        2       0                               2017-01-30

The solution works as expected even if there are more than two dealers (to address question posted by Ben Smith),

df = pd.DataFrame({'invoice_no':[110,100,5505,5635,10000,10001], 'dealer':[1,1,2,2,3,3],'billing_change_previous_month':[0,-41981,0,58730,9000,100], 'date':['2016-12-31','2017-01-30','2017-01-30','2016-12-31', '2019-12-31', '2020-01-31']})

df['date'] = pd.to_datetime(df['date'])
df[df.groupby('dealer').date.transform('max') == df['date']]


    invoice_no  dealer  billing_change_previous_month   date
1   100         1       -41981                          2017-01-30
2   5505        2       0                               2017-01-30
5   10001       3       100                             2020-01-31

105

answered Sep 20 '22 16:09

Vaishali

Related questions
                            
                                Dictionary size reduces upon increasing one element
                            
                                Weak reference to Python class method
                            
                                How to plot a rectangle on a datetime axis using matplotlib?
                            
                                How to set proxy authentication (user & password) using Python + Selenium
                            
                                fixing words with spaces using a dictionary look up in python?
                            
                                pip broken, reinstall doesn't work. EC2
                            
                                How can I create an in-memory database with sqlite?
                            
                                How to count top 10 most common values in a dict in python
                            
                                Python: PIP install path, what is the correct location for this and other addons?
                            
                                php shell_exec() command is not working
                            
                                LINK : fatal error LNK1104: cannot open file 'python27.lib'
                            
                                Scrapy .css select element with a specific attribute name and value
                            
                                TypeError: argument of type 'NoneType' is not iterable
                            
                                Extract file from file storage object in flask
                            
                                XML (.xsd) feed validation against a schema
                            
                                Exception: Cannot find PyQt5 plugin directories when using Pyinstaller despite PyQt5 not even being used
                            
                                Text File Parsing with Python
                            
                                Pandas: SettingWithCopyWarning [duplicate]
                            
                                celery: daemonic processes are not allowed to have children
                            
                                pip install gives me this error "can't open file 'pip': [Errno 2] No such file or directory"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With