How to get value of a column based on the maximum of another column in case of DataFrame.groupby

I have a dataframe which looks like this.

 id  YearReleased          Artist  count
168          2015            Muse      1
169          2015         Rihanna      3
170          2015    Taylor Swift      2
171          2016  Jennifer Lopez      1
172          2016         Rihanna      3
173          2016      Underworld      1
174          2017        Coldplay      1
175          2017      Ed Sheeran      2

I want to get the maximum count for each year and then get the corresponding Artist name.

Something like this:

YearReleased  Artist
2015          Rihanna
2016          Rihanna
2017          Ed Sheeran

I tried looping over the rows of the dataframe and building a dictionary with the year as the key and the artist as the value. But when I convert that dictionary to a dataframe, the keys become columns instead of rows.

Is there a better approach that avoids looping over the dataframe and uses a built-in pandas method instead?

asked Mar 13 '18 18:03

Jeet Banerjee

1 Answer

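Look at idxmax: group by YearReleased, take the index label of the row with the largest count in each group, and pass those labels to loc.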

# one row per year: the row whose count is the largest within that year
df.loc[df.groupby('YearReleased')['count'].idxmax()]
Out[445]: 
    id  YearReleased      Artist  count
1  169          2015     Rihanna      3
4  172          2016     Rihanna      3
7  175          2017  Ed Sheeran      2
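
For anyone who wants to run this end to end, here is a minimal sketch that rebuilds the sample data from the question and keeps only the two columns that were asked for. The default 0..7 index and the variable names (top_rows, result, with_ties) are assumptions for illustration, not part of the original post. The last two lines show one possible variant using transform('max'), in case rows tied for the maximum should all be kept.

```python
import pandas as pd

# Sample data copied from the question; the default RangeIndex (0..7) is an assumption.
df = pd.DataFrame({
    "id": [168, 169, 170, 171, 172, 173, 174, 175],
    "YearReleased": [2015, 2015, 2015, 2016, 2016, 2016, 2017, 2017],
    "Artist": ["Muse", "Rihanna", "Taylor Swift", "Jennifer Lopez",
               "Rihanna", "Underworld", "Coldplay", "Ed Sheeran"],
    "count": [1, 3, 2, 1, 3, 1, 1, 2],
})

# Per year, idxmax returns the index label of the first row with the largest count.
top_rows = df.groupby("YearReleased")["count"].idxmax()

# Select those rows and keep only the two requested columns.
result = df.loc[top_rows, ["YearReleased", "Artist"]].reset_index(drop=True)
print(result)

# Variant: keep every row whose count equals its year's maximum (ties included).
is_max = df["count"] == df.groupby("YearReleased")["count"].transform("max")
with_ties = df.loc[is_max, ["YearReleased", "Artist"]]
```

Note that idxmax returns only the first occurrence of the maximum, so if two artists share the top count in a year, result holds just one of them while with_ties holds both. With the sample data there are no ties, so both give the same three artists.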

answered Oct 15 '22 17:10

BENY


Tags: python, pandas, dataframe, data-science
