Is there a way to grab the last item of a group?

Tags: python, pandas, dataframe

Say I have a DataFrame:

import pandas as pd

data = {'Column 1':     [ 1, 1, 2, 2, 2, 3, 4, 4, 4, 4], 
        'Column 2':     [ 1, 2, 1, 2, 3, 1, 1, 2, 3, 4], 
        'Column 3':     [ 1, 2, 1, 4, 3, 6, 1, 2, 7, 5]}

df = pd.DataFrame(data=data)

I want to grab rows 2, 5, 6 and 10 (counting from 1), because these are the last rows for each value in Column 1. Let's say Column 1 is an ID and Column 2 is a running count within that ID. I need to pick the row with the maximum Column 2 for each value in Column 1 and keep its Column 3 value, without breaking up the Column 2 and Column 3 pairs.

So I want to go from the full DataFrame above to only those four rows.

If I do

df.groupby(['Column 1']).max()

I do not get what I want, because it takes the maximum of Column 2 and Column 3 independently, so the Column 2 and Column 3 values no longer come from the same row.
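To make the mismatch concrete, here is a small check on the same data. It is only a sketch; the names `per_column_max` and `wanted_col3` are introduced for illustration and are not part of the original code.

```python
import pandas as pd

data = {'Column 1': [1, 1, 2, 2, 2, 3, 4, 4, 4, 4],
        'Column 2': [1, 2, 1, 2, 3, 1, 1, 2, 3, 4],
        'Column 3': [1, 2, 1, 4, 3, 6, 1, 2, 7, 5]}
df = pd.DataFrame(data)

# max() is evaluated column by column, so within a group the maximum of
# Column 2 and the maximum of Column 3 can come from different rows.
per_column_max = df.groupby('Column 1').max()

# Column 3 values that actually sit on the last row of each group.
wanted_col3 = [2, 3, 6, 5]

print(per_column_max['Column 3'].tolist())  # [2, 4, 6, 7]
print(wanted_col3)                          # [2, 3, 6, 5]
```

For IDs 2 and 4 the two lists differ, which is exactly the pairing problem described above.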

311

asked May 08 '20 16:05

nielsen

4 Answers

`groupby`/`tail`

df.groupby('Column 1').tail(1)

   Column 1  Column 2  Column 3
1         1         2         2
4         2         3         3
5         3         1         6
9         4         4         5
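`tail(1)` relies on the rows already being in the right order inside each group. If that cannot be assumed, one possible alternative (a sketch, not part of the original answer) is to look up the row holding the largest Column 2 per group with `idxmax`. The names `idx` and `picked` are illustrative.

```python
import pandas as pd

data = {'Column 1': [1, 1, 2, 2, 2, 3, 4, 4, 4, 4],
        'Column 2': [1, 2, 1, 2, 3, 1, 1, 2, 3, 4],
        'Column 3': [1, 2, 1, 4, 3, 6, 1, 2, 7, 5]}
df = pd.DataFrame(data)

# For each Column 1 group, the index label of the row with the largest Column 2.
# If a group has ties, idxmax returns the first such row.
idx = df.groupby('Column 1')['Column 2'].idxmax()

# Select whole rows by label, so Column 2 and Column 3 stay paired.
picked = df.loc[idx]
print(picked)
```

On the example data this selects the same rows (index 1, 4, 5 and 9) as the `tail(1)` call.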

100

answered Nov 14 '22 23:11

piRSquared

Use drop_duplicates

df_final = df.drop_duplicates('Column 1', keep='last')

Out[9]:
   Column 1  Column 2  Column 3
1         1         2         2
4         2         3         3
5         3         1         6
9         4         4         5
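`keep='last'` keeps whichever row appears last for each Column 1 value, so it depends on row order. A hedged sketch for data that is not already sorted: sort by both columns first, then drop duplicates. The shuffled frame and the name `df_last` are assumptions made for the demonstration.

```python
import pandas as pd

data = {'Column 1': [1, 1, 2, 2, 2, 3, 4, 4, 4, 4],
        'Column 2': [1, 2, 1, 2, 3, 1, 1, 2, 3, 4],
        'Column 3': [1, 2, 1, 4, 3, 6, 1, 2, 7, 5]}
df = pd.DataFrame(data)

# Shuffle the rows to simulate unsorted input (fixed seed for repeatability).
shuffled = df.sample(frac=1, random_state=0)

# Sort so the largest Column 2 is last within each Column 1 value,
# then keep only that last row per Column 1 value.
df_last = (shuffled
           .sort_values(['Column 1', 'Column 2'])
           .drop_duplicates('Column 1', keep='last'))
print(df_last)
```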

answered Nov 14 '22 21:11

Andy L.

Use GroupBy.nth:

In [198]: df.groupby('Column 1', as_index=False).nth([-1])    
Out[198]: 
   Column 1  Column 2  Column 3
1         1         2         2
4         2         3         3
5         3         1         6
9         4         4         5

answered Nov 14 '22 22:11

Mayank Porwal

If your DataFrame is ordered, so that Column 2 increases within each group and restarts at the beginning of the next one, we don't need groupby. We can use boolean indexing with Series.shift:

df_filtered = df.loc[~df['Column 2'].lt(df['Column 2'].shift(-1))]
print(df_filtered)
   Column 1  Column 2  Column 3
1         1         2         2
4         2         3         3
5         3         1         6
9         4         4         5
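A variation on the same idea, offered as a sketch rather than as the answer's own method: compare Column 1 with its next value instead of Column 2. This only assumes that rows with the same Column 1 value are contiguous. The name `is_last` is illustrative.

```python
import pandas as pd

data = {'Column 1': [1, 1, 2, 2, 2, 3, 4, 4, 4, 4],
        'Column 2': [1, 2, 1, 2, 3, 1, 1, 2, 3, 4],
        'Column 3': [1, 2, 1, 4, 3, 6, 1, 2, 7, 5]}
df = pd.DataFrame(data)

# A row is the last of its run when the next row has a different Column 1 value.
# shift(-1) leaves NaN on the final row, and ne() treats NaN as different,
# so the final row is kept as well.
is_last = df['Column 1'].ne(df['Column 1'].shift(-1))

df_filtered = df.loc[is_last]
print(df_filtered)
```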

answered Nov 14 '22 22:11

ansev
