Using Pandas, how do I drop the last row of each group?

Tags:

python

pandas

I have a dataframe as shown below:

import pandas as pd
df = pd.DataFrame({'A': ['one', 'one', 'two', 'three', 'three', 'one'], 'B': range(6)})
grouped = df.groupby('A')
print grouped.head()

             A  B
A                
one   0    one  0
      1    one  1
      5    one  5
three 3  three  3
      4  three  4
two   2    two  2

I can easily select the last rows of each group by doing:

print(grouped.agg(lambda x: x.iloc[-1]))

      B
A       
one    5
three  4
two    2

How can I drop the last row of each group instead? The result would be:

       A  B
0    one  0
1    one  1
3  three  3

I have tried filtering but it does not seem to do anything:

print grouped.filter(lambda x: x.iloc[-1])

       A  B
0    one  0
1    one  1
5    one  5
3  three  3
4  three  4
2    two  2

Thank you

452

asked Mar 26 '14 19:03

user3465658

1 Answers

How about:

>>> df.groupby("A", as_index=False).apply(lambda x: x.iloc[:-1])
       A  B
0    one  0
1    one  1
3  three  3

[3 rows x 2 columns]

answered Oct 05 '22 17:10

DSM

Related questions
                            
                                Sort List in Python by two other lists
                            
                                Killing child process when parent crashes in python
                            
                                matplotlib.pyplot.imshow: removing white space within plots when using attributes "sharex" and "sharey"
                            
                                Disable DTR in pyserial from code
                            
                                Python multiprocessing Process crashes silently
                            
                                Why is Ruby's Float#round behavior different than Python's?
                            
                                Python regex matching all but last occurrence
                            
                                Python subprocess: wait for command to finish before starting next one?
                            
                                Replace x with y or append y if no x
                            
                                Using pyserial to send binary data
                            
                                How to conjugate a verb in NLTK given POS tag?
                            
                                Mocking urllib2.urlopen().read() for different responses
                            
                                Python / Django multi-tenancy solution
                            
                                Does python have Matlab's `ans` variable that captures returned value not stored in any variable?
                            
                                In a gevent application, how can I kill all greenlets that have been started?
                            
                                getting seconds from numpy timedelta64
                            
                                Redis Queue + python-rq: Right pattern to prevent high memory usage?
                            
                                Python class method chaining
                            
                                using python WeakSet to enable a callback functionality
                            
                                Storing a dict with np.savez gives unexpected result?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With