I have the following large dataframe (<code>df</code>) that looks like this: <pre class="prettyprint"><code> ID date PRICE 1 10001 19920103 14.500 2 10001 19920106 14.500 3 10001 19920107 14.500 4 10002 19920108 15.125 5 10002 19920109 14.500 6 10002 19920110 14.500 7 10003 19920113 14.500 8 10003 19920114 14.500 9 10003 19920115 15.000 </code></pre> Question: What's the most efficient way to delete (or remove) the first row of each ID? I want this: <pre class="prettyprint"><code> ID date PRICE 2 10001 19920106 14.500 3 10001 19920107 14.500 5 10002 19920109 14.500 6 10002 19920110 14.500 8 10003 19920114 14.500 9 10003 19920115 15.000 </code></pre> I can do a loop over each unique <code>ID</code> and remove the first row but I believe this is not very efficient.

Another one line code is <code>df.groupby('ID').apply(lambda group: group.iloc[1:, 1:])</code> <pre class="prettyprint"><code>Out[100]: date PRICE ID 10001 2 19920106 14.5 3 19920107 14.5 10002 5 19920109 14.5 6 19920110 14.5 10003 8 19920114 14.5 9 19920115 15.0 </code></pre>

Python: Pandas - Delete the first row by group

Tags:

I have the following large dataframe (df) that looks like this:

    ID     date        PRICE        1   10001  19920103  14.500     2   10001  19920106  14.500     3   10001  19920107  14.500      4   10002  19920108  15.125      5   10002  19920109  14.500    6   10002  19920110  14.500     7   10003  19920113  14.500  8   10003  19920114  14.500      9   10003  19920115  15.000

Question: What's the most efficient way to delete (or remove) the first row of each ID? I want this:

        ID     date     PRICE            2   10001  19920106  14.500         3   10001  19920107  14.500          5   10002  19920109  14.500        6   10002  19920110  14.500         8   10003  19920114  14.500          9   10003  19920115  15.000

I can do a loop over each unique ID and remove the first row but I believe this is not very efficient.

815

asked Jul 05 '15 00:07

Plug4

1 Answers

Another one line code is df.groupby('ID').apply(lambda group: group.iloc[1:, 1:])

Out[100]:               date  PRICE ID                       10001 2  19920106   14.5       3  19920107   14.5 10002 5  19920109   14.5       6  19920110   14.5 10003 8  19920114   14.5       9  19920115   15.0

answered Oct 12 '22 18:10

Jianxun Li

Related questions
                            
                                Swift Switch case: Default will never be executed warning
                            
                                Why does next_permutation skip some permutations?
                            
                                SFSafariViewController crash: The specified URL has an unsupported scheme
                            
                                Swashbuckle Swagger - How to annotate content types?
                            
                                Sum operation on PySpark DataFrame giving TypeError when type is fine
                            
                                NodeJS - nodemon not restarting my server
                            
                                Difference in Azure Availability Sets and Scale Sets
                            
                                Accessing firebase.storage() with AngularFire2 (Angular2 rc.5)
                            
                                How to install xgboost in python on MacOS?
                            
                                Dart - NumberFormat
                            
                                MANIFEST.MF (The system cannot find the path specified)
                            
                                How to create MUI Dialog with transparent background color?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With