Pandas - Identify Last Row by Date

Tags:

python

pandas

group-by

shift

I'm trying to accomplish two things in my Pandas dataframe:

1. Create a new column LastRow ('Yes' or 'No') that marks the last row of each DateCompleted date.
2. Capture the next transaction's Sales on the current row, unless the next row starts a new DateCompleted date (in which case mark it as Null).

Original Dataset

        DateCompleted      TranNumber  Sales

    0   1/1/17 10:15AM     3133         130.31
    1   1/1/17 11:21AM     3531         103.12  
    2   1/1/17 12:31PM     3652         99.23  
    3   1/2/17 9:31AM      3689         83.22
    4   1/2/17 10:31AM     3701         29.93
    5   1/3/17 8:30AM      3709         31.31

Desired Output

        DateCompleted      TranNumber   Sales    NextTranSales  LastRow

    0   1/1/17 10:15AM     3133         130.31   103.12         No
    1   1/1/17 11:21AM     3531         103.12   99.23          No
    2   1/1/17 12:31PM     3652         99.23    NaN            Yes
    3   1/2/17 9:31AM      3689         83.22    29.93          No 
    4   1/2/17 10:31AM     3701         29.93    NaN            Yes
    5   1/3/17 8:30AM      3709         31.31    ...            No

I can get the NextTranSales based on:

 df['NextTranSales'] = df.Sales.shift(-1)

But I'm having trouble determining the last row in the DateCompleted group and marking NextTranSales as Null if it is the last row.

Thanks for your help!

asked Mar 24 '17 21:03

Walt Reed

2 Answers

If your data frame has been sorted by the DateCompleted column, then you might just need groupby.shift:

# Group rows by calendar date, then shift Sales up one row within each group
date = pd.to_datetime(df.DateCompleted).dt.date
df["NextTranSales"] = df.groupby(date).Sales.shift(-1)
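For anyone who wants to try this end to end, here is a minimal sketch that rebuilds the sample data from the question and applies the same `groupby.shift` idea. The explicit `format` string is my assumption about how the timestamps are written (month/day/two-digit year, 12-hour clock); adjust it if your data differs.

```python
import pandas as pd

# Sample data copied from the question
df = pd.DataFrame({
    "DateCompleted": ["1/1/17 10:15AM", "1/1/17 11:21AM", "1/1/17 12:31PM",
                      "1/2/17 9:31AM", "1/2/17 10:31AM", "1/3/17 8:30AM"],
    "TranNumber": [3133, 3531, 3652, 3689, 3701, 3709],
    "Sales": [130.31, 103.12, 99.23, 83.22, 29.93, 31.31],
})

# Assumption: timestamps look like "1/1/17 10:15AM"
stamps = pd.to_datetime(df["DateCompleted"], format="%m/%d/%y %I:%M%p")
date = stamps.dt.date

# Shift Sales up by one row within each calendar date;
# the last row of every date gets NaN because nothing follows it in its group
df["NextTranSales"] = df.groupby(date)["Sales"].shift(-1)
print(df)
```

Note that the shift only depends on the row order inside each date group, so the rows of a given date should already be in chronological order.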

If you need the LastRow column, you can find the index label of the last row in each date group with groupby, default the column to "No", and then assign "Yes" to those rows:

# Index label of the last row within each date group
last_row_index = df.groupby(date, as_index=False).apply(lambda g: g.index[-1])
df["LastRow"] = "No"  # default for every row
df.loc[last_row_index, "LastRow"] = "Yes"
df
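If you would rather avoid `apply`, one alternative (not part of the answer above, just a sketch) is `Series.duplicated(keep="last")`, which is False only for the final occurrence of each value. The small frame below is a hypothetical stand-in for the question's data with DateCompleted already parsed.

```python
import pandas as pd

# Stand-in for the question's data, with DateCompleted already parsed
df = pd.DataFrame({
    "DateCompleted": pd.to_datetime([
        "2017-01-01 10:15", "2017-01-01 11:21", "2017-01-01 12:31",
        "2017-01-02 09:31", "2017-01-02 10:31", "2017-01-03 08:30",
    ]),
    "Sales": [130.31, 103.12, 99.23, 83.22, 29.93, 31.31],
})

date = df["DateCompleted"].dt.date

# duplicated(keep="last") is False only for the final occurrence of each date,
# so negating it marks the last row of every date
is_last = ~date.duplicated(keep="last")
df["LastRow"] = is_last.map({True: "Yes", False: "No"})
print(df)
```

This does not look at Sales at all, so missing values in that column cannot affect the result.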

answered Sep 28 '22 14:09

Psidom

NOTE: This depends on Sales being free of NaN. If Sales contains any NaN, the row just before it will be wrongly marked as a last row, because I'm relying on the fact that the shifted column leaves a NaN in the last position of each group.

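# Assumes DateCompleted is already a datetime column (e.g. via pd.to_datetime)
d = df.DateCompleted.dt.date
m = {True: 'Yes', False: 'No'}
s = df.groupby(d).Sales.shift(-1)  # next Sales within the same date
df = df.assign(NextTranSales=s).assign(LastRow=s.isnull().map(m))
print(df)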

        DateCompleted  TranNumber   Sales  NextTranSales LastRow
0 2017-01-01 10:15:00        3133  130.31         103.12      No
1 2017-01-01 11:21:00        3531  103.12          99.23      No
2 2017-01-01 12:31:00        3652   99.23            NaN     Yes
3 2017-01-02 09:31:00        3689   83.22          29.93      No
4 2017-01-02 10:31:00        3701   29.93            NaN     Yes
5 2017-01-03 08:30:00        3709   31.31            NaN     Yes
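To make the NaN caveat concrete, here is a small sketch with a hypothetical gap in Sales (the NaN is invented for illustration and is not in the question's data). All three rows fall on the same date, so only the third should be flagged.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "DateCompleted": pd.to_datetime([
        "2017-01-01 10:15", "2017-01-01 11:21", "2017-01-01 12:31",
    ]),
    # Hypothetical gap: the second transaction has no recorded Sales value
    "Sales": [130.31, np.nan, 99.23],
})

d = df["DateCompleted"].dt.date
s = df.groupby(d)["Sales"].shift(-1)

# Row 0 is flagged because its *next* Sales value is NaN, not because it is last
flagged = s.isnull().map({True: "Yes", False: "No"})
print(flagged.tolist())
```

The first row comes back as 'Yes' even though two more transactions follow it on the same date, which is exactly the erroneous determination the note warns about.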

We can drop the no-NaN restriction by finding the last row of each group directly instead of inferring it from the shifted column:

d = df.DateCompleted.dt.date
s = df.groupby(d).Sales.shift(-1)
# tail(1) picks the last row of each date group, regardless of NaN in Sales
l = pd.Series(
    'Yes', df.groupby(d).tail(1).index
).reindex(df.index, fill_value='No')
df.assign(NextTranSales=s).assign(LastRow=l)

        DateCompleted  TranNumber   Sales  NextTranSales LastRow
0 2017-01-01 10:15:00        3133  130.31         103.12      No
1 2017-01-01 11:21:00        3531  103.12          99.23      No
2 2017-01-01 12:31:00        3652   99.23            NaN     Yes
3 2017-01-02 09:31:00        3689   83.22          29.93      No
4 2017-01-02 10:31:00        3701   29.93            NaN     Yes
5 2017-01-03 08:30:00        3709   31.31            NaN     Yes

answered Sep 28 '22 14:09

piRSquared
