opposite of df.diff() in pandas

Tags:

python

pandas

I have searched the forums for a cleaner way to create a new column in a DataFrame that is the sum of each row and an adjacent row (the next row, in my code below). This is the opposite of the .diff() function, which takes the difference.

This is how I'm currently solving the problem:

import pandas as pd

df = pd.DataFrame({'c': ['dd', 'ee', 'ff', 'gg', 'hh'], 'd': [1, 2, 3, 4, 5]})
df['e'] = df['d'].shift(-1)   # value of 'd' from the next row
df['f'] = df['d'] + df['e']   # current row plus next row

Your ideas are appreciated.

asked Jan 24 '18 17:01

MissBleu

1 Answer

You can use rolling with a window size of 2 and sum, then shift the result up by one row so that each row holds the sum of itself and the next row:

df['f'] = df['d'].rolling(2).sum().shift(-1)

    c  d    f
0  dd  1  3.0
1  ee  2  5.0
2  ff  3  7.0
3  gg  4  9.0
4  hh  5  NaN
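
As a quick check (a sketch, not part of the original answer), the rolling version can be compared with the explicit shift-and-add from the question. The variable names next_sum, next_sum_explicit and prev_sum are illustrative only. It also shows that dropping the final shift should give the sum with the previous row instead, which is the direction .diff() itself uses.

```python
import pandas as pd

df = pd.DataFrame({'c': ['dd', 'ee', 'ff', 'gg', 'hh'], 'd': [1, 2, 3, 4, 5]})

# Sum of each row and the NEXT row (the column 'f' shown above).
next_sum = df['d'].rolling(2).sum().shift(-1)

# The same values written as an explicit shift-and-add, as in the question.
next_sum_explicit = df['d'] + df['d'].shift(-1)

# Sum of each row and the PREVIOUS row: omit the shift,
# so the NaN lands on the first row rather than the last.
prev_sum = df['d'].rolling(2).sum()

print(next_sum.equals(next_sum_explicit))
print(prev_sum)
```

Which variant to use depends on whether the missing value should sit on the first or the last row.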

answered Oct 06 '22 18:10

Scott Boston
