Adding a column that's the result of the difference between consecutive rows in pandas

Tags: pandas, dataframe, series

Let's say I have a dataframe like this:

    A   B
0   a   b
1   c   d
2   e   f
3   g   h

Here 0, 1, 2, 3 are times; a, c, e, g is one time series and b, d, f, h is another. I need to add two columns to the original dataframe, obtained by computing the differences between consecutive rows of certain columns.

So I need something like this:

    A   B   dA
0   a   b  (a-c)
1   c   d  (c-e)
2   e   f  (e-g)
3   g   h   NaN

I saw that dataframes and series have a diff method, but it works slightly differently: the first element becomes NaN rather than the last.

202

asked Apr 17 '14 20:04

AMM

2 Answers

Use shift.

df['dA'] = df['A'] - df['A'].shift(-1)
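
For illustration, here is a minimal sketch of the shift approach applied to both columns. The numeric values are made up to stand in for the a..h placeholders in the question, and `dB` is an assumed name for the second difference column.

```python
import pandas as pd

# Hypothetical numeric data standing in for the a..h placeholders in the question.
df = pd.DataFrame({"A": [9, 4, 2, 1], "B": [12, 7, 5, 4]})

# shift(-1) moves every value up one row, so row i is lined up with row i+1.
# Subtracting gives (current row - next row); the last row has no successor,
# so its difference is NaN.
df["dA"] = df["A"] - df["A"].shift(-1)
df["dB"] = df["B"] - df["B"].shift(-1)  # "dB" is an assumed column name

print(df)
```

Because the last row gets NaN, the new columns come out as floats even when the inputs are integers.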

150

answered Oct 12 '22 12:10

exp1orer

You could use diff and pass -1 as the periods argument:

>>> import pandas as pd
>>> df = pd.DataFrame({"A": [9, 4, 2, 1], "B": [12, 7, 5, 4]})
>>> df["dA"] = df["A"].diff(-1)
>>> df
   A   B   dA
0  9  12  5.0
1  4   7  2.0
2  2   5  1.0
3  1   4  NaN
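
Since the question asks for two new columns, the same idea can be sketched for several columns at once. This is only one possible arrangement: the `d` prefix and the variable names `deltas`, `via_shift`, and `result` are assumptions for the example, not part of the original answer.

```python
import pandas as pd

df = pd.DataFrame({"A": [9, 4, 2, 1], "B": [12, 7, 5, 4]})

# DataFrame.diff(periods=-1) computes row[i] - row[i+1] for each selected column;
# add_prefix("d") renames the results to dA and dB (an assumed naming scheme).
deltas = df[["A", "B"]].diff(periods=-1).add_prefix("d")

# The explicit shift form gives the same values.
via_shift = (df["A"] - df["A"].shift(-1)).rename("dA")
same = deltas["dA"].equals(via_shift)

# For contrast, the default diff() is row[i] - row[i-1], so the FIRST row is NaN.
default_first_is_nan = pd.isna(df["A"].diff().iloc[0])

# Attach the difference columns to the original frame.
result = df.join(deltas)
print(result)
```

Here `diff(-1)` and `x - x.shift(-1)` are interchangeable; `diff` is just shorter when several columns need the same treatment.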

answered Oct 12 '22 11:10

DSM
