Add extra column as the cumulative time difference

Tags:

How to add an extra column that is the cumulative value of the time differences for each course? For example, the initial table is:

 id_A       course     weight                ts_A       value
 id1        cotton     3.5       2017-04-27 01:35:30  150.000000
 id1        cotton     3.5       2017-04-27 01:36:00  416.666667
 id1        cotton     3.5       2017-04-27 01:36:30  700.000000
 id1        cotton     3.5       2017-04-27 01:37:00  950.000000
 id2     cotton blue   5.0       2017-04-27 02:35:30  150.000000
 id2     cotton blue   5.0       2017-04-27 02:36:00  450.000000
 id2     cotton blue   5.0       2017-04-27 02:36:30  520.666667
 id2     cotton blue   5.0       2017-04-27 02:37:00  610.000000

The expected result is:

 id_A       course     weight                ts_A       value      cum_delta_sec
 id1        cotton     3.5       2017-04-27 01:35:30  150.000000      0
 id1        cotton     3.5       2017-04-27 01:36:00  416.666667      30 
 id1        cotton     3.5       2017-04-27 01:36:30  700.000000      60
 id1        cotton     3.5       2017-04-27 01:37:00  950.000000      90
 id2     cotton blue   5.0       2017-04-27 02:35:30  150.000000      0
 id2     cotton blue   5.0       2017-04-27 02:36:00  450.000000      30
 id2     cotton blue   5.0       2017-04-27 02:36:30  520.666667      60
 id2     cotton blue   5.0       2017-04-27 02:37:00  610.000000      90

619

asked Jul 20 '17 15:07

Carlo Allocca

1 Answers

You can chain the diff method with cumsum:

# convert ts_A to datetime type
df.ts_A = pd.to_datetime(df.ts_A)

# convert ts_A to seconds, group by id and then use transform to calculate the cumulative difference
df['cum_delta_sec'] = df.ts_A.astype(int).div(10**9).groupby(df.id_A).transform(lambda x: x.diff().fillna(0).cumsum())
df

enter image description here

102

answered Sep 19 '22 14:09

Psidom

Related questions
                            
                                neural network with multiple outputs in sklearn
                            
                                Can you install a Python package via R - Reticulate
                            
                                Python Wand and ImageMagick on AWS Lambda
                            
                                AttributeError: 'module' object has no attribute 'audio_fadein'
                            
                                Total count of objects in Django Model
                            
                                Keeping track of original indicies when sorting a list of lists by length
                            
                                How to ignore some unittest test in Pycharm 2017.1?
                            
                                Add @timestamp field in ElasticSearch with Python
                            
                                Pandas: Resample dataframe column, get discrete feature that corresponds to max value
                            
                                scipy -- how to integrate a linearly interpolated function?
                            
                                Run two different versions of chrome using selenium (Python)
                            
                                Get list of MySQL databases with python
                            
                                How does data shape change during Conv2D and Dense in Keras?
                            
                                Pandas Crosstabulation and counting
                            
                                pandas display: truncate column display rather than wrapping
                            
                                How to get all confusion matrix terminologies (TPR, FPR, TNR, FNR) for a multi class?
                            
                                Python, OpenCV -- Aligning and overlaying multiple images, one after another
                            
                                Python: Calculate sine/cosine with a precision of up to 1 million digits
                            
                                How can I adapt the autolabel function in matplotlib so that it displays negative values correctly?
                            
                                Find annual average of pandas dataframe with date column

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Add extra column as the cumulative time difference

Tags:

python

timestamp

pandas

dataframe

Carlo Allocca

People also ask

1 Answers

Psidom

Recent Activity

Donate For Us