Pandas - Subtract min date from max date for each group

Tags:

I want to add a column that is a result of subtraction of min date from max date for each customer_id to this table

Input:

action_date customer_id
 2017-08-15       1
 2017-08-21       1
 2017-08-21       1
 2017-09-02       1
 2017-08-28       2
 2017-09-29       2
 2017-10-15       3   
 2017-10-30       3
 2017-12-05       3

And get this table

Output:

action_date customer_id    diff
 2017-08-15       1         18
 2017-08-21       1         18
 2017-08-21       1         18
 2017-09-02       1         18
 2017-08-28       2         32
 2017-09-29       2         32
 2017-10-15       3         51
 2017-10-30       3         51
 2017-12-05       3         51

I tried this code, but it puts lots of NaN's

group = df.groupby(by='customer_id')
df['diff'] = (group['action_date'].max() - group['action_date'].min()).dt.days

268

asked Dec 27 '17 12:12

Superbman

1 Answers

you can use transform method:

In [23]: df['diff'] = df.groupby('customer_id') \
                        ['action_date'] \
                        .transform(lambda x: (x.max()-x.min()).days)

In [24]: df
Out[24]:
  action_date  customer_id  diff
0  2017-08-15            1    18
1  2017-08-21            1    18
2  2017-08-21            1    18
3  2017-09-02            1    18
4  2017-08-28            2    32
5  2017-09-29            2    32
6  2017-10-15            3    51
7  2017-10-30            3    51
8  2017-12-05            3    51

answered Sep 30 '22 11:09

MaxU - stop WAR against UA

Related questions
                            
                                mypy error: List or tuple literal expected as the second argument to namedtuple()
                            
                                Set Polygon Colors Matplotlib
                            
                                Is there a way to read Stata labels in python?
                            
                                Summing up more than two dataframes with the same indexes in Pandas
                            
                                NumPy sort function returns None
                            
                                ImportError: No module named 'matplotlib.externals'
                            
                                Unpack a List in to Indices of another list in python
                            
                                What is this error in Python tabula module?
                            
                                Remove non-duplicated rows from pandas
                            
                                Unable to load firefox in selenium webdriver in python
                            
                                Optimize computation of the "difference function"
                            
                                Pytz Timezone from UTC offset
                            
                                Use package from Github in Conda Virtual Environment
                            
                                Why do I get a Errno 1 Operation not permitted when the folder is created with full read/write permissions for everyone in Python?
                            
                                Error when parsing graph_def from string
                            
                                Python ssl.SSLError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:748)
                            
                                Why doesn't Python have a "__req__" (reflected equality) method?
                            
                                Python asyncio/aiohttp: ValueError: too many file descriptors in select() on Windows
                            
                                Efficient shifting based on date within groups in Pandas?
                            
                                cv2.imshow() giving black screen

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas - Subtract min date from max date for each group

Tags:

python

pandas

group-by

Superbman

People also ask

1 Answers

MaxU - stop WAR against UA

Recent Activity

Donate For Us