Mark every Nth row per group using pandas

Tags:

I have a Dataframe with customer info with their purchase details. I am trying to add a new columns that indicates every 3rd purchase done by the same customer.

Given below is the Dataframe

customer_name,bill_no,date
Mark,101,2018-10-01
Scott,102,2018-10-01
Pete,103,2018-10-02
Mark,104,2018-10-02
Mark,105,2018-10-04
Scott,106,2018-10-21
Julie,107,2018-10-03
Kevin,108,2018-10-07
Steve,109,2018-10-02
Mark,110,2018-10-06
Mark,111,2018-10-02
Mark,112,2018-10-05
Mark,113,2018-10-05

I am writing to filter every 3rd purchase done by the same customer. So in this case, I would like to add a flag for the below bill_no

Mark,105,2018-10-04
Mark,112,2018-10-05

Basically every multiple of 3 bill generated for the same customer.

727

asked Dec 17 '18 11:12

scott martin

1 Answers

Using groupby.cumcount:

n = 3
df['flag'] = df.groupby('customer_name').cumcount() + 1
df['flag'] = ((df['flag'] % n) == 0).astype(int)

print(df)
   customer_name  bill_no        date  flag
0           Mark      101  2018-10-01     0
1          Scott      102  2018-10-01     0
2           Pete      103  2018-10-02     0
3           Mark      104  2018-10-02     0
4           Mark      105  2018-10-04     1
5          Scott      106  2018-10-21     0
6          Julie      107  2018-10-03     0
7          Kevin      108  2018-10-07     0
8          Steve      109  2018-10-02     0
9           Mark      110  2018-10-06     0
10          Mark      111  2018-10-02     0
11          Mark      112  2018-10-05     1
12          Mark      113  2018-10-05     0

answered Sep 26 '22 22:09

Space Impact

Related questions
                            
                                Add multiple docs in yaml file | PyYAML
                            
                                Django datetime not validating right
                            
                                How to get quarter beginning date in python
                            
                                Insert a pandas dataframe into a SQLite table
                            
                                SQLAlchemy: filtering by a key in a JSON column
                            
                                Operating the Celery Worker in the ECS Fargate
                            
                                Python requests timeout not working properly
                            
                                best way to distribute keyword arguments?
                            
                                How to use pip with python 2 & 3 installed? (OSX)
                            
                                Python 3.7: Applying the proxy to all parts of pip installation, failing to maintain the proxy variable
                            
                                How to perform MultiLabel stratified sampling?
                            
                                How to execute Shell Script from Flask App [duplicate]
                            
                                "Type 'lxml.etree._ElementUnicodeResult' cannot be serialized"
                            
                                How do I accumulate a sequence of digits in a string and convert them to one number?
                            
                                Setting tick colors of matplotlib 3D plot
                            
                                Putting a python script into a docker container
                            
                                AWS lambda CLI 'update-function-code' does not update lambda_handler file
                            
                                How can I launch pyqt GUI multiple times consequtively in a process?
                            
                                How to normalize a non-normal distribution?
                            
                                DNNClassifier: 'DataFrame' object has no attribute 'dtype'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Mark every Nth row per group using pandas

Tags:

python

pandas

dataframe

group-by

pandas-groupby

scott martin

People also ask

1 Answers

Space Impact

Recent Activity

Donate For Us