How to add sequential counter column on groups using Pandas groupby

Tags:

python

pandas

I feel like there is a better way than this:

import pandas as pd df = pd.DataFrame(     columns="   index    c1    c2    v1 ".split(),     data= [             [       0,  "A",  "X",    3, ],             [       1,  "A",  "X",    5, ],             [       2,  "A",  "Y",    7, ],             [       3,  "A",  "Y",    1, ],             [       4,  "B",  "X",    3, ],             [       5,  "B",  "X",    1, ],             [       6,  "B",  "X",    3, ],             [       7,  "B",  "Y",    1, ],             [       8,  "C",  "X",    7, ],             [       9,  "C",  "Y",    4, ],             [      10,  "C",  "Y",    1, ],             [      11,  "C",  "Y",    6, ],]).set_index("index", drop=True) def callback(x):     x['seq'] = range(1, x.shape[0] + 1)     return x df = df.groupby(['c1', 'c2']).apply(callback) print df

To achieve this:

   c1 c2  v1  seq 0   A  X   3    1 1   A  X   5    2 2   A  Y   7    1 3   A  Y   1    2 4   B  X   3    1 5   B  X   1    2 6   B  X   3    3 7   B  Y   1    1 8   C  X   7    1 9   C  Y   4    1 10  C  Y   1    2 11  C  Y   6    3

Is there a way to do it that avoids the callback?

399

asked May 02 '14 19:05

Owen

2 Answers

use cumcount(), see docs here

In [4]: df.groupby(['c1', 'c2']).cumcount() Out[4]:  0     0 1     1 2     0 3     1 4     0 5     1 6     2 7     0 8     0 9     0 10    1 11    2 dtype: int64

If you want orderings starting at 1

In [5]: df.groupby(['c1', 'c2']).cumcount()+1 Out[5]:  0     1 1     2 2     1 3     2 4     1 5     2 6     3 7     1 8     1 9     1 10    2 11    3 dtype: int64

answered Sep 30 '22 07:09

Jeff

This might be useful

df = df.sort_values(['userID', 'date']) grp = df.groupby('userID')['ItemID'].aggregate(lambda x: '->'.join(tuple(x))).reset_index() print(grp)

it will create a sequence like this enter image description here

answered Sep 30 '22 08:09

Shaina Raza

Related questions
                            
                                Django - Rollback save with transaction atomic
                            
                                Automatically Generating Documentation for All Python Package Contents
                            
                                How do setuptools, distribute, and pip relate to one another?
                            
                                Pythonic way to iterate over a collections.Counter() instance in descending order?
                            
                                What is the difference between pandas agg and apply function?
                            
                                Python nose framework: How to stop execution upon first failure
                            
                                How did Python implement the built-in function pow()?
                            
                                Xpath like query for nested python dictionaries
                            
                                Replace string/value in entire DataFrame
                            
                                Attaching a process with pdb
                            
                                Automatically remove *.pyc files and otherwise-empty directories when I check out a new branch
                            
                                Sorting by arbitrary lambda
                            
                                How do I use TensorFlow GPU?
                            
                                Python ImportError cannot import urandom Since Ubuntu 12.04 upgrade
                            
                                how to get derived class name from base class
                            
                                Size of data type using NumPy
                            
                                What does a colon and comma stand in a python list?
                            
                                lxml etree xmlparser remove unwanted namespace
                            
                                join or merge with overwrite in pandas
                            
                                Cast base class to derived class python (or more pythonic way of extending classes)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With