I have a DataFrame like the following:
df = pd.DataFrame({'col1': ['a','b','c','c','d','e','a','h','i','a'],'col2':['3:00','3:00','4:00','4:00','3:00','5:00','5:00','3:00','3:00','2:00']})
df
Out[83]:
  col1  col2
0    a  3:00
1    b  3:00
2    c  4:00
3    c  4:00
4    d  3:00
5    e  5:00
6    a  5:00
7    h  3:00
8    i  3:00
9    a  2:00
What I'd like to do is group by 'col1' and assign a unique ID to each distinct value of 'col2' within each group, like this:
col1  col2  ID
   a  2:00   0
   a  3:00   1
   a  5:00   2
   b  3:00   0
   c  4:00   0
   c  4:00   0
...
I tried to use pd.Categorical but couldn't quite get to where I wanted to be.
We can use the pd.factorize() method inside a groupby-transform:
In [170]: df['ID'] = df.groupby('col1')['col2'].transform(lambda x: pd.factorize(x)[0])
In [171]: df
Out[171]:
  col1  col2  ID
0    a  3:00   0
1    b  3:00   0
2    c  4:00   0
3    c  4:00   0
4    d  3:00   0
5    e  5:00   0
6    a  5:00   1
7    h  3:00   0
8    i  3:00   0
9    a  2:00   2
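Note that pd.factorize() numbers values in order of first appearance within each group, so for group 'a' you get 3:00 → 0, 5:00 → 1, 2:00 → 2. If you instead want IDs that follow the sorted order of 'col2' (2:00 → 0, 3:00 → 1, 5:00 → 2, as in the desired output above), one possible sketch is a dense rank per group. This relies on the time strings sorting correctly as text ('2:00' < '3:00' < '5:00'), which holds here:

```python
import pandas as pd

df = pd.DataFrame({'col1': ['a','b','c','c','d','e','a','h','i','a'],
                   'col2': ['3:00','3:00','4:00','4:00','3:00','5:00','5:00','3:00','3:00','2:00']})

# Dense rank within each group: equal values share a rank and ranks are
# consecutive (1, 2, 3, ...); subtracting 1 makes the IDs start at 0.
df['ID'] = (df.groupby('col1')['col2']
              .transform(lambda s: s.rank(method='dense'))
              .astype(int) - 1)
```

Sorting the result by ['col1', 'col2'] then reproduces the desired table, with IDs increasing with the time within each group.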