Python Pandas: How can I group by and assign an id to all the items in a group?

Tags:

I have df:

domain           orgid
csyunshu.com    108299
dshu.com        108299
bbbdshu.com     108299
cwakwakmrg.com  121303
ckonkatsunet.com    121303

I would like to add a new column with replaces domain column with numeric ids per orgid:

domain           orgid   domainid
csyunshu.com    108299      1
dshu.com        108299      2
bbbdshu.com     108299      3
cwakwakmrg.com  121303      1
ckonkatsunet.com 121303     2

I have already tried this line but it does not give the result I want:

df.groupby('orgid').count['domain'].reset_index()

Can anybody help?

594

asked Mar 17 '16 14:03

UserYmY

1 Answers

You can call rank on the groupby object and pass param method='first':

In [61]:
df['domainId'] = df.groupby('orgid')['orgid'].rank(method='first')
df

Out[61]:
             domain   orgid  domainId
0      csyunshu.com  108299         1
1          dshu.com  108299         2
2       bbbdshu.com  108299         3
3    cwakwakmrg.com  121303         1
4  ckonkatsunet.com  121303         2

If you want to overwrite the column you can do:

df['domain'] = df.groupby('orgid')['orgid'].rank(method='first')

148

answered Nov 09 '22 09:11

EdChum

Related questions
                            
                                how to sum across many columns with pandas groupby?
                            
                                Is there a way to sandbox test execution with pytest, especially filesystem access?
                            
                                No module named Win32com.client error when using the pyttsx package
                            
                                Pyqt - What signal does my standard "Apply" button emit and how do I write the slot for it?
                            
                                No module named win32com
                            
                                CRSF Token Interfering With TDD - Is there a variable that stores csrf output?
                            
                                How to check if a docker instance is running?
                            
                                Python itertools: Best way to unpack product of product of list of lists
                            
                                Python Networkx detecting loops/circles
                            
                                python multiply two collection counters
                            
                                Control xaxis tick mark size on all subplots
                            
                                Remove multiple values from [list] dictionary python
                            
                                Locating table with no id or class attributes
                            
                                Django ignore extra arguments on constructing model
                            
                                how to get Python XMLGenerator to output CDATA
                            
                                Replacing characters from string one to string two
                            
                                django restframework :getting NotImplementedError
                            
                                Simple Python String (Backward) Slicing
                            
                                Elegant way to replace values in pandas.DataFrame from another DataFrame
                            
                                Generate all combinations of nucleotide k-mers between range(i, j)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Pandas: How can I group by and assign an id to all the items in a group?

Tags:

python

indexing

pandas

group-by

UserYmY

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us