Pandas Dataframe: how to add column with number of occurrences in other column

Tags:

I have to following df:

Col1    Col2
test    Something
test2   Something
test3   Something
test    Something
test2   Something
test5   Something

I want to get

Col1    Col2          Occur
test    Something     2
test2   Something     2
test3   Something     1
test    Something     2
test2   Something     2
test5   Something     1

I've tried to use:

df["Occur"] = df["Col1"].value_counts()

But it didn't help. I've got Occur column full of 'NaN'

416

asked May 06 '16 17:05

Laser

2 Answers

You can also use GroupBy + transform with size:

df['Occur'] = df.groupby('Col1')['Col1'].transform('size')

print(df)

    Col1       Col2  Occur
0   test  Something      2
1  test2  Something      2
2  test3  Something      1
3   test  Something      2
4  test2  Something      2
5  test5  Something      1

101

answered Oct 20 '22 16:10

jpp

groupby on 'col1' and then apply transform on Col2 to return a Series with its index aligned to the original df so you can add it as a column:

In [3]:
df['Occur'] = df.groupby('Col1')['Col2'].transform(pd.Series.value_counts)
df

Out[3]:
    Col1       Col2 Occur
0   test  Something     2
1  test2  Something     2
2  test3  Something     1
3   test  Something     2
4  test2  Something     2
5  test5  Something     1

answered Oct 20 '22 17:10

EdChum

Related questions
                            
                                read HDF5 file to pandas DataFrame with conditions
                            
                                How to make 'pip install' not uninstall other versions?
                            
                                Kivy properly set own icon
                            
                                What type signature do generators have in Python?
                            
                                Find substrings in PyMongo
                            
                                PyQt4: How to pause a Thread until a signal is emitted?
                            
                                Python BigQuery allowLargeResults with pandas.io.gbq
                            
                                'Unexpected Keyword Argument' in super().__init__()
                            
                                Sklearn SVM: SVR and SVC, getting the same prediction for every input
                            
                                How do I ADD accents to a letter? [closed]
                            
                                How to read index data as string with pandas.read_csv()?
                            
                                How to normalize only certain columns in scikit-learn?
                            
                                Convert mask (boolean) array to list of x,y coordinates
                            
                                Chunking bytes (not strings) in Python 2 and 3
                            
                                TK Framework double implementation issue
                            
                                Python Pandas Distance matrix using jaccard similarity
                            
                                Alpine 3.3, Python 2.7.11, urllib2 causing SSL: CERTIFICATE_VERIFY_FAILED
                            
                                Pyinstaller Jinja2 TemplateNotFound
                            
                                Is there a way to download a video from a webpage with python?
                            
                                Spark reading python3 pickle as input

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas Dataframe: how to add column with number of occurrences in other column

Tags:

python

pandas

pandas-groupby

Laser

People also ask

2 Answers

jpp

EdChum

Recent Activity

Donate For Us