I want to count number of times each values is appearing in dataframe. Here is my dataframe - <code>df</code>: <pre class="prettyprint"><code> status 1 N 2 N 3 C 4 N 5 S 6 N 7 N 8 S 9 N 10 N 11 N 12 S 13 N 14 C 15 N 16 N 17 N 18 N 19 S 20 N </code></pre> I want to dictionary of counts: ex. <code>counts = {N: 14, C:2, S:4}</code> I have tried <code>df['status']['N']</code> but it gives <code>keyError</code> and also <code>df['status'].value_counts</code> but no use.

You can use <code>value_counts</code> and <code>to_dict</code>: <pre class="prettyprint"><code>print df['status'].value_counts() N 14 S 4 C 2 Name: status, dtype: int64 counts = df['status'].value_counts().to_dict() print counts {'S': 4, 'C': 2, 'N': 14} </code></pre>

An alternative one liner using underdog <code>Counter</code>: <pre class="prettyprint"><code>In [3]: from collections import Counter In [4]: dict(Counter(df.status)) Out[4]: {'C': 2, 'N': 14, 'S': 4} </code></pre>

Count frequency of values in pandas DataFrame column

Tags:

python

pandas

dataframe

django

I want to count number of times each values is appearing in dataframe.

Here is my dataframe - df:

    status 1     N 2     N 3     C 4     N 5     S 6     N 7     N 8     S 9     N 10    N 11    N 12    S 13    N 14    C 15    N 16    N 17    N 18    N 19    S 20    N

I want to dictionary of counts:

ex. counts = {N: 14, C:2, S:4}

I have tried df['status']['N'] but it gives keyError and also df['status'].value_counts but no use.

523

asked Mar 15 '16 07:03

Kishan Mehta

2 Answers

You can use value_counts and to_dict:

print df['status'].value_counts() N    14 S     4 C     2 Name: status, dtype: int64  counts = df['status'].value_counts().to_dict() print counts {'S': 4, 'C': 2, 'N': 14}

169

answered Oct 11 '22 21:10

jezrael

An alternative one liner using underdog Counter:

In [3]: from collections import Counter  In [4]: dict(Counter(df.status)) Out[4]: {'C': 2, 'N': 14, 'S': 4}

answered Oct 11 '22 20:10

Colonel Beauvel

Related questions
                            
                                Plotly chart not showing in Jupyter notebook
                            
                                What is the pythonic way to count the leading spaces in a string?
                            
                                running multiple bash commands with subprocess
                            
                                How to calculate correlation between all columns and remove highly correlated ones using pandas?
                            
                                Get all the diagonals in a matrix/list of lists in Python
                            
                                how to pass parameters of a function when using timeit.Timer()
                            
                                Keep certain columns in a pandas DataFrame, deleting everything else
                            
                                printing bit representation of numbers in python
                            
                                sublime text2 python error message /usr/bin/python: can't find '__main__' module in ''
                            
                                Running get_dummies on several DataFrame columns?
                            
                                Mocking a global variable
                            
                                MATLAB-style find() function in Python
                            
                                merging several python dictionaries
                            
                                Multiple parameters in Flask approute
                            
                                How to check if all values of a dictionary are 0
                            
                                TensorFlow - Importing data from a TensorBoard TFEvent file?
                            
                                How do I send empty response in Django without templates
                            
                                AttributeError: 'Series' object has no attribute 'reshape'
                            
                                Create file path from variables
                            
                                Iterating over Numpy matrix rows to apply a function each?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With