I have a pandas data frame I want to count how often a number appears in a column for each column <pre class="prettyprint"><code> a b c d e 0 2 3 1 5 4 1 1 3 2 5 4 2 1 3 2 5 4 3 2 4 1 5 3 4 2 4 1 5 3 </code></pre> This is my code that does not work <pre class="prettyprint"><code>def equalsOne(x): x[x.columns == 1].sum() df1.apply(equalOne(), axis = 1) </code></pre> Here is the desired output <pre class="prettyprint"><code>a 2 b 0 c 3 d 0 e 0 </code></pre>

This should do the trick <pre class="prettyprint"><code>df1[df1 == 1].count() </code></pre>

count occurrences of number by column in pandas data frame

Tags:

python

pandas

I have a pandas data frame I want to count how often a number appears in a column for each column

     a   b   c   d   e
0    2   3   1   5   4
1    1   3   2   5   4
2    1   3   2   5   4
3    2   4   1   5   3
4    2   4   1   5   3

This is my code that does not work

def equalsOne(x):
    x[x.columns == 1].sum()

df1.apply(equalOne(), axis = 1)

Here is the desired output

a 2
b 0
c 3
d 0
e 0

534

asked Nov 26 '14 03:11

Erich

2 Answers

You can do:

(df==1).sum()

df==1 gives:

       a      b      c      d      e
0  False  False   True  False  False
1   True  False  False  False  False
2   True  False  False  False  False
3  False  False   True  False  False
4  False  False   True  False  False

and the sum() treats False as 0 and True as 1.

190

answered Oct 05 '22 08:10

Daniel

This should do the trick

df1[df1 == 1].count()

answered Oct 05 '22 08:10

Bob Haffner

Related questions
                            
                                Replace fieldnames when using DictReader
                            
                                how to use Google Shortener API with Python
                            
                                Process data, much larger than physical memory, in chunks
                            
                                Text-Replace in docx and save the changed file with python-docx
                            
                                Choosing random integers except for a particular number for python?
                            
                                Python bottle vs uwsgi/bottle vs nginx/uwsgi/bottle
                            
                                best way to add additional fields to django-rest-framework ModelViewSet when create
                            
                                Run separate processes in parallel - Python
                            
                                Bisect a Python List and finding the Index
                            
                                Pytest init setup for few modules
                            
                                Reverse diagonal on numpy python
                            
                                Concise Ruby hash equivalent of Python dict.get()
                            
                                Python local vs global variables
                            
                                Get a header with Python and convert in JSON (requests - urllib2 - json)
                            
                                How to create a modal window in pyqt?
                            
                                How can I check whether a URL is valid using `urlparse`?
                            
                                Sorting XML in python etree
                            
                                Rotate a 2D image around specified origin in Python
                            
                                Python Multiprocessing: Only one process is running
                            
                                What's the Pythonic way to report nonfatal errors in a parser?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With