Counting the amount of times a boolean goes from True to False in a column

Tags: python, pandas, series

I have a column in a dataframe which is filled with booleans, and I want to count how many times it changes from True to False.

I can do this by converting the booleans to 1s and 0s, then using df.diff and dividing the result by 2.

import pandas as pd

d = {'Col1': [True, True, True, False, False, False, True, True, True, True, False, False, False, True, True, False, False, True]}
df = pd.DataFrame(data=d)
print(df)

     Col1
0    True
1    True
2    True
3   False
4   False
5   False
6    True
7    True
8    True
9    True
10  False
11  False
12  False
13   True
14   True
15  False
16  False
17   True

My expected outcome would be 3: the number of times the column goes from True to False.


asked Jan 16 '19 15:01

Martijn van Amsterdam

2 Answers

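You can perform a bitwise AND of the negated Col1 with a mask indicating where the value changes between successive rows. A row that is False and differs from the previous row is exactly a True to False step:

(~df.Col1 & (df.Col1 != df.Col1.shift(1))).sum()
3

The mask is obtained by comparing Col1 with a shifted version of itself (Series.shift):

df.Col1 != df.Col1.shift(1)

0      True
1     False
2     False
3      True
4     False
5     False
6      True
7     False
8     False
9     False
10     True
11    False
12    False
13     True
14    False
15     True
16    False
17     True
Name: Col1, dtype: bool

The first row is always True in the mask, because shift leaves a NaN there and NaN != x evaluates to True. That is harmless here since Col1 starts with True, but a column starting with False would have its first row counted as well.

For multiple columns, you can do exactly the same (here I tested with a Col2 identical to Col1):

(~df & (df != df.shift(1))).sum()

Col1    3
Col2    3
dtype: int64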
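As a self-contained check, the shift-based idea can be wrapped in a small helper. This is only a sketch: the function name count_true_to_false is made up for illustration, and it assumes a pandas version whose shift accepts fill_value, which is used here so that the first row is never counted as a transition.

```python
import pandas as pd

def count_true_to_false(s: pd.Series) -> int:
    """Count rows that are False while the previous row was True."""
    # Previous-row values; the first row has no predecessor, so treat it as False
    prev = s.shift(1, fill_value=False)
    # A transition is "previous row True AND current row False"
    return int((prev & ~s).sum())

col = pd.Series([True, True, True, False, False, False, True, True, True,
                 True, False, False, False, True, True, False, False, True])
print(count_true_to_false(col))  # 3
```

Because the missing predecessor is filled with False, a column that starts with False does not get an extra count.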


answered Sep 21 '22 08:09

yatu

Notice that subtracting True (1) from False (0) in integer terms gives -1:

res = df['Col1'].astype(int).diff().eq(-1).sum()  # 3

To apply this across a Boolean dataframe, the same chain gives a series mapping each column label to its count:

res = df.astype(int).diff().eq(-1).sum()
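To see why -1 marks a True to False step, it may help to look at the intermediate values on a short series. The snippet below is just an illustration with made-up data, not the dataframe from the question:

```python
import pandas as pd

s = pd.Series([True, False, False, True, False])

# As integers the series is 1, 0, 0, 1, 0.
# diff() gives NaN for the first row, then current minus previous:
# NaN, -1.0, 0.0, 1.0, -1.0
steps = s.astype(int).diff()
print(steps.tolist())

# -1 means the value dropped from 1 (True) to 0 (False)
print(int(steps.eq(-1).sum()))  # 2

# +1 marks the opposite direction, False to True
print(int(steps.eq(1).sum()))   # 1
```

The NaN in the first row never equals -1, so the first row cannot be miscounted with this approach.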

answered Sep 19 '22 08:09

jpp
