Check if string is in a pandas dataframe

Tags:

python

pandas

I would like to see if a particular string exists in a particular column within my dataframe.

I'm getting the error

ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().


    import pandas as pd

    BabyDataSet = [('Bob', 968), ('Jessica', 155), ('Mary', 77), ('John', 578), ('Mel', 973)]

    a = pd.DataFrame(data=BabyDataSet, columns=['Names', 'Births'])

    if a['Names'].str.contains('Mel'):
        print("Mel is there")


asked Jun 19 '15 18:06

user2242044

2 Answers

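a['Names'].str.contains('Mel') returns a boolean Series with one True/False value per row (len(BabyDataSet) values in total), so it cannot be used directly as an if condition.

To count the matches, sum the booleans and test the result: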


    mel_count = a['Names'].str.contains('Mel').sum()
    if mel_count > 0:
        print("There are {m} Mels".format(m=mel_count))

Or use any() if you don't care how many records match your query:


    if a['Names'].str.contains('Mel').any():
        print("Mel is there")
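Note that str.contains does substring matching and treats the pattern as a regular expression by default, so 'Mel' would also match a name such as 'Melanie'. Here is a minimal sketch, assuming you may want an exact match or have missing values in the column. The extra rows ('Melanie' and a missing name) are hypothetical additions for illustration and are not part of the original data.

```python
import pandas as pd

# Hypothetical data: the original rows plus 'Melanie' and a missing name.
a = pd.DataFrame(
    data=[('Bob', 968), ('Jessica', 155), ('Mary', 77),
          ('John', 578), ('Mel', 973), ('Melanie', 12), (None, 5)],
    columns=['Names', 'Births'],
)

# Substring match: na=False treats missing names as non-matches,
# regex=False treats 'Mel' as a literal string rather than a pattern.
substring_hits = a['Names'].str.contains('Mel', na=False, regex=False)
substring_count = int(substring_hits.sum())

# Exact match: compare the whole value instead of searching inside it.
exact_hits = a['Names'] == 'Mel'
exact_count = int(exact_hits.sum())

if exact_hits.any():
    print("Mel is there ({n} exact, {s} substring matches)".format(
        n=exact_count, s=substring_count))
```

Which variant fits depends on whether partial matches should count. The any() reduction works the same way on either boolean Series.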

answered Oct 20 '22 01:10

Uri Goren

You should use any()


    In [98]: a['Names'].str.contains('Mel').any()
    Out[98]: True

    In [99]: if a['Names'].str.contains('Mel').any():
       ....:     print("Mel is there")
       ....:
    Mel is there

a['Names'].str.contains('Mel') gives you a Series of bool values, one per row:


    In [100]: a['Names'].str.contains('Mel')
    Out[100]:
    0    False
    1    False
    2    False
    3    False
    4     True
    Name: Names, dtype: bool
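The error in the question comes from handing that whole Series to if, which needs a single True/False. Here is a short sketch, reusing the question's data, of how the ambiguity shows up and how the same boolean Series can serve as a mask to pull out the matching rows. The names mask, raised, found and matches are illustrative only.

```python
import pandas as pd

BabyDataSet = [('Bob', 968), ('Jessica', 155), ('Mary', 77),
               ('John', 578), ('Mel', 973)]
a = pd.DataFrame(data=BabyDataSet, columns=['Names', 'Births'])

mask = a['Names'].str.contains('Mel')  # one boolean per row

# A multi-element Series has no single truth value, so bool() raises.
try:
    bool(mask)
    raised = False
except ValueError:
    raised = True

# Reduce the mask to one boolean, or use it to select the matching rows.
found = bool(mask.any())
matches = a[mask]

print(raised, found)
print(matches)
```

Indexing the DataFrame with the mask is handy when you need the matching records themselves and not just a yes/no answer.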

answered Oct 20 '22 01:10

Zero
