How to check if character exists in DataFrame cell

Tags: python, pandas, dataframe

After creating a three-row DataFrame:

import pandas as pd
df = pd.DataFrame({'a': ['1-2', '3-4', '5-6']})

I check if there is any cell equal to '3-4':

df['a'] == '3-4'

Since df['a'] == '3-4' returns a boolean pandas.core.series.Series object, I can use it to create a "filtered" version of the original DataFrame like so:

filtered = df[df['a'] == '3-4']

In plain Python I can check whether a character occurs in a string using:

string_value = '3-4'
print('-' in string_value)

What would be a way to accomplish the same while working with DataFrames?

I would like to create a filtered version of the original DataFrame by checking whether the '-' character occurs in each row's cell, like:

filtered = df['-' in df['a']]

But the expression above does not work and raises KeyError: False.

asked Sep 02 '16 19:09

alphanumeric

1 Answer

Use the .str accessor with contains, which tests each cell and returns a boolean Series:

In [5]: df['a'].str.contains('-')
Out[5]: 
0    True
1    True
2    True
Name: a, dtype: bool
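
That boolean Series can be used as a mask in the same way as df['a'] == '3-4' in the question. The following is a minimal sketch rather than a definitive recipe: the extra row '78' is added here purely for illustration (it is not in the original data), and regex=False is an optional choice so that the pattern is treated as a literal substring. It also shows why the attempt in the question fails: as far as I understand it, the in operator on a Series tests the index labels rather than the values, so it yields a single False, and df[False] then looks for a column named False.

```python
import pandas as pd

# Original three rows plus one hypothetical row without a hyphen,
# added only so the filter has something to exclude.
df = pd.DataFrame({'a': ['1-2', '3-4', '5-6', '78']})

# `in` on a Series checks the index labels (0, 1, 2, 3), not the values,
# so this is a single boolean instead of a per-row mask.
in_index = '-' in df['a']
print(in_index)

# Per-row test: True where the cell contains a literal '-'.
mask = df['a'].str.contains('-', regex=False)

# Boolean indexing keeps only the rows where the mask is True.
filtered = df[mask]
print(filtered)
```

If the column may hold missing values, passing na=False to contains treats them as non-matches, which keeps the mask purely boolean.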

answered Sep 30 '22 16:09

juanpa.arrivillaga
