I'm trying to drop all rows from this df where column 'DB Serial' contains the character *: <pre class="prettyprint"><code> DB Serial 0 13058 1 13069 2 *13070 3 13070 4 13044 5 13042 </code></pre> I am using: <pre class="prettyprint"><code>df = df[~df['DB Serial'].str.contains('*')] </code></pre> but i get this error: <pre class="prettyprint"><code> raise error, v # invalid expression error: nothing to repeat </code></pre>

Escape <code>*</code> by <code>\</code> because <code>*</code> is interpreted as regex: <blockquote> '*' Causes the resulting RE to match 0 or more repetitions of the preceding RE </blockquote> <pre class="prettyprint"><code>df = df[~df['DB Serial'].str.contains('\*')] print (df) DB Serial 0 13058 1 13069 3 13070 4 13044 5 13042 </code></pre> If also get: <blockquote> TypeError: bad operand type for unary ~: 'float' </blockquote> then cast column to <code>string</code>, because mixed values - numeric with strings: <pre class="prettyprint"><code>df = df[~df['DB Serial'].astype(str).str.contains('\*')] print (df) DB Serial 0 13058 1 13069 3 13070 4 13044 5 13042 </code></pre> If possible <code>NaN</code>s values: <pre class="prettyprint"><code>df = df[~df['DB Serial'].str.contains('\*', na=False)] </code></pre>

Pandas drop rows where column contains *

Tags:

python

pandas

I'm trying to drop all rows from this df where column 'DB Serial' contains the character *:

    DB Serial
0     13058
1     13069
2    *13070
3     13070
4     13044
5     13042

I am using:

df = df[~df['DB Serial'].str.contains('*')]

but i get this error:

    raise error, v # invalid expression
error: nothing to repeat

905

asked Apr 23 '17 08:04

warrenfitzhenry

1 Answers

Escape * by \ because * is interpreted as regex:

'*' Causes the resulting RE to match 0 or more repetitions of the preceding RE

df = df[~df['DB Serial'].str.contains('\*')]
print (df)
  DB Serial
0     13058
1     13069
3     13070
4     13044
5     13042

If also get:

TypeError: bad operand type for unary ~: 'float'

then cast column to string, because mixed values - numeric with strings:

df = df[~df['DB Serial'].astype(str).str.contains('\*')]
print (df)
  DB Serial
0     13058
1     13069
3     13070
4     13044
5     13042

If possible NaNs values:

df = df[~df['DB Serial'].str.contains('\*', na=False)]

145

answered Sep 28 '22 16:09

jezrael

Related questions
                            
                                xlsxwriter not applying format to header row of dataframe - Python Pandas
                            
                                Is it possible to subclass DataFrame in Pyspark?
                            
                                IndexError: tuple index out of range when parsing method arguments
                            
                                keeping track of indices change in numpy.reshape
                            
                                What is row slicing vs What is column slicing?
                            
                                pandas groupby two columns and summarize by mean
                            
                                How to add CSS class to widget/field with Django 1.11 template-based form rendering
                            
                                How to create an infinite iterator to generate an incrementing alphabet pattern?
                            
                                How to list all classification/regression/clustering algorithms in scikit-learn?
                            
                                Pdfkit OSError: No wkhtmltopdf executable found
                            
                                Python & MS Word: Convert .doc to .docx?
                            
                                How to make auto indention in nano while programming in python in linux?
                            
                                python - opencv morphologyEx remove specific color
                            
                                Download all blobs files locally from azure container using python
                            
                                Check if a string defines a color
                            
                                Find EC2 Instances belonging to specific Target Group with Boto3
                            
                                Adding a SearchVectorField to a model in Django
                            
                                Update and append new rows based on index value python
                            
                                Add multiple columns to DataFrame and set them equal to an existing column
                            
                                How can I use curses with PyCharm?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With