FutureWarning: The default value of regex will change from True to False in a future version

I'm running the code below to clean text:

import pandas as pd

def not_regex(pattern):
    return r"((?!{}).)".format(pattern)

tmp = pd.DataFrame(['No one has a European accent either @',
                    'That the kid   reminds me of Kevin'])

tmp[0].str.replace(not_regex('(\\b[-/]\\b|[a-zA-Z0-9])'), ' ')

It then emits this warning:

<ipython-input-8-ef8a43f91dbd>:9: FutureWarning: The default value of regex will change from True to False in a future version.
  tmp[0].str.replace(not_regex('(\\b[-/]\\b|[a-zA-Z0-9])'), ' ')

Could you please explain the reason for this warning?

asked Mar 12 '21 16:03

Akira

2 Answers

See Pandas 1.2.0 release notes:

The default value of regex for Series.str.replace() will change from True to False in a future release. In addition, single character regular expressions will not be treated as literal strings when regex=True is set (GH24804)

That is, pass regex=True explicitly when the pattern is a regular expression:

dframe['colname'] = dframe['colname'].str.replace(r'\D+', '', regex=True)
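
To see why the default matters, here is a minimal sketch of how the same pattern behaves under each setting. The Series contents and variable names are made up for illustration and are not from the question.

```python
import pandas as pd

# Illustrative data (an assumption, not taken from the question).
s = pd.Series(["a.c", "abc"])

# regex=True: "." is a metacharacter that matches any character,
# so the pattern "a.c" matches both "a.c" and "abc".
as_regex = s.str.replace("a.c", "X", regex=True)

# regex=False: "a.c" is treated as a literal substring,
# so only the string that actually contains "a.c" is changed.
as_literal = s.str.replace("a.c", "X", regex=False)

print(as_regex.tolist())
print(as_literal.tolist())
```

Because the result differs depending on the flag, pandas warns when the call leaves it unspecified. For the call in the question, adding the keyword, as in tmp[0].str.replace(not_regex('(\\b[-/]\\b|[a-zA-Z0-9])'), ' ', regex=True), should silence the warning and keep the current behaviour.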

answered Oct 16 '22 15:10

Ryszard Czech

I have a column like this:

df.Experience.head(5)

0    24 years experience
1    12 years experience
2     9 years experience
3    12 years experience
4    20 years experience
Name: Experience, dtype: object

I strip the non-digit characters and convert the result to int:

df['Experience']=df['Experience'].str.replace(r'\D+','', regex=True).astype(int)

The result:

df.Experience.head(5)

0    24
1    12
2     9
3    12
4    20
Name: Experience, dtype: int64
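
For anyone who wants to run this end to end, here is a self-contained sketch. The DataFrame is rebuilt by hand from the first few rows printed above, so treat the construction as an assumption about how the real data looks.

```python
import pandas as pd

# Hypothetical frame mirroring the rows shown in the output above.
df = pd.DataFrame(
    {"Experience": ["24 years experience",
                    "12 years experience",
                    "9 years experience"]}
)

# r"\D+" matches one or more non-digit characters; regex=True states
# explicitly that the pattern is a regular expression, which avoids
# the FutureWarning.
df["Experience"] = (
    df["Experience"].str.replace(r"\D+", "", regex=True).astype(int)
)

print(df["Experience"].tolist())
```

Note that astype(int) raises a ValueError if a row contains no digits at all, because the replacement leaves an empty string. In that case pd.to_numeric(..., errors="coerce") may be a safer conversion.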

answered Oct 16 '22 16:10

PlutoSenthil

Tags: python, string, regex, python-3.x, pandas
