Can a regular expression match whitespace or the start of a string? I'm trying to replace currency the abbreviation GBP with a £ symbol. I could just match anything starting GBP, but I'd like to be a bit more conservative, and look for certain delimiters around it. <pre class="prettyprint"><code>>>> import re >>> text = u'GBP 5 Off when you spend GBP75.00' >>> re.sub(ur'GBP([\W\d])', ur'£\g<1>', text) # matches GBP with any prefix u'\xa3 5 Off when you spend \xa375.00' >>> re.sub(ur'^GBP([\W\d])', ur'£\g<1>', text) # matches at start only u'\xa3 5 Off when you spend GBP75.00' >>> re.sub(ur'(\W)GBP([\W\d])', ur'\g<1>£\g<2>', text) # matches whitespace prefix only u'GBP 5 Off when you spend \xa375.00' </code></pre> Can I do both of the latter examples at the same time?

Use the OR "<code>|</code>" operator: <pre class="prettyprint"><code>>>> re.sub(r'(^|\W)GBP([\W\d])', u'\g<1>£\g<2>', text) u'\xa3 5 Off when you spend \xa375.00' </code></pre>

Regular expression: match start or whitespace

Tags:

python

regex

Can a regular expression match whitespace or the start of a string?

I'm trying to replace currency the abbreviation GBP with a £ symbol. I could just match anything starting GBP, but I'd like to be a bit more conservative, and look for certain delimiters around it.

>>> import re >>> text = u'GBP 5 Off when you spend GBP75.00'  >>> re.sub(ur'GBP([\W\d])', ur'£\g<1>', text) # matches GBP with any prefix u'\xa3 5 Off when you spend \xa375.00'  >>> re.sub(ur'^GBP([\W\d])', ur'£\g<1>', text) # matches at start only u'\xa3 5 Off when you spend GBP75.00'  >>> re.sub(ur'(\W)GBP([\W\d])', ur'\g<1>£\g<2>', text) # matches whitespace prefix only u'GBP 5 Off when you spend \xa375.00'

Can I do both of the latter examples at the same time?

469

asked Feb 08 '09 12:02

Mat

1 Answers

Use the OR "|" operator:

>>> re.sub(r'(^|\W)GBP([\W\d])', u'\g<1>£\g<2>', text) u'\xa3 5 Off when you spend \xa375.00'

134

answered Sep 27 '22 18:09

Zach Scrivena

Related questions
                            
                                Adding calculated column(s) to a dataframe in pandas
                            
                                Concatenate rows of two dataframes in pandas
                            
                                Why is it possible to replace sometimes set() with {}?
                            
                                Python: skip comment lines marked with # in csv.DictReader
                            
                                'Can't set attribute' with new-style properties in Python
                            
                                What exactly is a "raw string regex" and how can you use it?
                            
                                Why does Python's __import__ require fromlist?
                            
                                Why are NumPy arrays so fast?
                            
                                Using Django database layer outside of Django?
                            
                                Could not find library geos_c or load any of its variants
                            
                                How to create a fix size list in python?
                            
                                WTForms: Install 'email_validator' for email validation support
                            
                                How to read datetime back from sqlite as a datetime instead of string in Python?
                            
                                Concatenate two NumPy arrays vertically
                            
                                Selenium Webdriver finding an element in a sub-element
                            
                                Python, TypeError: unhashable type: 'list'
                            
                                Pandas Plotting with Multi-Index
                            
                                What does the c underscore expression `c_` do exactly?
                            
                                How do I run tox in a project that has no setup.py?
                            
                                Slice Pandas dataframe by index values that are (not) in a list

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With