Can anyone tell me what does "\1" mean in the following regular expression in Python? <pre class="prettyprint"><code>re.sub(r'(\b[a-z]+) \1', r'\1', 'cat in the the hat') </code></pre>

<code>\1</code> is equivalent to <code>re.search(...).group(1)</code>, the first parentheses-delimited expression inside of the regex. It's also, fun fact, part of the reason that regular expressions are significantly slower in Python and other programming languages than required to be by CS theory.

python regular expression "\1"

Tags:

python

regex

Can anyone tell me what does "\1" mean in the following regular expression in Python?

re.sub(r'(\b[a-z]+) \1', r'\1', 'cat in the the hat')

324

asked Dec 27 '13 14:12

Mengwen

2 Answers

\1 is equivalent to re.search(...).group(1), the first parentheses-delimited expression inside of the regex.

It's also, fun fact, part of the reason that regular expressions are significantly slower in Python and other programming languages than required to be by CS theory.

answered Sep 19 '22 03:09

Patrick Collins

The first \1 means the first group - i.e. the first bracketed expression (\b[a-z]+)

From the docs \number

"Matches the contents of the group of the same number. Groups are numbered starting from 1. For example, (.+) \1 matches 'the the' or '55 55', but not 'thethe' (note the space after the group)"

In your case it is looking for a repeated "word" (well, block of lower case letters).

The second \1 is the replacement to use in case of a match, so a repeated word will be replaced by a single word.

answered Sep 23 '22 03:09

doctorlove

Related questions
                            
                                Python: min(None, x)
                            
                                Generate a sequence of numbers in Python
                            
                                How to make a local variable (inside a function) global [duplicate]
                            
                                how to add border around an image in opencv python
                            
                                Pandas - combine column values into a list in a new column
                            
                                ImportError: No module named 'xlrd'
                            
                                What python libraries can tell me approximate location and time zone given an IP address?
                            
                                Objective-C (cocoa) equivalent to python's endswith/beginswith
                            
                                running a command line containing Pipes and displaying result to STDOUT
                            
                                Python: significance of -u option?
                            
                                return default if pandas dataframe.loc location doesn't exist
                            
                                Get all keys from GroupBy object in Pandas
                            
                                list.extend and list comprehension
                            
                                Does pip handle extras_requires from setuptools/distribute based sources?
                            
                                Django: Check if settings variable is set
                            
                                Why does my python not add current working directory to the path?
                            
                                Sound generation / synthesis with python?
                            
                                How to get the index with the key in a dictionary?
                            
                                Skip multiple iterations in loop
                            
                                Python method for reading keypress?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With