I'm noticing some odd behavior in Python's Regex library, and I'm not sure if I'm doing something wrong. If I run a regex on it using <code>re.sub()</code>, with <code>re.MULTILINE</code>. It seems to only replace the first few occurrences. It replaces all occurrences if I turn off <code>re.MULTILINE</code>, use <code>re.subn(..., count = 0, flags = re.MULTILINE)</code>, or compile the regex using <code>re.compile(..., re.MULTILINE)</code>. I am running Python 2.7 on Ubuntu 12.04. I've posted a random example on: <ul> <li> Pastebin.com - Output from terminal</li> <li> codepad - Script, confirming behavior (except for re.subn(), which is different on 2.5)</li> </ul> Can someone confirm / deny this behavior on their machine? EDIT: Realized I should go ahead and post this on the Python bug tracker. EDIT 2: Issue reported: http://bugs.python.org/msg168909

Use <pre class="prettyprint"><code>re.sub(pattern, replace, text, flags=re.MULTILINE) </code></pre> instead of <pre class="prettyprint"><code>re.sub(pattern, replace, text, re.MULTILINE) </code></pre> which is equivalent to <pre class="prettyprint"><code>re.sub(pattern, replace, text, count=re.MULTILINE) </code></pre> which is a bug in your code. See re.sub()

Bug in Python Regex? (re.sub with re.MULTILINE)

Tags:

python

regex

I'm noticing some odd behavior in Python's Regex library, and I'm not sure if I'm doing something wrong.

If I run a regex on it using re.sub(), with re.MULTILINE. It seems to only replace the first few occurrences. It replaces all occurrences if I turn off re.MULTILINE, use re.subn(..., count = 0, flags = re.MULTILINE), or compile the regex using re.compile(..., re.MULTILINE).

I am running Python 2.7 on Ubuntu 12.04.

I've posted a random example on:

Pastebin.com - Output from terminal
codepad - Script, confirming behavior (except for re.subn(), which is different on 2.5)

Can someone confirm / deny this behavior on their machine?

EDIT: Realized I should go ahead and post this on the Python bug tracker. EDIT 2: Issue reported: http://bugs.python.org/msg168909

410

asked Aug 22 '12 23:08

eacousineau

1 Answers

Use

re.sub(pattern, replace, text, flags=re.MULTILINE)

instead of

re.sub(pattern, replace, text, re.MULTILINE)

which is equivalent to

re.sub(pattern, replace, text, count=re.MULTILINE)

which is a bug in your code.

See re.sub()

118

answered Nov 19 '22 19:11

jfs

Related questions
                            
                                Python 3: Flattening nested dictionaries and lists within dictionaries
                            
                                In Python, if I type a=1 b=2 c=a c=b, what is the value of c? What does c point to?
                            
                                Python 3 How to format to yyyy-mm-ddThh:mm:ssZ
                            
                                ```AttributeError: 'module' object has no attribute 'set_random_seed'``` when i run ```python2 ./train.py``` from the terminal
                            
                                Can Regex be used for this particular string manipulation?
                            
                                Python Reflection and Type Conversion
                            
                                Catch only some runtime errors in Python
                            
                                Updated (current) recommendation on Rails versus Django? [closed]
                            
                                How do I parse timezones with UTC offsets in Python?
                            
                                How to reverse engineer a program which has no documentation [closed]
                            
                                Efficient way to find the largest key in a dictionary with non-zero value
                            
                                Python code to use a regular expression to make sure a string is alphanumeric plus . - _
                            
                                Automating HP Quality Center with Python or Java
                            
                                Stop generator from within block in Python
                            
                                Remove all elements from the dictionary whose key is an element of a list
                            
                                Python distutils not using correct version of gcc
                            
                                python identity dictionary [duplicate]
                            
                                Equivalent of __func__ (from C) in Python
                            
                                Scrapy start_urls
                            
                                How to fix forward slash issue in path on windows in python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With