Regex

Question

I am trying to replace all \n and from a string by   but I dont want the \\n to be replaced.

Im trying with the following expression using negative lookbehind but it does not return the correct result because all the "n" and "\" are replaced:

import re
string = "this is a sample\nstring\\ncontaining
all cases
\n
!"
re.sub(r"(?<!\)([\n
]+)", r"<br>", string)
>>'this is a sample<br>stri<br>g<br>co<br>tai<br>i<br>g<br>all cases<br>!'

expected output

"this is a sample<br>string\\ncontaining<br>all cases<br><br><br>!"

dercolamann · Accepted Answer

This will do the magic:

re.sub(r"(?<!\)\n|
", "<br>", string)

Note: it will replace line breaks (" ") and escaped line breaks (" " or r" "). It does not escape "\n" (or r" "). "\ " (backslash + new line) becomes "\ ".

Maybe, what you really want is:

re.sub(r"(?<!\)(\\)*\n|
", "\1<br>", string)

This replaces all new lines and all escaped n (r" "). r"\n" is not replaced. r"\ " is again replaced (escaped backslash + escaped n).

ctwheels · Answer

Your regex has the character class [\n ] which matches \, n, or . Your lookbehind logic is correct, you just need to change your character class to a different subpattern: \{1,2}n.

See regex in use here

(?<!\)\{1,2}n

(?<!\) Negative lookbehind ensuring what precedes is not \
\{1,2} Match \ once or twice
n Match this literally

Replacement:  

Alternative: (?<!\)\\?n as provided by @revo in the comments below the question

Usage in code

See in use here

import re

r = re.compile(r"(?<!\)\{1,2}n")
s = r"this is a sample\nstring\\ncontaining
all cases
\n
!"
print(r.sub("<br>", s, 0))

Result: this is a sample string\\ncontaining all cases !

Regex - Replace \\n and \n in string by <br> but not \\\\n

Tags:

python

Below the Radar

2 Answers

dercolamann

Usage in code

ctwheels

Recent Activity

Donate For Us

Regex - Replace \\n and \n in string by <br> but not \\\\n

Tags:

python

regex

Below the Radar

2 Answers

dercolamann

Usage in code

ctwheels

Related questions

Recent Activity

Donate For Us