I have a string. In that string are double backslashes. I want to replace the double backslashes with single backslashes, so that unicode char codes can be parsed correctly. <pre class="prettyprint"><code>(Pdb) p fetched_page 'Chapter 0<\\/span><\\/strong><\\/p>\nChapter 0 in \\u201cDreaming in Code\\u201d give a brief description of programming in its early years and how and why programmers are still struggling today...' </code></pre> Inside of this string, you can see escaped unicode character codes, such as: <pre class="prettyprint"><code>\\u201c </code></pre> I want to turn this into: <pre class="prettyprint"><code>\u201c </code></pre> Attempt 1: <pre class="prettyprint"><code>fetched_page.replace('\\\\', '\\') </code></pre> but this doesn't work -- it searches for quadruple backslashes. Attempt 2: <pre class="prettyprint"><code>fetched_page.replace('\\', '\') </code></pre> But this results in an end of line error. Attempt 3: <pre class="prettyprint"><code>fetched_page.decode('string_escape') </code></pre> But this had no effect on the text. All the double backslashes remained as double backslashes.

You can try <code>codecs.escape_decode</code>, this should decode the escape sequences.

How to replace a double backslash with a single backslash in python?

Tags:

python

escaping

backslash

I have a string. In that string are double backslashes. I want to replace the double backslashes with single backslashes, so that unicode char codes can be parsed correctly.

(Pdb) p fetched_page '<p style="text-align:center;" align="center"><strong><span style="font-family:\'Times New Roman\', serif;font-size:115%;">Chapter 0<\\/span><\\/strong><\\/p>\n<p><span style="font-family:\'Times New Roman\', serif;font-size:115%;">Chapter 0 in \\u201cDreaming in Code\\u201d give a brief description of programming in its early years and how and why programmers are still struggling today...'

Inside of this string, you can see escaped unicode character codes, such as:

\\u201c

I want to turn this into:

\u201c

Attempt 1:

fetched_page.replace('\\\\', '\\')

but this doesn't work -- it searches for quadruple backslashes.

Attempt 2:

fetched_page.replace('\\', '\')

But this results in an end of line error.

Attempt 3:

fetched_page.decode('string_escape')

But this had no effect on the text. All the double backslashes remained as double backslashes.

638

asked Jul 19 '11 18:07

zzz

1 Answers

You can try codecs.escape_decode, this should decode the escape sequences.

166

answered Sep 20 '22 20:09

schlamar

Related questions
                            
                                How do I change the range of the x-axis with datetimes in matplotlib?
                            
                                ValueError: max() arg is an empty sequence
                            
                                Viewing the content of a Spark Dataframe Column
                            
                                gaierror: [Errno 8] nodename nor servname provided, or not known (with macOS Sierra)
                            
                                Shared variable in python's multiprocessing
                            
                                How to get tkinter canvas to dynamically resize to window width?
                            
                                How do I do a bitwise Not operation in Python?
                            
                                Pandas groupby with bin counts
                            
                                scikit-learn: how to scale back the 'y' predicted result
                            
                                Python Reverse Find in String
                            
                                Is there an official or common knowledge standard minimal interface for a "list-like" object?
                            
                                FSharp runs my algorithm slower than Python
                            
                                Firebase cloud functions using Python?
                            
                                How to apply __str__ function when printing a list of objects in Python
                            
                                function is not defined error in Python
                            
                                How can I classify data with the nearest-neighbor algorithm using Python?
                            
                                Python Class Based Decorator with parameters that can decorate a method or a function
                            
                                How can I get a list of the symbols in a sympy expression?
                            
                                Multiple configuration files with Python ConfigParser
                            
                                Timer for Python game

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With