How can I use Python to replace HTML escape characters? [duplicate]

Tags:

python

Possible Duplicate:
Decode HTML entities in Python string?

I have a string full of HTML escape characters such as &quot;, &#8221;, and &#8212;.

Do any Python libraries offer reliable ways for me to replace all of these escape characters with their respective actual characters?

For instance, I want every &quot; replaced with ".

329

asked Jul 10 '12 02:07

dangerChihuahua007

1 Answer

You want to use this:

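# HTMLParser.unescape was removed in Python 3.9; prefer html.unescape when available
try:
    from html import unescape  # Python 3.4+
except ImportError:
    try:
        from html.parser import HTMLParser  # Python 3.0-3.3
    except ImportError:
        from HTMLParser import HTMLParser  # Python 2
    unescape = HTMLParser().unescape

html_decoded_string = unescape(html_encoded_string)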
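
As a quick check on Python 3.4+, here is a minimal sketch that uses html.unescape from the standard library. The sample string is made up from the entities mentioned in the question, so treat it as an illustration rather than your real input:

```python
import html

# Hypothetical sample input, built from the entities named in the question.
html_encoded_string = "&quot;Hello&#8221; &#8212; &amp; goodbye"

# html.unescape converts named and numeric character references
# to the corresponding Unicode characters.
html_decoded_string = html.unescape(html_encoded_string)

print(html_decoded_string)
```

Named references such as &quot; and numeric ones such as &#8221; are both handled, so no per-entity replace calls should be needed.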

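BeautifulSoup is also a popular choice for this:

# BeautifulSoup 3 API; BeautifulSoup 4 (the bs4 package) converts entities
# automatically and no longer accepts the convertEntities argument
from BeautifulSoup import BeautifulSoup
html_decoded_string = BeautifulSoup(html_encoded_string, convertEntities=BeautifulSoup.HTML_ENTITIES)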

This is also a duplicate of these existing questions:

Decode HTML entities in Python string?

Decoding HTML entities with Python

Decoding HTML Entities With Python

197

answered Sep 28 '22 10:09

Francis Yaconiello
