Python HTML Encoding \xc2\xa0

I've been struggling with this one for a while. I'm trying to write strings to HTML but have issues with the format once I've cleaned them. Here's an example:

paragraphs = ['Grocery giant and household name Woolworths is battered and bruised. ', 
'But behind the problems are still the makings of a formidable company']

x = str(" ")
for item in paragraphs:
    x = x + str(item)
x

Output:

"Grocery giant and household name\xc2\xa0Woolworths is battered and\xc2\xa0bruised. 
But behind the problems are still the makings of a formidable\xc2\xa0company"

Desired output:

"Grocery giant and household name Woolworths is battered and bruised. 
But behind the problems are still the makings of a formidable company"

I'm hoping you can explain why this happens and how I can fix it. Thanks in advance!

asked Sep 06 '15 02:09

Sam Perry

1 Answer

\xc2\xa0 is the byte sequence 0xC2 0xA0, which is the UTF-8 encoding of the non-breaking space (U+00A0).

It is not a control character, but it renders just like an ordinary space, so you can't tell the two apart by eye. For more information, see Wikipedia: https://en.wikipedia.org/wiki/Non-breaking_space
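A quick way to check this yourself, as a small Python 3 sketch (the variable names are only for illustration):

```python
import unicodedata

raw = b"\xc2\xa0"            # the two bytes from the question's output
char = raw.decode("utf-8")   # they decode to a single character

name = unicodedata.name(char)       # official Unicode name of that character
code_point = "U+%04X" % ord(char)   # its code point in the usual notation

print(name, code_point)
```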

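I copied the code you pasted in the question and got the expected output, presumably because the non-breaking spaces were turned into ordinary spaces when the question was posted.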
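If your original strings do contain non-breaking spaces, one way to get the desired output is to replace them with ordinary spaces when you join the paragraphs. Here is a minimal Python 3 sketch, assuming the text is already a decoded str. The helper name replace_nbsp is made up for illustration, and the non-breaking spaces are written as the escape \u00a0 so they survive copy and paste:

```python
# Same text as in the question, with the non-breaking spaces made explicit.
paragraphs = [
    "Grocery giant and household name\u00a0Woolworths is battered and\u00a0bruised. ",
    "But behind the problems are still the makings of a formidable\u00a0company",
]


def replace_nbsp(text):
    """Return text with every U+00A0 replaced by an ordinary space."""
    return text.replace("\u00a0", " ")


# Join the paragraphs first, then clean the result in one pass.
cleaned = replace_nbsp("".join(paragraphs))

# For reference: UTF-8 encodes U+00A0 as the two bytes seen in the question.
nbsp_bytes = "\u00a0".encode("utf-8")

print(cleaned)
```

On Python 2, where the output in the question shows the raw bytes, the equivalent would be to call .replace('\xc2\xa0', ' ') on the byte string, or to decode it from UTF-8 first and work with unicode strings.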

answered Oct 04 '22 06:10

liuyix

Tags: python, html, encoding
