Python - UnicodeEncodeError: 'charmap' codec can't encode characters in position 85-89: character maps to <undefined>

Tags:

I am trying to see if I can transfer the output of urllib.request.urlopen() to a text file just to look at it. I tried decoding the output into a string so I can write into a file, but apparently the original output included some Korean characters that are not translating properly into the string.

So far I have:

from urllib.request import urlopen

openU = urlopen(myUrl)
pageH = openU.read()
openU.close()
stringU = pageH.decode("utf-8")

f=open("test.txt", "w+")
f.write(stringU)

I do not get any errors until the last step at which point it says:

Traceback (most recent call last):  
  File "<stdin>", line 1, in <module>  
  File "C:\Users\Chae\AppData\Local\Programs\Python\Python36\lib\encodings\cp1252.py", line 19, in encode  
  return codecs.charmap_encode(input,self.errors,encoding_table)[0] 
UnicodeEncodeError: 'charmap' codec can't encode characters in position 85-89: character maps to `<undefined>`

Is there a way to get the string to also include Korean or if not, how do I skip the characters causing problems and write the rest of the string into the file?

545

asked Apr 06 '18 01:04

Chae

1 Answers

Does it matter to you what the file encoding is? If not, then use utf-8 encoding:

f=open("test.txt", "w+", encoding="utf-8")
f.write(stringU)

If you want the file to be cp1252-encoded, which apparently is the default on your system, and to ignore unencodable values, add errors="ignore":

f=open("test.txt", "w+", errors="ignore")
f.write(stringU)

156

answered Oct 05 '22 11:10

Robᵩ

Related questions
                            
                                Easy way to distinguish between 0 and False in a dataframe with mixed values
                            
                                Why do we need str type? Why not just byte-strings?
                            
                                why is multiprocess Pool slower than a for loop?
                            
                                How to execute a local python script into a docker from another python script?
                            
                                Missing interpolate in scipy 0.17
                            
                                Which one is faster np.vstack, np.append, np.concatenate or a manual function made in cython?
                            
                                Create list by subtracting the nth+1 value from the nth values of another list
                            
                                Pass parameters between flask @app.route('/page')
                            
                                python pivot table of counts
                            
                                Create a Dataframe from a Series and a String
                            
                                NumPy ufuncs are 2x faster in one axis over the other
                            
                                Get index where value changes in pandas dataframe column
                            
                                Efficiency: 2D-list to dictionary in python
                            
                                How to upgrade tensorflow with GPU on google colaboratory
                            
                                ImportError: Unable to find zbar shared library on Flask
                            
                                Escape New line character in Spark CSV read
                            
                                How do I input values from a list into a string?
                            
                                What is the difference between pandas dtype vs dtypes
                            
                                pandas merge_asof keys must be sorted error after sorting
                            
                                Pybind Numpy access 2D / ND arrays

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python - UnicodeEncodeError: 'charmap' codec can't encode characters in position 85-89: character maps to <undefined>

Tags:

python

python-3.x

utf-8

web-scraping

Chae

People also ask

1 Answers

Robᵩ

Recent Activity

Donate For Us