I have html file called <code>test.html</code> it has one word <code>בדיקה</code>. I open the test.html and print it's content using this block of code: <pre class="prettyprint"><code>file = open("test.html", "r") print file.read() </code></pre> but it prints <code>??????</code>, why this happened and how could I fix it? BTW. when I open text file it works good. Edit: I'd tried this: <pre class="prettyprint"><code>>>> import codecs >>> f = codecs.open("test.html",'r') >>> print f.read() ????? </code></pre>

<pre class="prettyprint"><code>import codecs f=codecs.open("test.html", 'r') print f.read() </code></pre> Try something like this.

How to open html file?

Tags:

python

character-encoding

python-2.7

I have html file called test.html it has one word בדיקה.

I open the test.html and print it's content using this block of code:

file = open("test.html", "r") print file.read()

but it prints ??????, why this happened and how could I fix it?

BTW. when I open text file it works good.

Edit: I'd tried this:

>>> import codecs >>> f = codecs.open("test.html",'r') >>> print f.read() ?????

954

asked Dec 02 '14 06:12

david

2 Answers

import codecs f=codecs.open("test.html", 'r') print f.read()

Try something like this.

answered Oct 05 '22 13:10

vks

I encountered this problem today as well. I am using Windows and the system language by default is Chinese. Hence, someone may encounter this Unicode error similarly. Simply add encoding = 'utf-8':

with open("test.html", "r", encoding='utf-8') as f:     text= f.read()

answered Oct 05 '22 14:10

Chen Mier

Related questions
                            
                                What is the order of evaluation in python when using pop(), list[-1] and +=?
                            
                                How to convert a boto3 Dynamo DB item to a regular dictionary in Python?
                            
                                running code if try statements were successful in python
                            
                                assign operator to variable in python?
                            
                                changing the values of the diagonal of a matrix in numpy
                            
                                how to post multiple value with same key in python requests?
                            
                                How do I reverse a part (slice) of a list in Python?
                            
                                Flask-WTF - validate_on_submit() is never executed
                            
                                How to keep multiple independent celery queues?
                            
                                Un-persisting all dataframes in (py)spark
                            
                                How can I replicate rows in Pandas?
                            
                                Unexplainable Flask 404 errors
                            
                                Django and Middleware which uses request.user is always Anonymous
                            
                                Dynamically exclude or include a field in Django REST framework serializer
                            
                                In numpy, what does selection by [:,None] do?
                            
                                Python - writing and reading from a temporary file
                            
                                ImportError: cannot import name np_utils
                            
                                Filtering against query param
                            
                                Python: Getting a traceback from a multiprocessing.Process
                            
                                Python and default dict, how to pprint

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With