Working with UTF-8 encoding in Python source [duplicate]

Tags:

Consider:

$ cat bla.py  u = unicode('d…') s = u.encode('utf-8') print s $ python bla.py    File "bla.py", line 1 SyntaxError: Non-ASCII character '\xe2' in file bla.py on line 1, but no encoding declared; see http://www.python.org/peps/pep-0263.html for details

How can I declare UTF-8 strings in source code?

438

asked Jun 09 '11 07:06

Nullpoet

Video Answer

1 Answers

In Python 3, UTF-8 is the default source encoding (see PEP 3120), so unicode characters can be used anywhere.

In Python 2, you can declare in the source code header:

# -*- coding: utf-8 -*- ....

It is described in the PEP 0263:

Then you can use UTF-8 in strings:

# -*- coding: utf-8 -*-  u = 'idzie wąż wąską dróżką' uu = u.decode('utf8') s = uu.encode('cp1250') print(s)

In addition, it may be worth verifying that your text editor properly encodes your code in UTF-8. Otherwise, you may have invisible characters that are not interpreted as UTF-8.

answered Sep 28 '22 03:09

Michał Niklas

Related questions
                            
                                How to select all columns, except one column in pandas?
                            
                                Convert base-2 binary number string to int
                            
                                How to save a Python interactive session?
                            
                                How to extract the substring between two markers?
                            
                                Is there a way to perform "if" in python's lambda?
                            
                                Print a list in reverse order with range()?
                            
                                How do I execute a string containing Python code in Python?
                            
                                Case insensitive regular expression without re.compile?
                            
                                How to reset index in a pandas dataframe? [duplicate]
                            
                                How do I get the parent directory in Python?
                            
                                Pandas read_csv low_memory and dtype options
                            
                                Python unittest - opposite of assertRaises?
                            
                                Is it possible to use pip to install a package from a private GitHub repository?
                            
                                How to declare and add items to an array in Python?
                            
                                List attributes of an object [duplicate]
                            
                                How to properly assert that an exception gets raised in pytest?
                            
                                How to check if a column exists in Pandas
                            
                                Python's time.clock() vs. time.time() accuracy?
                            
                                Python Dictionary Comprehension
                            
                                Reloading submodules in IPython

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Working with UTF-8 encoding in Python source [duplicate]

Tags:

python

character-encoding

encoding

utf-8

Nullpoet

People also ask

Video Answer

1 Answers

Michał Niklas

Recent Activity

Donate For Us