python 3.0, how to make print() output unicode?

Tags:

I'm working in WinXP 5.1.2600, writing a Python application involving Chinese pinyin, which has involved me in endless Unicode problems. Switching to Python 3.0 has solved many of them. But the print() function for console output is not Unicode-aware for some odd reason. Here's a teeny program.

print('sys.stdout encoding is "' + sys.stdout.encoding + '"')
str1 = 'lüelā'
print(str1)

Output is (changing angle brackets to square brackets for readability):

    sys.stdout encoding is "cp1252"
    Traceback (most recent call last):
      File "TestPrintEncoding.py", line 22, in [module]
        print(str1)
      File "C:\Python30\lib\io.py", line 1491, in write
        b = encoder.encode(s)
      File "C:\Python30\lib\encodings\cp1252.py", line 19, in encode
        return codecs.charmap_encode(input,self.errors,encoding_table)[0]
    UnicodeEncodeError: 'charmap' codec can't encode character '\u0101' 
    in position 4: character maps to [undefined]

Note that ü = \xfc = 252 gives no problem since it's upper ASCII. But ā = \u0101 is beyond 8-bits.

Anyone have an idea how to change the encoding of sys.stdout to 'utf-8'? Bear in mind that Python 3.0 no longer uses the codecs module, if I understand the documentation right.

Apologies, I gave you the program without the preamble. Before the 3 lines given, it starts like this:

#!/usr/bin/env python
# -*- coding: utf-8 -*-

import sys

Unfortunately, the coding specified by the "coding:" line is the coding of the source code, not of the console output. But thank you for your thoughts!

730

asked Feb 03 '09 13:02

bigturtle

1 Answers

The Windows command prompt (cmd.exe) cannot display the Unicode characters you are using, even though Python is handling it in a correct manner internally. You need to use IDLE, Cygwin, or another program that can display Unicode correctly.

See this thread for a full explanation: http://www.nabble.com/unable-to-print-Unicode-characters-in-Python-3-td21670662.html

183

answered Oct 20 '22 14:10

Brandon

Related questions
                            
                                Javascript unicode string, chinese character but no punctuation
                            
                                How can I detect certain Unicode characters in a string in Ruby?
                            
                                Why Unicode character for "Hearts" symbol fails with HTML
                            
                                Why use Unicode if your program is English only?
                            
                                Where are the fields documented for the unicode.org file "UnicodeData.txt"? [closed]
                            
                                MySQL unicode literals
                            
                                Unicode normalization in Postgres
                            
                                Display emoji/emotion icon in Android TextView
                            
                                Unicode output on Windows command line?
                            
                                Drawing multilingual text using PIL
                            
                                Print string literal unicode as the actual character
                            
                                Perl Encode.pm cannot decode string with wide character
                            
                                Handle wrongly encoded character in Python unicode string
                            
                                What Unicode symbols are accepted in Python 3 variable names?
                            
                                Confused about C++'s std::wstring, UTF-16, UTF-8 and displaying strings in a windows GUI
                            
                                How to uppercase/lowercase UTF-8 characters in C++?
                            
                                Any hints for those that want to upgrade from Delphi 7 (and down) to Delphi 2010?
                            
                                Java: How to get Unicode name of a character (or its type category)?
                            
                                Java Char to its unicode hexadecimal string representation and vice-versa
                            
                                Can .NET convert Unicode to ASCII to remove "smart quotes", etc?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python 3.0, how to make print() output unicode?

Tags:

python-3.x

printing

stdout

unicode

console

bigturtle

People also ask

1 Answers

Brandon

Recent Activity

Donate For Us