I would like to print a unicode's character code, and not the actual glyph it represents in Python. For example, if <code>u</code> is a list of unicode characters: <pre class="prettyprint"><code>>>> u[0] u'\u0103' >>> print u[0] ă </code></pre> I would like to output the character code as a raw string: <code>u'\u0103'</code>. I have tried to just print it to a file, but this doesn't work without encoding it in <code>UTF-8</code>. <pre class="prettyprint"><code>>>> w = open('~/foo.txt', 'w') >>> print>>w, u[0].decode('utf-8') Traceback (most recent call last): File "<pyshell#33>", line 1, in <module> print>>w, u[0].decode('utf-8') File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/encodings/utf_8.py", line 16, in decode return codecs.utf_8_decode(input, errors, True) UnicodeEncodeError: 'ascii' codec can't encode character u'\u0103' in position 0: ordinal not in range(128) >>> print>>w, u[0].encode('utf-8') >>> w.close() </code></pre> Encoding it results in the glyph <code>ă</code> being written to the file. How can I write the character code?

For printing raw unicode data one only need specify the correct encoding: <pre class="prettyprint"><code>>>> s = u'\u0103' >>> print s.encode('raw_unicode_escape') \u0103 </code></pre>

How does one print a Unicode character code in Python?

Tags:

python

unicode

I would like to print a unicode's character code, and not the actual glyph it represents in Python.

For example, if u is a list of unicode characters:

>>> u[0]
u'\u0103'
>>> print u[0]
ă

I would like to output the character code as a raw string: u'\u0103'.

I have tried to just print it to a file, but this doesn't work without encoding it in UTF-8.

>>> w = open('~/foo.txt', 'w')
>>> print>>w, u[0].decode('utf-8')

Traceback (most recent call last):
  File "<pyshell#33>", line 1, in <module>
    print>>w, u[0].decode('utf-8')
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/encodings/utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeEncodeError: 'ascii' codec can't encode character u'\u0103' in position 0: ordinal not in range(128)
>>> print>>w, u[0].encode('utf-8')
>>> w.close()

Encoding it results in the glyph ă being written to the file.

How can I write the character code?

411

asked Dec 12 '14 02:12

user3898238

1 Answers

For printing raw unicode data one only need specify the correct encoding:

>>> s = u'\u0103'
>>> print s.encode('raw_unicode_escape')
\u0103

155

answered Sep 20 '22 06:09

Jacob Bridges

Related questions
                            
                                Parse all the xml files in a directory one by one using ElementTree
                            
                                Tutorial for using requests_oauth2
                            
                                Setting Cell Formats with xlwt format strings
                            
                                Performance of row vs column operations in NumPy
                            
                                2.7 CSV module wants unicode, but doesn't want unicode
                            
                                Auto-creating related objects on model creation in Django
                            
                                How to use unicode characters with PIL?
                            
                                Kivy to Apk in Windows
                            
                                How do I concatenate many objects into one object using inheritance in python? (during runtime)
                            
                                How to disable Flask-Cache caching
                            
                                Python implementation of the laplacian of gaussian edge detection
                            
                                Python multiprocessing - watch a process and restart it when fails
                            
                                Choose at random from combinations
                            
                                Python Non negative Matrix Factorization that handles both zeros and missing data?
                            
                                What does PuLP LpStatus=Undefined actually mean?
                            
                                Using custom methods in filter with django-rest-framework
                            
                                Generating low discrepancy quasi-random sequences in python/numpy/scipy?
                            
                                How to test coverage properly with Django + Nose
                            
                                Python: strftime() UTC Offset Not working as Expected in Windows
                            
                                Installing Pylab/Matplotlib

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With