So I have a python script that I'd prefer worked on python 3.2 and 2.7 just for convenience. Is there a way to have unicode literals that work in both? E.g. <pre class="prettyprint"><code>#coding: utf-8 whatever = 'שלום' </code></pre> The above code would require a unicode string in python 2.x (<code>u''</code>) and in python 3.x that little <code>u</code> causes a syntax error.

Edit - Since Python 3.3, the <code>u''</code> literal works again, so the <code>u()</code> function isn't needed. The best option is to make a method that creates unicode objects from string objects in Python 2, but leaves the string objects alone in Python 3 (as they are already unicode). <pre class="prettyprint"><code>import sys if sys.version < '3': import codecs def u(x): return codecs.unicode_escape_decode(x)[0] else: def u(x): return x </code></pre> You would then use it like so: <pre class="prettyprint"><code>>>> print(u('\u00dcnic\u00f6de')) Ünicöde >>> print(u('\xdcnic\N{Latin Small Letter O with diaeresis}de')) Ünicöde </code></pre>

Unicode literals that work in python 3 and 2

Tags:

python

python-3.x

unicode

python-2.x

unicode-literals

So I have a python script that I'd prefer worked on python 3.2 and 2.7 just for convenience.

Is there a way to have unicode literals that work in both? E.g.

#coding: utf-8
whatever = 'שלום'

The above code would require a unicode string in python 2.x (u'') and in python 3.x that little u causes a syntax error.

430

asked Jul 08 '11 14:07

ubershmekel

1 Answers

Edit - Since Python 3.3, the u'' literal works again, so the u() function isn't needed.

The best option is to make a method that creates unicode objects from string objects in Python 2, but leaves the string objects alone in Python 3 (as they are already unicode).

import sys
if sys.version < '3':
    import codecs
    def u(x):
        return codecs.unicode_escape_decode(x)[0]
else:
    def u(x):
        return x

You would then use it like so:

>>> print(u('\u00dcnic\u00f6de'))
Ünicöde
>>> print(u('\xdcnic\N{Latin Small Letter O with diaeresis}de'))
Ünicöde

136

answered Oct 09 '22 15:10

Lennart Regebro

Related questions
                            
                                Installing pyodbc fails on OSX 10.9 (Mavericks)
                            
                                How do you create an incremental ID in a Python Class
                            
                                How to delete the last column of data of a pandas dataframe
                            
                                django Datefield to Unix timestamp
                            
                                Mapping a class against multiple tables in SQLAlchemy
                            
                                Optimizing subgraph of large graph - slower than optimizing subgraph by itself
                            
                                How can I remove distortion introduced by librosa griffin lim?
                            
                                Twisted server crashes unexpectedly while running django
                            
                                Calling condition.wait() inside thread causes retrieval of any future to block on main thread
                            
                                Fitting a scikits.learn.hmm.GaussianHMM to variable length training sequences
                            
                                How to override the django admin translation?
                            
                                Python alternative to R Markdown [closed]
                            
                                Python fastest way to read a large text file (several GB) [duplicate]
                            
                                How do you add breakpoints to a Python program in IDLE?
                            
                                Algorithm to group sets of points together that follow a direction
                            
                                Django and VirtualEnv Development/Deployment Best Practices
                            
                                Reset ipython kernel
                            
                                Generating all 5 card poker hands
                            
                                Django - Understanding X-Sendfile
                            
                                Can the Django ORM store an unsigned 64-bit integer (aka ulong64 or uint64) in a reliably backend-agnostic manner?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With