Python, len and slices on unicode strings

Tags:

I am handling a situation where I need to make a string fit in the allocated gap in the screen, as I'm using unicode len() and slices[] work apparently on bytes and I end up cutting unicode strings too short, because € only occupies one space in the screen but 2 for len() or slices[].

I have the encoding headers properly setup, and I'm willing to use other things than slices or len() to deal with this, but I really need to know how many spaces will the string take and how to cut it to the available.

$cat test.py
# -*- coding: utf-8 -*-
a = "2 €uros"
b = "2 Euros"
print len(b)
print len(a)
print a[3:]
print b[3:]

$python test.py
7
9
��uros
uros

919

asked Apr 17 '11 19:04

Arkaitz Jimenez

1 Answers

You're not creating Unicode strings there; you're creating byte strings with UTF-8 encoding (which is variable-length, as you're seeing). You need to use constants of the form u"..." (or u'...'). If you do that, you get the expected result:

% cat test.py
# -*- coding: utf-8 -*-
a = u"2 €uros"
b = u"2 Euros"
print len(b)
print len(a)
print a[3:]
print b[3:]
% python test.py 
7
7
uros
uros

answered Sep 22 '22 14:09

Nicholas Riley

Related questions
                            
                                python string substitution
                            
                                Convert HTTP Proxy to HTTPS Proxy in Twisted
                            
                                How to wrap a python dict?
                            
                                How do I split a string and rejoin it without creating an intermediate list in Python?
                            
                                python win32 filename length workaround
                            
                                Is it better to use exceptions in a "validation" class or return status codes?
                            
                                TypeError: 'int' object is unsubscriptable
                            
                                RESTful APIs for Django projects/apps
                            
                                Django template tag to display Django version
                            
                                Where is the help.py for Android's monkeyrunner
                            
                                Floating Point in Python
                            
                                Django custom template tag which accepts a boolean parameter
                            
                                Is there someway I can get specific details about an AttributeError exception in Python?
                            
                                Django doesn't create translation .po files
                            
                                Python: Convert JSON (returned by URL) into List
                            
                                The @login_required decorator of Django redirects people to /accounts/login when they aren't registered. How to change this URL?
                            
                                To find the number of syllables in a word
                            
                                Browser and wget load JPEG differently?
                            
                                How to optimize this Python code (from ThinkPython, Exercise 10.10)
                            
                                Python urllib2 automatic form filling and retrieval of results

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python, len and slices on unicode strings

Tags:

python

string

unicode

Arkaitz Jimenez

People also ask

1 Answers

Nicholas Riley

Recent Activity

Donate For Us