This works <pre class="prettyprint"><code>s = 'jiā' s.find(u'\u0101') </code></pre> How do I do something like this: <pre class="prettyprint"><code>s = 'jiā' zzz = '\u0101' s.find(zzz) </code></pre> Since I'm using a variable now, how do I indicate the string represented by the variable is Unicode?

<blockquote> Since I'm using a variable now, how do I indicate the string represented by the variable is Unicode? </blockquote> By defining it as a Unicode string in the first place. <pre class="prettyprint"><code>zzz = u"foo" </code></pre> Or, if you already have a string in some other encoding, by converting it to Unicode (the original encoding must be specified if the string is non-ASCII). <pre class="prettyprint"><code>zzz = unicode(zzz, encoding="latin1") </code></pre> Or by using Python 3 where all strings are Unicode.

<code>zzz</code> as defined in your post is a plain <code>str</code> object, not a <code>unicode</code> object, so there is no way to indicate that it is something it actually isn't. You can convert the <code>str</code> object to a <code>unicode</code> object, though, by specifying an encoding: <pre class="prettyprint"><code>s.find(zzz.decode("utf-8")) </code></pre> Substitue <code>utf-8</code> by whatever encoding the string is encoded in. Note that in your example <pre class="prettyprint"><code>zzz = '\u0101' </code></pre> <code>zzz</code> is a plain string of length 6. There is no easy way to fix this wrong string literal afterwards, except for hacks along the lines of <pre class="prettyprint"><code>ast.literal_eval("u'" + zzz + "'") </code></pre>

Python - How can I do a string find on a Unicode character that is a variable?

Tags:

python

unicode

This works

s = 'jiā'
s.find(u'\u0101')

How do I do something like this:

s = 'jiā'
zzz = '\u0101'
s.find(zzz)

Since I'm using a variable now, how do I indicate the string represented by the variable is Unicode?

255

asked Nov 11 '11 16:11

Steve

2 Answers

Since I'm using a variable now, how do I indicate the string represented by the variable is Unicode?

By defining it as a Unicode string in the first place.

zzz = u"foo"

Or, if you already have a string in some other encoding, by converting it to Unicode (the original encoding must be specified if the string is non-ASCII).

zzz = unicode(zzz, encoding="latin1")

Or by using Python 3 where all strings are Unicode.

166

answered Oct 24 '22 17:10

kindall

zzz as defined in your post is a plain str object, not a unicode object, so there is no way to indicate that it is something it actually isn't. You can convert the str object to a unicode object, though, by specifying an encoding:

s.find(zzz.decode("utf-8"))

Substitue utf-8 by whatever encoding the string is encoded in.

Note that in your example

zzz = '\u0101'

zzz is a plain string of length 6. There is no easy way to fix this wrong string literal afterwards, except for hacks along the lines of

ast.literal_eval("u'" + zzz + "'")

answered Oct 24 '22 15:10

Sven Marnach

Related questions
                            
                                What's the difference between /usr/local/lib/python2.6 and /usr/lib/python2.6?
                            
                                How to get generated captcha image using mechanize
                            
                                Django is sooo slow? errno 32 broken pipe? dcramer-django-sentry-? static folder?
                            
                                When processing a file, how do I obtain the current line number?
                            
                                python - open() not working with path names
                            
                                Find names of positional arguments through introspection
                            
                                Why won't LD_PRELOAD work with Python?
                            
                                Efficient Shift Scheduling in Python
                            
                                Python TypeError: Required argument 'source' (pos 1) not found
                            
                                Polar contour plot in Matplotlib
                            
                                Explanation needed regarding explanation of hashable objects
                            
                                Checkbox input using python mechanize
                            
                                iterate through unicode strings and compare with unicode in python dictionary
                            
                                Python: overloading tuple multi-assignment capabilities?
                            
                                speeding up numpy kronecker products
                            
                                Easy to use Python encryption library/wrapper?
                            
                                WebDriverException:can't load profile error in selenium python script
                            
                                python "string" module?
                            
                                Python decorators: how to use parent class decorators in a child class [duplicate]
                            
                                What's the cost of releasing the GIL?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With