Different storage position of equal strings with special characters [duplicate]

Tags:

python-2.7

I am new to Python and I'm currently exploring some of its core functionality.

Could you explain why the following example always returns False for a string with special characters:

>>> a="x"
>>> b="x"
>>> a is b
True
>>> a="xxx"
>>> b="xxx"
>>> a is b
True
>>> a="xü"
>>> b="xü"
>>> a is b
False
>>> a="ü"
>>> b="ü"
>>> a is b
True
>>> #strange: with one special character it works as expected

I understand that strings with special characters get a different storage position on each assignment (I already checked this with the id() function), but why does Python handle strings in this inconsistent way?

850

asked Aug 07 '14 12:08

Nico M

1 Answer

CPython (the reference implementation, at least) keeps a cache of small integers and some strings. My guess is that strings containing characters outside the ASCII range fall outside what that cache covers (internally, unicode objects are stored using 16- or 32-bit wide characters, UCS-2 or UCS-4), so they are not cached.
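Whatever the exact caching rule is, it is a CPython implementation detail, so `is` should not be used to compare string values. Here is a minimal sketch of the difference between equality and identity. The helper names `build` and `intern_string` are made up for illustration, and the strings are built at runtime so the interpreter cannot simply share one compile-time constant. The value of `a is b` depends on the implementation and is only printed, not relied on.

```python
import sys

# intern() is a builtin on Python 2 and lives in the sys module on Python 3.
try:
    intern_string = sys.intern
except AttributeError:
    intern_string = intern  # Python 2 builtin


def build(parts):
    # Join at runtime so each call produces its own string object.
    return "".join(parts)


a = build(["x", "\xfc"])  # "\xfc" is the escape for the character u-umlaut
b = build(["x", "\xfc"])

same_value = (a == b)    # compares contents
same_object = (a is b)   # compares identity; implementation-dependent

# Explicit interning maps equal strings to one shared object.
ia = intern_string(a)
ib = intern_string(b)
interned_same_object = (ia is ib)

print(same_value, same_object, interned_same_object)
```

Use `==` whenever the question is whether two strings have the same content, and keep `is` for identity checks such as `x is None`.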

[edit]

Found a more complete answer at: "About the changing id of a Python immutable string"

See also: http://www.laurentluce.com/posts/python-string-objects-implementation/

166

answered Oct 07 '22 14:10

Paulo Scardine
