I'm currently using hash()
on tuples of integers and strings (and nested tuples of integers and strings, etc.) to compute the uniqueness of some objects. Setting aside the possibility of hash collisions, I wonder: is the hash()
function on those data types guaranteed to return the same result across different versions of Python?
Yes: if you hash the same input with the same function, you will always get the same result. That follows from the fact that it is a hash function.
No. Apart from long-standing differences between 32- and 64-bit versions of Python, the hashing algorithm was changed in Python 3.3 to resolve a security issue:
By default, the hash() values of str, bytes and datetime objects are “salted” with an unpredictable random value. Although they remain constant within an individual Python process, they are not predictable between repeated invocations of Python.
This is intended to provide protection against a denial-of-service caused by carefully-chosen inputs that exploit the worst case performance of a dict insertion, O(n^2) complexity. See http://www.ocert.org/advisories/ocert-2011-003.html for details.
Changing hash values affects the iteration order of dicts, sets and other mappings. Python has never made guarantees about this ordering (and it typically varies between 32-bit and 64-bit builds).
As a result, from 3.3 onwards, hash() is not even guaranteed to return the same result across different invocations of the same Python version.
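You can observe this directly by computing hash() in fresh interpreter processes; the PYTHONHASHSEED environment variable controls the salt (0 disables randomization). A small sketch, where the helper name is our own:

```python
import os
import subprocess
import sys

def hash_in_new_process(value, seed):
    """Compute hash(value) in a fresh interpreter with PYTHONHASHSEED=seed."""
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    out = subprocess.run(
        [sys.executable, "-c", f"print(hash({value!r}))"],
        capture_output=True, text=True, env=env, check=True,
    )
    return int(out.stdout)

# Small ints hash to themselves, regardless of the seed:
print(hash_in_new_process(12345, 0) == hash_in_new_process(12345, 1))  # True
# str hashes are only reproducible across processes when the seed is pinned:
print(hash_in_new_process("spam", 0) == hash_in_new_process("spam", 0))  # True
```

Run the same script without pinning PYTHONHASHSEED and the str hashes will usually differ between invocations, which is exactly the behaviour the quote above describes.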
I'm not sure exactly what you need, but you can always use hashlib
if you want consistent hashing.
>>> import hashlib
>>> t = ("values", "other")
>>> hashlib.sha256(str(t).encode()).hexdigest()  # encode: sha256 needs bytes in Python 3
'bc3ed71325acf1386b40aa762b661bb63bb72e6df9457b838a2ea93c95cc8f0c'
OR:
>>> h = hashlib.sha256()
>>> for item in t:
...     h.update(item.encode())  # update() also requires bytes
...
>>> h.hexdigest()
'5e98df135627bc8d98250ca7e638aeb2ccf7981ce50ee16ce00d4f23efada068'
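One caveat with feeding items into the hash one after another: plain concatenation is ambiguous, so ("ab", "c") and ("a", "bc") would produce the same digest, and str(t) depends on repr details. A sketch of one way around this for nested tuples of ints and strings, using type tags and length prefixes (the framing scheme and the stable_digest name are our own convention, not a standard):

```python
import hashlib

def stable_digest(obj):
    """Recursively hash nested tuples/ints/strs with unambiguous framing."""
    h = hashlib.sha256()

    def feed(o):
        if isinstance(o, tuple):
            # Tag with the element count so nesting boundaries are explicit.
            h.update(b"T" + len(o).to_bytes(8, "big"))
            for item in o:
                feed(item)
        elif isinstance(o, int):
            # Decimal digits never collide with the tag bytes T/I/S.
            h.update(b"I" + str(o).encode())
        elif isinstance(o, str):
            enc = o.encode("utf-8")
            # Length prefix prevents ("ab", "c") == ("a", "bc").
            h.update(b"S" + len(enc).to_bytes(8, "big") + enc)
        else:
            raise TypeError(f"unsupported type: {type(o).__name__}")

    feed(obj)
    return h.hexdigest()

print(stable_digest(("ab", "c")) == stable_digest(("a", "bc")))  # False
```

Because it only depends on the values themselves (not on hash() or on repr), the digest is the same across processes, Python versions, and platforms.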