I believed that <code>hash()</code> function works the same in all python interpreters. But it differs when I run it on my mobile using python for android. I get same hash value for hashing strings and numbers but when I hash built-in data types the hash value differs. PC Python Interpreter (Python 2.7.3) <pre class="prettyprint"><code>>>> hash(int) 31585118 >>> hash("hello sl4a") 1532079858 >>> hash(101) 101 </code></pre> Mobile Python Interpreter (Python 2.6.2) <pre class="prettyprint"><code>>>> hash(int) -2146549248 >>> hash("hello sl4a") 1532079858 >>> hash(101) 101 </code></pre> Can any one tell me is it a bug or I misunderstood something.

<code>hash()</code> is randomised by default each time you start a new instance of recent versions (Python3.3+) to prevent dictionary insertion DOS attacks Prior to that, <code>hash()</code> was different for 32bit and 64bit builds anyway. If you want something that does hash to the same thing every time, use one of the hashes in hashlib <pre class="prettyprint"><code>>>> import hashlib >>> hashlib.algorithms ('md5', 'sha1', 'sha224', 'sha256', 'sha384', 'sha512') </code></pre>

Why doesn't Python hash function give the same values when run on Android implementation?

Tags:

python

hash

sl4a

I believed that hash() function works the same in all python interpreters. But it differs when I run it on my mobile using python for android. I get same hash value for hashing strings and numbers but when I hash built-in data types the hash value differs.

PC Python Interpreter (Python 2.7.3)

>>> hash(int) 31585118 >>> hash("hello sl4a") 1532079858 >>> hash(101) 101

Mobile Python Interpreter (Python 2.6.2)

>>> hash(int) -2146549248 >>> hash("hello sl4a") 1532079858 >>> hash(101) 101

Can any one tell me is it a bug or I misunderstood something.

303

asked Jun 19 '13 13:06

Balakrishnan

2 Answers

hash() is randomised by default each time you start a new instance of recent versions (Python3.3+) to prevent dictionary insertion DOS attacks

Prior to that, hash() was different for 32bit and 64bit builds anyway.

If you want something that does hash to the same thing every time, use one of the hashes in hashlib

>>> import hashlib >>> hashlib.algorithms ('md5', 'sha1', 'sha224', 'sha256', 'sha384', 'sha512')

115

answered Sep 24 '22 01:09

John La Rooy

for old python (at least, my Python 2.7), it seems that

hash(<some type>) = id(<type>) / 16

and for CPython id() is the address in memory - http://docs.python.org/2/library/functions.html#id

>>> id(int) / hash(int)                                                      16                                                                               >>> id(int) % hash(int)                                                  0

so my guess is that the Android port has some strange convention for memory addresses?

anyway, given the above, hashes for types (and other built-ins i guess) will differ across installs because functions are at different addresses.

in contrast, hashes for values (what i think you mean by "non-internal objects") (before the random stuff was added) are calculated from their values and so likely repeatable.

PS but there's at least one more CPython wrinkle:

>>> for i in range(-1000,1000): ...     if hash(i) != i: print(i) ... -1

there's an answer here somewhere explaining that one...

answered Sep 22 '22 01:09

andrew cooke

Related questions
                            
                                Resize image in Python without losing EXIF data
                            
                                Are there benefits to running X86-64 Python on a 64-bit CPU in a 64-bit OS?
                            
                                How can I get hours from a Python datetime?
                            
                                writing data from a python list to csv row-wise
                            
                                python json dumps
                            
                                matplotlib: Aligning y-axis labels in stacked scatter plots
                            
                                Factor Loadings using sklearn
                            
                                Difference between using commas, concatenation, and string formatters in Python
                            
                                'Finally' equivalent for If/Elif statements in Python
                            
                                SQLAlchemy Automap does not create class for tables without primary key
                            
                                How to define global function in Python?
                            
                                Filling missing values using forward and backward fill in pandas dataframe (ffill and bfill)
                            
                                error: Failed to load the native TensorFlow runtime
                            
                                Group by consecutive index numbers
                            
                                How do you break into the debugger from Python source code?
                            
                                Find a specific tag with BeautifulSoup
                            
                                What happened to thread.start_new_thread in python 3
                            
                                Slicing Sparse Matrices in Scipy -- Which Types Work Best?
                            
                                How to return array from C++ function to Python using ctypes
                            
                                While reading file on Python, I got a UnicodeDecodeError. What can I do to resolve this?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With