Is there anyway that I can hash a random string into a 8 digit number without implementing any algorithms myself?

Yes, you can use the built-in <code>hashlib</code> module or the built-in <code>hash</code> function. Then, chop-off the last eight digits using modulo operations or string slicing operations on the integer form of the hash: <pre class="prettyprint"><code>>>> s = 'she sells sea shells by the sea shore' >>> # Use hashlib >>> import hashlib >>> int(hashlib.sha1(s.encode("utf-8")).hexdigest(), 16) % (10 ** 8) 58097614L >>> # Use hash() >>> abs(hash(s)) % (10 ** 8) 82148974 </code></pre>

Raymond's answer is great for python2 (though, you don't need the abs() nor the parens around 10 ** 8). However, for python3, there are important caveats. First, you'll need to make sure you are passing an encoded string. These days, in most circumstances, it's probably also better to shy away from sha-1 and use something like sha-256, instead. So, the hashlib approach would be: <pre class="prettyprint"><code>>>> import hashlib >>> s = 'your string' >>> int(hashlib.sha256(s.encode('utf-8')).hexdigest(), 16) % 10**8 80262417 </code></pre> If you want to use the hash() function instead, the important caveat is that, unlike in Python 2.x, in Python 3.x, the result of hash() will only be consistent within a process, not across python invocations. See here: <pre class="prettyprint"><code>$ python -V Python 2.7.5 $ python -c 'print(hash("foo"))' -4177197833195190597 $ python -c 'print(hash("foo"))' -4177197833195190597 $ python3 -V Python 3.4.2 $ python3 -c 'print(hash("foo"))' 5790391865899772265 $ python3 -c 'print(hash("foo"))' -8152690834165248934 </code></pre> This means the hash()-based solution suggested, which can be shortened to just: <code>hash(s) % 10**8</code> will only return the same value within a given script run: <pre class="prettyprint"><code>#Python 2: $ python2 -c 's="your string"; print(hash(s) % 10**8)' 52304543 $ python2 -c 's="your string"; print(hash(s) % 10**8)' 52304543 #Python 3: $ python3 -c 's="your string"; print(hash(s) % 10**8)' 12954124 $ python3 -c 's="your string"; print(hash(s) % 10**8)' 32065451 </code></pre> So, depending on if this matters in your application (it did in mine), you'll probably want to stick to the hashlib-based approach.

How to hash a string into 8 digits?

2 Answers

Yes, you can use the built-in hashlib module or the built-in hash function. Then, chop-off the last eight digits using modulo operations or string slicing operations on the integer form of the hash:

>>> s = 'she sells sea shells by the sea shore'  >>> # Use hashlib >>> import hashlib >>> int(hashlib.sha1(s.encode("utf-8")).hexdigest(), 16) % (10 ** 8) 58097614L  >>> # Use hash() >>> abs(hash(s)) % (10 ** 8) 82148974

117

answered Sep 18 '22 09:09

Raymond Hettinger

Raymond's answer is great for python2 (though, you don't need the abs() nor the parens around 10 ** 8). However, for python3, there are important caveats. First, you'll need to make sure you are passing an encoded string. These days, in most circumstances, it's probably also better to shy away from sha-1 and use something like sha-256, instead. So, the hashlib approach would be:

>>> import hashlib >>> s = 'your string' >>> int(hashlib.sha256(s.encode('utf-8')).hexdigest(), 16) % 10**8 80262417

If you want to use the hash() function instead, the important caveat is that, unlike in Python 2.x, in Python 3.x, the result of hash() will only be consistent within a process, not across python invocations. See here:

$ python -V Python 2.7.5 $ python -c 'print(hash("foo"))' -4177197833195190597 $ python -c 'print(hash("foo"))' -4177197833195190597  $ python3 -V Python 3.4.2 $ python3 -c 'print(hash("foo"))' 5790391865899772265 $ python3 -c 'print(hash("foo"))' -8152690834165248934

This means the hash()-based solution suggested, which can be shortened to just:

hash(s) % 10**8

will only return the same value within a given script run:

#Python 2: $ python2 -c 's="your string"; print(hash(s) % 10**8)' 52304543 $ python2 -c 's="your string"; print(hash(s) % 10**8)' 52304543  #Python 3: $ python3 -c 's="your string"; print(hash(s) % 10**8)' 12954124 $ python3 -c 's="your string"; print(hash(s) % 10**8)' 32065451

So, depending on if this matters in your application (it did in mine), you'll probably want to stick to the hashlib-based approach.

answered Sep 21 '22 09:09

JJC

Related questions
                            
                                How to update Python?
                            
                                Best way to create a simple python web service [closed]
                            
                                Why does the expression 0 < 0 == 0 return False in Python?
                            
                                Suppress/ print without b' prefix for bytes in Python 3
                            
                                How can I profile Python code line-by-line?
                            
                                What is a None value?
                            
                                Complex numbers in python
                            
                                What is the difference between class and instance attributes?
                            
                                Iterating Over Dictionary Key Values Corresponding to List in Python
                            
                                How should I read a file line-by-line in Python?
                            
                                How to extract the year from a Python datetime object?
                            
                                How to "select distinct" across multiple data frame columns in pandas?
                            
                                Very Long If Statement in Python [duplicate]
                            
                                How to set class attribute with await in __init__
                            
                                unbound method f() must be called with fibo_ instance as first argument (got classobj instance instead)
                            
                                Binning a column with Python Pandas
                            
                                Given a URL to a text file, what is the simplest way to read the contents of the text file?
                            
                                Split string using a newline delimiter with Python
                            
                                [] and {} vs list() and dict(), which is better?
                            
                                how do you filter pandas dataframes by multiple columns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to hash a string into 8 digits?

Tags:

python

arrays

algorithm

random

hash

Bob Fang

People also ask

2 Answers

Raymond Hettinger

JJC

Recent Activity

Donate For Us