Are SHA1 hashes distributed uniformly?

I have a string in Python. I calculate the SHA1 hash of that string with hashlib. I convert it to its hexadecimal representation and take the last 16 characters to use as an identifier:

import hashlib

hash_str = "foobarbazάλφαβήταγάμμα..."
hash_obj = hashlib.sha1(hash_str.encode('utf-8'))  # hash the UTF-8 bytes
hash_id  = hash_obj.hexdigest()[-16:]  # last 16 hex characters (64 bits)

My goal is an identifier that is reasonably short and is unlikely to yield the same hash_id value for a different hash_str input.

If the probability that two different inputs share the same SHA1 hash is 1/(2^160), or 1/(16^40), then if I take only the last sixteen characters of the hex representation, is the probability of a collision 1/(16^16)? Or are the bytes (or their hex equivalent) not distributed evenly?
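One rough way to probe the second question empirically is to hash many inputs and count how often each hex digit appears in the final position. The sketch below is only an illustration, not a proof of uniformity: the helper name `last_hex_digit_counts` and the test inputs (the decimal strings "0", "1", ...) are assumptions made for this example.

```python
import hashlib
from collections import Counter


def last_hex_digit_counts(n_inputs):
    """Count how often each hex digit appears as the final digest character."""
    counts = Counter()
    for i in range(n_inputs):
        # Hypothetical test inputs: the decimal strings "0", "1", "2", ...
        digest = hashlib.sha1(str(i).encode('utf-8')).hexdigest()
        counts[digest[-1]] += 1
    return counts


n_inputs = 32000
counts = last_hex_digit_counts(n_inputs)
expected = n_inputs / 16  # count per digit if the digits were perfectly even

for digit in sorted(counts):
    # Print the digit, its observed count, and the ratio to the expected count
    print(digit, counts[digit], round(counts[digit] / expected, 3))
```

If the digits are evenly distributed, every ratio printed should be close to 1. The same check can be repeated for any other character position in the digest.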

asked Nov 06 '15 00:11

Alex Reynolds

1 Answer

Yes, the probability is 1/(16^16). Any hash function that exhibits uniformity is equally likely to produce each value in its output range for a randomly chosen input. Each value of the truncated hash is therefore equally likely too. SHA-1 is a hash function that demonstrates uniformity, so your conjecture holds: two different inputs produce the same 16-character identifier with probability 1/(16^16), or 1/(2^64).
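The figure 1/(16^16) applies to a single pair of inputs. When many identifiers are generated, the chance that any two of them collide can be estimated with the standard birthday approximation, 1 - exp(-n(n-1)/(2N)), where N is the number of possible values. The sketch below assumes the truncated hash is uniform over 2^64 values; the function name `collision_probability` is hypothetical and chosen for this example.

```python
import math


def collision_probability(n_ids, bits=64):
    """Approximate chance that at least two of n_ids uniform values collide."""
    space = 2 ** bits  # number of possible identifiers, e.g. 16^16 = 2^64
    # Birthday approximation: 1 - exp(-n(n-1) / (2N)).
    # expm1 keeps precision when the probability is very small.
    return -math.expm1(-n_ids * (n_ids - 1) / (2 * space))


for n in (10**3, 10**6, 10**9):
    print(n, collision_probability(n))
```

Under these assumptions the estimate is about 2.7e-8 for a million identifiers and roughly 2.7% for a billion, so whether 16 hex characters are enough depends on how many identifiers will be generated.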

answered Sep 21 '22 13:09

abligh

Tags: hash, probability, sha1
