Is it safe to ignore the possibility of SHA collisions in practice?

People also ask

Can hash collisions be avoided?

A collision occurs when two keys are hashed to the same index in a hash table. Chaining is a technique for resolving such collisions, not for avoiding them: keys that hash to the same index are stored together in a list at that index.

How likely is a SHA256 collision?

SHA256: The slowest of the hashes compared, usually 60% slower than md5, and the longest generated hash (32 bytes). The probability of an accidental collision among one billion hashes is approximately 4.3*10^-60. In general, the longer the hash, the less likely an accidental collision.

Are there any known SHA256 collisions?

No, there is no known SHA-256 collision. Publication of one, or of a remotely feasible method to obtain one, would be considered major news. It is next to impossible that two distinct strings with the same SHA-256 have been computed so far. The most visible large-scale computation of SHA-256 hashes is bitcoin mining.

The usual answer goes thus: what is the probability that a rogue asteroid crashes on Earth within the next second, obliterating civilization-as-we-know-it, and killing off a few billion people? It can be argued that any unlucky event with a probability lower than that is not actually very important.

If we have a "perfect" hash function with output size n, and we have p messages to hash (individual message length is not important), then the probability of collision is about p^2/2^(n+1) (this is an approximation which is valid for "small" p, i.e. substantially smaller than 2^(n/2)). For instance, with SHA-256 (n=256) and one billion messages (p=10^9) the probability is about 4.3*10^-60.
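As a quick check of that figure, here is a minimal Python sketch of the approximation p^2/2^(n+1). The function name `collision_probability` is just an illustrative choice, and the result is only meaningful while p stays well below 2^(n/2).

```python
def collision_probability(p: int, n: int) -> float:
    """Birthday-bound approximation p^2 / 2^(n+1) for an ideal n-bit hash.

    Only valid while p is much smaller than 2^(n/2).
    """
    # Python integers are arbitrary precision, so 2**(n + 1) is exact;
    # the true division rounds the quotient to a float.
    return p**2 / 2**(n + 1)


sha256_billion = collision_probability(10**9, 256)
print(f"{sha256_billion:.1e}")  # prints 4.3e-60
```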

A mass-murderer space rock happens about once every 30 million years on average. This puts the probability of such an event occurring in the next second at about 10^-15. That's roughly 45 orders of magnitude more probable than the SHA-256 collision. Briefly stated, if you find SHA-256 collisions scary then your priorities are wrong.
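The comparison can be reproduced with a few lines of Python. This is only a back-of-the-envelope sketch: the 30-million-year interval is the figure quoted above, and the variable names are illustrative. With unrounded inputs the gap comes out a little over 44 orders of magnitude, which rounds to the figure quoted in the text.

```python
import math

SECONDS_PER_YEAR = 365.25 * 24 * 3600

# One impact per 30 million years, spread evenly over every second.
asteroid_per_second = 1 / (30e6 * SECONDS_PER_YEAR)

# Birthday approximation p^2 / 2^(n+1) with p = 10^9 and n = 256.
sha256_collision = (10**9) ** 2 / 2**257

# How many powers of ten separate the two probabilities.
orders_of_magnitude = math.log10(asteroid_per_second / sha256_collision)

print(f"asteroid in the next second: {asteroid_per_second:.1e}")
print(f"SHA-256 collision:           {sha256_collision:.1e}")
print(f"gap: about {orders_of_magnitude:.1f} orders of magnitude")
```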

In a security setup, where an attacker gets to choose the messages which will be hashed, then the attacker may use substantially more than a billion messages; however, you will find that the attacker's success probability will still be vanishingly small. That's the whole point of using a hash function with a 256-bit output: so that risks of collision can be neglected.
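To put a number on "substantially more than a billion", the sketch below plugs a much larger budget into the same approximation. The figure of 2^80 hash evaluations is a hypothetical attacker budget chosen for illustration, not something taken from the answer, and the estimate only covers a generic birthday search against an ideal 256-bit hash.

```python
# Hypothetical attacker budget: 2^80 hash evaluations (an assumption).
attacker_hashes = 2**80
n = 256

# The approximation needs p well below 2^(n/2) = 2^128, which holds here.
assert attacker_hashes < 2 ** (n // 2)

# p^2 / 2^(n+1) = 2^160 / 2^257 = 2^-97
attacker_success = attacker_hashes**2 / 2 ** (n + 1)

print(f"{attacker_success:.1e}")  # prints 6.3e-30
```

Even with that budget the success probability stays far below the asteroid figure.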

Of course, all of the above assumes that SHA-256 is a "perfect" hash function, which is far from being proven. Still, SHA-256 seems quite robust.

The possibility of a collision does not depend on the size of the files, only on their number.
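One way to see why file size drops out: whatever the input length, SHA-256 always produces a 32-byte digest, so every file lands in the same space of 2^256 possible values. A small illustration with Python's standard `hashlib` module (the inputs are arbitrary sample data):

```python
import hashlib

# Digest of a 1-byte input and of a 1,000,000-byte input.
small_digest = hashlib.sha256(b"x").digest()
large_digest = hashlib.sha256(b"x" * 1_000_000).digest()

# Both are 32 bytes (256 bits), regardless of input size.
print(len(small_digest), len(large_digest))  # prints 32 32
```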

This is an example of the birthday paradox. The Wikipedia page gives an estimate of the likelihood of a collision. If you run the numbers, you'll see that all hard disks ever produced on Earth can't hold enough 1MB files to reach a collision likelihood of even 0.01% for SHA-256.
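Running the numbers can look like this. The sketch inverts the usual birthday approximation (probability about p^2/2^(n+1) for p files and an n-bit hash) to find how many files are needed for a 0.01% chance, then multiplies by 1MB per file. Treating 1MB as 10^6 bytes is an assumption, and the variable names are illustrative.

```python
import math

n = 256                # SHA-256 output size in bits
target = 1e-4          # 0.01% collision probability
file_size = 10**6      # 1MB per file, taken as 10^6 bytes (assumption)

# Solve p^2 / 2^(n+1) = target for p.
files_needed = math.sqrt(target * 2 ** (n + 1))
bytes_needed = files_needed * file_size

print(f"files needed: {files_needed:.1e}")
print(f"bytes needed: {bytes_needed:.1e}")
# For scale: one zettabyte is 10^21 bytes.
print(f"zettabytes:   {bytes_needed / 1e21:.1e}")
```

The result is on the order of 10^36 files, or about 10^21 zettabytes of storage.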

Basically, you can simply ignore the possibility.

First of all, it is not zero, but very close to zero.

The key question is what happens if a collision actually occurs? If the answer is "a nuclear power plant will explode" then you likely shouldn't ignore the collision possibility. In most cases the consequences are not that dire and so you can ignore the collision possibility.

Also don't forget that your software (or a tiny part of it) might be deployed and used simultaneously on a huge number of computers, including the tiny embedded microcomputers that are almost everywhere nowadays. In that case you need to multiply the estimate you've got by the largest possible number of copies.
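A rough sketch of that multiplication, assuming each copy hashes its own one billion messages with SHA-256 and that only collisions within a single copy matter. The count of ten billion copies is a made-up number for illustration. Multiplying gives an upper bound (the union bound) on the chance that any copy ever sees a collision.

```python
# Per-copy estimate: p^2 / 2^(n+1) with p = 10^9 messages and n = 256.
per_copy = (10**9) ** 2 / 2**257

# Hypothetical number of deployed copies (an assumption).
copies = 10**10

# Union bound: P(any copy collides) <= copies * P(one copy collides).
any_copy = per_copy * copies

print(f"{any_copy:.1e}")  # prints 4.3e-50
```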

