Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

When is CRC more appropriate to use than MD5/SHA1?

Tags:

embedded

hash

crc

People also ask

Is CRC faster than MD5?

CRC32 IS much faster than MD5, when a cryptographic library is properly implement.

Why MD5 is no longer recommended for use?

Although originally designed as a cryptographic message authentication code algorithm for use on the internet, MD5 hashing is no longer considered reliable for use as a cryptographic checksum because security experts have demonstrated techniques capable of easily producing MD5 collisions on commercial off-the-shelf ...

Why is SHA1 better than MD5?

To conclude, MD5 generates a message digest of 128-bits, while SHA1 generates a message digest of 160-bit hash value. Hence, SHA1 is a relatively complex algorithm and provides better security than MD5.

Why is CRC not suited for cryptographic use?

It's inappropriate to use a CRC in place of a general purpose hash function because CRCs usually have biased output.


CRC works fine for detecting random errors in data that might occur, for example, from network interference, line noise, distortion, etc.

CRC is computationally much less complex than MD5 or SHA1. Using a hash function like MD5 is probably overkill for random error detection. However, using CRC for any kind of security check would be much less secure than a more complex hashing function such as MD5.

And yes, CRC is much easier to implement on embedded hardware, you can even get different packaged solutions for this on IC.


CRC is designed against unintentional changes in the data. That is, it's good for detecting unintentional errors, but will be useless as a way of making sure a data was not maliciously handled.

Also see this.


I found a study that shows how inappropriate CRC hashes are for hash tables. It also explains the actual characteristics of the algorithm. The study also includes evaluation of other hash algorithms and is a good reference to keep.

UPDATE

It seems the site is down. The internet archive has a copy though.

UPDATE 2

Oh dear. It turns out the study may have been faulty around the conclusions on CRC for use as a hash. Thanks @minexew for the link.


I ran every line of this PHP code in 1.000.000 loop. Results are in comments (#).

hash('crc32', 'The quick brown fox jumped over the lazy dog.');#  750ms   8 chars
hash('crc32b','The quick brown fox jumped over the lazy dog.');#  700ms   8 chars
hash('md5',   'The quick brown fox jumped over the lazy dog.');#  770ms  32 chars
hash('sha1',  'The quick brown fox jumped over the lazy dog.');#  880ms  40 chars
hash('sha256','The quick brown fox jumped over the lazy dog.');# 1490ms  64 chars
hash('sha384','The quick brown fox jumped over the lazy dog.');# 1830ms  96 chars
hash('sha512','The quick brown fox jumped over the lazy dog.');# 1870ms 128 chars

My conclusion:

  • Use "crc32b" when you need http://en.wikipedia.org/wiki/Cyclic_redundancy_check and you do not care about security.
  • Use "sha256" (or higher) when you need added security layer.

  • Do not use "md5" or "sha1" because they have:

    1. some security issues when you care about security
    2. longer hash string and are slower than "crc32b" when all you need is CRC

For CRC information on implementation, speed and reliability see A painless guide to CRC error detection algorithms. It has everything on CRCs.

Unless somebody is going to try and modify your data maliciously and hide the change CRC is sufficient. Just use a "Good" (standard) polinomial.


It all depends on your requirements and expectation.

Here are quick brief differences between these hash function algorithms:

CRC (CRC-8/16/32/64)

  • is not a cryptographic hashing algorithm (it's using a linear function based on cyclic redundancy checks)
  • can produce either 9, 17, 33 or 65 bits
  • not intended to be used for cryptographic purposes since makes no cryptographic guarantees,
  • unsuitable for use in digital signatures, because it's easily reversible2006,
  • should not be used for encryption purposes,
  • different strings can generate the collision,
  • invented in 1961 and used in Ethernet and many other standards,

MD5

  • is a cryptographic hash algorithm,
  • producing a 128-bit (16-byte) hash value (32 digit hexadecimal numbers)
  • it is a cryptographic hash, but is considered deprecated if you worry about security,
  • there are known strings which have the same MD5 hash value
  • can be used for encryption purposes,

SHA-1

  • is a cryptographic hash algorithm,

  • produces a 160-bit (20-byte) hash value known as a message digest

  • it is a cryptographic hash and since 2005 it's no longer considered secure,

  • can be used for encryption purposes,

  • an example of a sha1 collision has been found

  • first published in 1993 (as SHA-0), then 1995 as SHA-1,

  • series: SHA-0, SHA-1, SHA-2, SHA-3,

    In summary, using SHA-1 is no longer considered secure against well-funded opponents, because in 2005, cryptanalysts found attacks on SHA-1 which suggests it may be not secure enough for ongoing useschneier. U.S. NIST advise that federal agencies should stop using SHA1-1 for application which require collision resistance and must use SHA-2 after 2010NIST.

Therefore, if you're looking for simple and quick solution for checking the integrity of a files (against the corruption), or for some simple caching purposes in terms of performance, you can consider CRC-32, for hashing you may consider to use MD5, however if you're developing professional application (which should be secure and consistent), to avoid any collision probabilities - use SHA-2 and above (such as SHA-3).

Performance

Some simple benchmark test in PHP:

# Testing static text.

$ time php -r 'for ($i=0;$i<1000000;$i++) crc32("foo");'
real    0m0.845s
user    0m0.830s
sys     0m0.008s

$ time php -r 'for ($i=0;$i<1000000;$i++) md5("foo");'
real    0m1.103s
user    0m1.089s
sys     0m0.009s

$ time php -r 'for ($i=0;$i<1000000;$i++) sha1("foo");'
real    0m1.132s
user    0m1.116s
sys   0m0.010s

# Testing random number. 

$ time php -r 'for ($i=0;$i<1000000;$i++) crc32(rand(0,$i));'
real    0m1.754s
user    0m1.735s
sys     0m0.012s\

$ time php -r 'for ($i=0;$i<1000000;$i++) md5(rand(0,$i));'
real    0m2.065s
user    0m2.042s
sys     0m0.015s

$ time php -r 'for ($i=0;$i<1000000;$i++) sha1(rand(0,$i));'
real    0m2.050s
user    0m2.021s
sys     0m0.015s

Related:

  • What’s the difference between md5(), crc32() and sha1() crypto on PHP?