My understanding is that a hash code and checksum are similar things - a numeric value, computed for a block of data, that is relatively unique. i.e. The probability of two blocks of data yielding the same numeric hash/checksum value is low enough that it can be ignored for the purposes of the application. So do we have two words for the same thing, or are there important differences between hash codes and checksums?

There is a different purpose behind each of them: <ul> <li>Hash code - designed to be random across its domain (to minimize collisions in hash tables and such). Cryptographic hash codes are also designed to be computationally infeasible to reverse.</li> <li>Check sum - designed to detect the most common errors in the data and often to be fast to compute (for effective checksumming fast streams of data).</li> </ul> In practice, the same functions are often good for both purposes. In particular, a cryptographically strong hash code is a good checksum (it is almost impossible that a random error will break a strong hash function), if you can afford the computational cost.

Hash Code and Checksum - what's the difference?

2 Answers

I would say that a checksum is necessarily a hashcode. However, not all hashcodes make good checksums.

A checksum has a special purpose --- it verifies or checks the integrity of data (some can go beyond that by allowing for error-correction). "Good" checksums are easy to compute, and can detect many types of data corruptions (e.g. one, two, three erroneous bits).

A hashcode simply describes a mathematical function that maps data to some value. When used as a means of indexing in data structures (e.g. a hash table), a low collision probability is desirable.

117

answered Sep 30 '22 05:09

Zach Scrivena

There is a different purpose behind each of them:

Hash code - designed to be random across its domain (to minimize collisions in hash tables and such). Cryptographic hash codes are also designed to be computationally infeasible to reverse.
Check sum - designed to detect the most common errors in the data and often to be fast to compute (for effective checksumming fast streams of data).

In practice, the same functions are often good for both purposes. In particular, a cryptographically strong hash code is a good checksum (it is almost impossible that a random error will break a strong hash function), if you can afford the computational cost.

answered Sep 30 '22 04:09

Rafał Dowgird

Related questions
                            
                                Difference between parameter and argument [duplicate]
                            
                                Understanding how recursive functions work
                            
                                TDD vs. Unit testing [closed]
                            
                                What algorithm gives suggestions in a spell checker?
                            
                                Graph Algorithm To Find All Connections Between Two Arbitrary Vertices
                            
                                What is the optimal Jewish toenail cutting algorithm?
                            
                                Why shouldn't I use "Hungarian Notation"?
                            
                                design a stack such that getMinimum( ) should be O(1)
                            
                                Can hash tables really be O(1)?
                            
                                What is a debugger and how can it help me diagnose problems?
                            
                                Under what circumstances are linked lists useful?
                            
                                What's the point of OOP?
                            
                                File I/O in Every Programming Language [closed]
                            
                                Algorithm to generate a crossword [closed]
                            
                                Solving "Who owns the Zebra" programmatically?
                            
                                Convert light frequency to RGB?
                            
                                How to develop and test an app that sends emails (without filling someone's mailbox with test data)? [closed]
                            
                                Conventions for exceptions or error codes [closed]
                            
                                How do I check if a number is a palindrome?
                            
                                What is the difference between the Facade and Adapter Pattern?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Hash Code and Checksum - what's the difference?

Tags:

language-agnostic

computer-science

hash

checksum

Richard Ev

People also ask

2 Answers

Zach Scrivena

Rafał Dowgird

Recent Activity

Donate For Us