How Do Hashes Work in Programming?

Tags:

How do hashes work in programming? How I think of a hash is something that allows me the ability to use some unique value to retrieve some data. Like if we have an array and I start to put things in the array, if I have another variable that keeps track of what item is in slot 0,1,2... then I have that instant ability to find an item. Is that hashing?

What is the purpose of a hash?

When should a hash be implemented? What's a hash similar to in terms of data structure?

What I think I know about hashes is that it allows us the ability to retrieve the item within O(1). Is that correct?

202

asked Jan 09 '11 03:01

RoR

1 Answers

A hash is like a person's first name -- it's a short way of remembering a person, even though it doesn't have to be unique. If you need to find some information about someone, you can just look them up by their name, and you only need to perform other checks if two or more people have the same name.

That's the power of hashing, and just as remembering people is much easier by name than by Social Security Number, finding an object by its hash code is much easier than actually comparing the object to everything already in your collection.

Now, in this example, if you're looking someone up in a phone book by name, you'd probably find them in O(log n) time, because the names are sorted alphabetically, and because you need to do a binary search. If, however, you instead "hashed" 100 people born in the 1900s by their years of birth, then you'd only need at most 4 comparisons in the hashtable/phonebook (one per digit) to find any one year by hash, which is constant time. Then, if two people are born in the same year, you can use other information to find the person you need, and on average, if your table isn't too full (say, if you have at most 50 people for 100 different years of birth), your lookups will be constant-time.

(If your table gets more than, say, 50% full, you can always double its size, to keep the number of collisions low and hence to keep your lookups fast.)

More information:

If you've ever heard of ~~MD5 or SHA-1~~ SHA-2 hashes for files, they're like the "fingerprints" of the file. While it's possible to have two files with the same hash, this is made so unlikely that, for practical purposes, it's impossible; hence, if you have the hash of two files, you can compare the files by their fingerprints rather than by their data, which is immensely faster.

151

answered Oct 01 '22 19:10

user541686

Related questions
                            
                                Create hash from string and int
                            
                                Why does Hash.new({}) hide hash members? [duplicate]
                            
                                Why generated MD5 hash in sql server are not equal? [duplicate]
                            
                                How does c# figure out the hash code for an object?
                            
                                One-to-one integer mapping function
                            
                                How to hash strings into a float in [0:1]?
                            
                                Fast Cross-Platform C/C++ Hashing Library [closed]
                            
                                Ruby hash equivalent to Python dict setdefault
                            
                                Oracle STANDARD_HASH not available in PLSQL?
                            
                                Is it faster to search for a large string in a DB by its hashcode?
                            
                                How to determine a strings dna for likeness to another
                            
                                Ruby Inserting Key, Value elements in Hash
                            
                                Why is the SlowEquals function important to compare hashed passwords?
                            
                                Cache Performance in Hash Tables with Chaining vs Open Addressing
                            
                                Are the first 32 bits of an md5 hash just as "random" as any other substring?
                            
                                Why does the default Object.toString() return a hex representation of the hashCode?
                            
                                Marshal ruby hash with default proc - remove the default proc?
                            
                                How does a reduction function used with rainbow tables work?
                            
                                How secure is storing salts along with hashed password
                            
                                How to create simple short hash value? C#

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How Do Hashes Work in Programming?

Tags:

hash

programming-languages

RoR

People also ask

1 Answers

user541686

Recent Activity

Donate For Us