Why does a HashMap rehash the hashcode supplied by the key object?

Tags:

I am reading the code of the HashMap class provided by the Java 1.6 API and unable to fully understand the need of the following operation (found in the body of put and get methods):

Click to copy

int hash = hash(key.hashCode());

where the method hash() has the following body:

Click to copy

 private static int hash(int h) {
         h ^= (h >>> 20) ^ (h >>> 12);
    return h ^ (h >>> 7) ^ (h >>> 4);
}

This effectively recalculates the hash by executing bit operations on the supplied hashcode. I'm unable to understand the need to do so even though the API states it as follows:

This is critical because HashMap uses power-of-two length hash tables, that otherwise encounter collisions for hashCodes that do not differ in lower bits.

I do understand that the key value pars are stored in an array of data structures, and that the index location of an item in this array is determined by its hash. What I fail to understand is how would this function add any value to the hash distribution.

481

asked Mar 29 '10 13:03

VGDIV

2 Answers

As Helper wrote, it is there just in case the existing hash function for the key objects is faulty and does not do a good-enough job of mixing the lower bits. According to the source quoted by pgras,

Click to copy

 /**
  * Returns index for hash code h.
  */
 static int indexFor(int h, int length) {
     return h & (length-1);
 }

The hash is being ANDed in with a power-of-two length (therefore, length-1 is guaranteed to be a sequence of 1s). Due to this ANDing, only the lower bits of h are being used. The rest of h is ignored. Imagine that, for whatever reason, the original hash only returns numbers divisible by 2. If you used it directly, the odd-numbered positions of the hashmap would never be used, leading to a x2 increase in the number of collisions. In a truly pathological case, a bad hash function can make a hashmap behave more like a list than like an O(1) container.

Sun engineers must have run tests that show that too many hash functions are not random enough in their lower bits, and that many hashmaps are not large enough to ever use the higher bits. Under these circumstances, the bit operations in HashMap's hash(int h) can provide a net improvement over most expected use-cases (due to lower collision rates), even though extra computation is required.

143

answered Nov 15 '22 17:11

tucuxi

I somewhere read this is done to ensure a good distribution even if your hashCode implementation, well, err, sucks.

answered Nov 15 '22 17:11

helpermethod

Related questions
                            
                                UnknownHostException: name or service not known
                            
                                Filtering upwards path traversal in Java (or Scala) [closed]
                            
                                What exactly is getGlobalVisibleRect()?
                            
                                Exception 0xC0000005 from JNI_CreateJavaVM (jvm.dll)
                            
                                Can JavaFX be used on Raspberry Pi
                            
                                Is Java 8 findFirst().isPresent() more efficient than count() > 0?
                            
                                Integration test with TestRestTemplate for Multipart POST request returns 400
                            
                                Gradle application plugin with multiple main classes
                            
                                Round Corners in java fx pane
                            
                                Reactive java method hide()
                            
                                Kafka Producer NetworkException and Timeout Exceptions
                            
                                Why does inheritance behave differently in Java and C++ with superclasses calling (or not) subclasses' methods?
                            
                                Disabling "Download sources and javadoc" in eclipse
                            
                                Should I never use primitive types again?
                            
                                How can I call Perl from Java?
                            
                                Compare JSF implementations [closed]
                            
                                How resource intensive are Listeners in java?
                            
                                How do you bind a language (python, for example) to another (say, C++)?
                            
                                JSF Managed Bean auto-create?
                            
                                Zookeeper/Chubby -vs- MySql NDB

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does a HashMap rehash the hashcode supplied by the key object?

Tags:

java

collections

hashmap

hashcode

hash

VGDIV

People also ask

2 Answers

tucuxi

helpermethod

Recent Activity

Donate For Us