Understanding strange Java hash function

Tags:

Following is the source code for a hash function in java.util.HashMap. The comments explain well enough what it's accomplishing. but how? What are the ^ and >>> operators doing? Can someone explain how the code actually does what the comments say?

/**  * Applies a supplemental hash function to a given hashCode, which  * defends against poor quality hash functions.  This is critical  * because HashMap uses power-of-two length hash tables, that  * otherwise encounter collisions for hashCodes that do not differ  * in lower bits. Note: Null keys always map to hash 0, thus index 0.  */ static int hash(int h) {     // This function ensures that hashCodes that differ only by     // constant multiples at each bit position have a bounded     // number of collisions (approximately 8 at default load factor).      h ^= (h >>> 20) ^ (h >>> 12);     return h ^ (h >>> 7) ^ (h >>> 4); }

439

asked Feb 17 '12 20:02

calebds

2 Answers

Here is some code and the sample output:

public static void main ( String[] args ) {     int h = 0xffffffff;     int h1 = h >>> 20;     int h2 = h >>> 12;     int h3 = h1 ^ h2;     int h4 = h ^ h3;     int h5 = h4 >>> 7;     int h6 = h4 >>> 4;     int h7 = h5 ^ h6;     int h8 = h4 ^ h7;      printBin ( h );     printBin ( h1 );     printBin ( h2 );     printBin ( h3 );     printBin ( h4 );     printBin ( h5 );     printBin ( h6 );     printBin ( h7 );     printBin ( h8 );  }  static void printBin ( int h ) {     System.out.println ( String.format ( "%32s",          Integer.toBinaryString ( h ) ).replace ( ' ', '0' ) ); }

Which prints:

11111111111111111111111111111111 00000000000000000000111111111111 00000000000011111111111111111111 00000000000011111111000000000000 11111111111100000000111111111111 00000001111111111110000000011111 00001111111111110000000011111111 00001110000000001110000011100000 11110001111100001110111100011111

So, the code breaks down the hash function into steps so that you can see what is happening. The first shift of 20 positions xor with the second shift of 12 positions creates a mask that can flip 0 or more of the bottom 20 bits of the int. So you can get some randomness inserted into the bottom bits that makes use of the potentially better distributed higher bits. This is then applied via xor to the original value to add that randomness to the lower bits. The second shift of 7 positions xor the shift of 4 positions creates a mask that can flip 0 or more of the bottom 28 bits, which brings some randomness again to the lower bits and to some of the more significant ones by capitalizing on the prior xor which already addressed some of the distribution at the lower bits. The end result is a smoother distribution of bits through the hash value.

Since the hashmap in java computes the bucket index by combining the hash with the number of buckets you need to have an even distribution of the lower bits of the hash value to spread the entries evenly into each bucket.

As to proving the statement that this bounds the number of collisions, that one I don't have any input on. Also, see here for some good information on building hash functions and a few details on why the xor of two numbers tends towards random distribution of bits in the result.

181

answered Oct 06 '22 09:10

philwb

>>> is a bitshift with zero fill.

^ is an XOR.

XOR is also called exclusive or--it is a math operator that combines two numbers. See http://en.wikipedia.org/wiki/Exclusive_or

A right bitshift by n is like dropping the n lowest bits off of the number. So if the number is 00010111, and you shifted it right by 1, you'd get 00001011.

answered Oct 06 '22 08:10

StilesCrisis

Related questions
                            
                                HashCodeBuilder and EqualsBuilder usage style
                            
                                What does addScalar do?
                            
                                Unsupported major.minor version 51.0 but everything is set to JDK 1.6
                            
                                Mockito - separately verifying multiple invocations on the same method
                            
                                Cannot deserialize value of type `java.util.Date` from String
                            
                                How to use Hibernate @Any-related annotations?
                            
                                What is the preferred Throwable to use in a private utility class constructor?
                            
                                Launching activities within a tab in Android
                            
                                Quickly create class from an interface in eclipse
                            
                                Why does Java's URL class not recognize certain protocols?
                            
                                Maven - how to include empty directories
                            
                                Does Java's for-each call an embedded method (that returns the collection) for every iteration?
                            
                                Select MAX timestamp with JPA2 Criteria API
                            
                                How can I disable "Initialize Java Tooling" on Eclipse startup?
                            
                                Are recursive methods always better than iterative methods in Java? [closed]
                            
                                What are the default Akka dispatcher configuration values?
                            
                                How to do in-query in jDBI?
                            
                                Misplaced argument matcher detected here. You cannot use argument matchers outside of verification or stubbing in Mockito
                            
                                How to properly convert domain entities to DTOs while considering scalability & testability
                            
                                How can I convert an Object to Inputstream

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Understanding strange Java hash function

Tags:

java

hash

calebds

People also ask

2 Answers

philwb

StilesCrisis

Recent Activity

Donate For Us