 

Java: A "prime" number or a "power of two" as HashMap size?

Many books and tutorials say that the size of a hash table must be a prime to evenly distribute the keys in all the buckets. But Java's HashMap always uses a size that is a power of two. Shouldn't it be using a prime? What's better, a "prime" or a "power of two" as the hash table size?

asked Mar 15 '13 by Nikunj Banka

People also ask

Why HashMap capacity is power of 2?

So, basically the point is: if the size is a power of two, the bucket index can be computed with a cheap bit mask and the keys will be distributed evenly across the array with minimal collisions, leading to better retrieval performance (and also less synchronization in the case of ConcurrentHashMap) than with a size that is not a power of 2.
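A minimal sketch (not the actual JDK source) of why a power-of-two capacity is convenient: the bucket index can be computed with a single bitwise AND instead of a modulo operation.

```java
// Sketch: with a power-of-two capacity, (hash & (capacity - 1)) is
// equivalent to a non-negative modulo, but much cheaper.
public class IndexDemo {
    public static void main(String[] args) {
        int capacity = 16; // power of two
        int hash = "someKey".hashCode();
        int index = hash & (capacity - 1);
        // Same result as Math.floorMod(hash, capacity), even for negative hashes.
        System.out.println(index == Math.floorMod(hash, capacity)); // prints "true"
    }
}
```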

Does hashing use prime numbers?

Primes are used because a typical hash function built from polynomials modulo P has a good chance of producing a unique value. Say you use such a hash function for strings of length <= N and you get a collision: that means two different polynomials produce the same value modulo P.

How HashMap increases its size in Java?

As the number of elements in the HashMap increases, the capacity is expanded. The load factor is the measure that decides when to increase the capacity of the Map. The default load factor is 75% of the capacity. The threshold of a HashMap is approximately the product of current capacity and load factor.
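The resize arithmetic described above can be sketched with HashMap's documented defaults (initial capacity 16, load factor 0.75):

```java
// Sketch of HashMap's resize threshold arithmetic with default settings.
public class ThresholdDemo {
    public static void main(String[] args) {
        int capacity = 16;
        float loadFactor = 0.75f;
        int threshold = (int) (capacity * loadFactor);
        System.out.println(threshold); // 12: a resize is triggered once size exceeds this
        capacity <<= 1;                // on resize, capacity doubles and stays a power of two
        System.out.println(capacity);  // 32
    }
}
```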

Why is prime number preferred as the size of hash table?

Primes are famously divisible only by 1 and themselves, so a prime table length does not share factors with patterns in the hash values. Thus, setting your hash table length to a large prime number reduces the occurrence of collisions, particularly when the hash function is of poor quality.


2 Answers

Using a power of two as the table size effectively masks out the top bits of the hash code, since only the low bits select the bucket. A poor-quality hash function can therefore perform particularly badly in this scenario.
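A small illustration of this pitfall: two hypothetical hash codes that differ only in their upper bits collide in a 16-slot table, because masking discards everything above the low 4 bits.

```java
// Hypothetical hash codes differing only in the upper bits:
public class MaskPitfall {
    public static void main(String[] args) {
        int capacity = 16;
        int h1 = 0x10000;
        int h2 = 0x20000;
        // Masking keeps only the low 4 bits, which are identical (zero):
        System.out.println(h1 & (capacity - 1)); // 0
        System.out.println(h2 & (capacity - 1)); // 0 -- both land in bucket 0
    }
}
```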

Java's HashMap mitigates this by mistrusting the object's hashCode() implementation and applying a second level of hashing to its result:

Applies a supplemental hash function to a given hashCode, which defends against poor quality hash functions. This is critical because HashMap uses power-of-two length hash tables, that otherwise encounter collisions for hashCodes that do not differ in lower bits.
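In recent JDKs (Java 8 and later) this supplemental step is essentially a bit-spreading XOR. The snippet below is a simplified sketch of that idea, not a verbatim copy of the JDK source:

```java
// Simplified sketch of the bit-spreading step used by modern HashMap:
// XOR the high 16 bits into the low 16 bits, so hashCodes that differ
// only in their upper bits still produce different bucket indices.
public class SpreadDemo {
    static int spread(int h) {
        return h ^ (h >>> 16);
    }

    public static void main(String[] args) {
        int capacity = 16;
        int h1 = 0x10000, h2 = 0x20000; // differ only above the low 16 bits
        System.out.println(spread(h1) & (capacity - 1)); // 1
        System.out.println(spread(h2) & (capacity - 1)); // 2 -- collision resolved
    }
}
```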

If you have a good hash function, or do something similar to what HashMap does, it does not matter whether the table size is prime, a power of two, or anything else.

If, on the other hand, the hash function is of unknown or poor quality, then using a prime number would be a safer bet. It will, however, make dynamically-sized tables trickier to implement, since all of a sudden you need to be able to produce prime numbers instead of just multiplying the size by a constant factor.
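A sketch of what the prime-sized alternative described above might look like; `indexFor`, `isPrime`, and `nextPrime` are illustrative helper names, not part of any standard library:

```java
// Illustrative prime-sized table helpers (assumed names, not JDK API):
public class PrimeSized {
    // Index modulo a prime capacity; floorMod handles negative hash codes.
    static int indexFor(int hash, int primeCapacity) {
        return Math.floorMod(hash, primeCapacity);
    }

    static boolean isPrime(int n) {
        if (n < 2) return false;
        for (int i = 2; (long) i * i <= n; i++)
            if (n % i == 0) return false;
        return true;
    }

    // Growing means searching for the next prime, not just doubling.
    static int nextPrime(int n) {
        while (!isPrime(n)) n++;
        return n;
    }

    public static void main(String[] args) {
        // Grow a 17-slot table to at least twice its size:
        System.out.println(nextPrime(2 * 17)); // 37
    }
}
```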

answered Sep 20 '22 by NPE


The standard HashMap implementation has a hash method which re-hashes your object's hashCode() result to avoid that pitfall. The comment before the hash() method reads:

/**
 * Retrieve object hash code and applies a supplemental hash function to the
 * result hash, which defends against poor quality hash functions.  This is
 * critical because HashMap uses power-of-two length hash tables, that
 * otherwise encounter collisions for hashCodes that do not differ
 * in lower bits. Note: Null keys always map to hash 0, thus index 0.
 */
answered Sep 17 '22 by assylias