Why does HashMap require that the initial capacity be a power of two?

I was going through Java's HashMap source code when I saw the following:

    // The default initial capacity - MUST be a power of two.
    static final int DEFAULT_INITIAL_CAPACITY = 16;

My question is: why does this requirement exist in the first place? I also see that the constructor which allows creating a HashMap with a custom capacity rounds it up to a power of two:

    int capacity = 1;
    while (capacity < initialCapacity)
        capacity <<= 1;

Why does the capacity always have to be a power of two?

Also, when automatic rehashing is performed, what exactly happens? Is the hash function altered too?

656

asked Dec 02 '11 06:12

Sushant

2 Answers

The map has to work out which internal table index to use for any given key, mapping any int value (could be negative) to a value in the range [0, table.length). When table.length is a power of two, that can be done really cheaply - and is, in indexFor:

    static int indexFor(int h, int length) {
        return h & (length - 1);
    }

With a different table length, you'd need to compute a remainder and make sure it's non-negative. This is definitely a micro-optimization, but probably a valid one :)
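To make the difference concrete, here is a minimal sketch comparing the mask with a remainder-based alternative. The class name IndexDemo and the method indexForAnyLength are made up for illustration and are not part of HashMap; Math.floorMod is the standard library call (Java 8 and later) that returns a non-negative result for a positive divisor.

```java
final class IndexDemo {
    // Same expression as HashMap's indexFor: only valid when length is a power of two.
    static int indexFor(int h, int length) {
        return h & (length - 1);
    }

    // Hypothetical alternative for an arbitrary positive table length.
    // Plain h % length is negative for negative h, so floorMod is used instead.
    static int indexForAnyLength(int h, int length) {
        return Math.floorMod(h, length);
    }

    public static void main(String[] args) {
        int h = -7;
        System.out.println(h % 16);                   // -7: not usable as an array index
        System.out.println(indexFor(h, 16));          // 9
        System.out.println(indexForAnyLength(h, 16)); // 9
        System.out.println(indexForAnyLength(h, 17)); // 10
    }
}
```

For a power-of-two length the two methods agree, but the mask needs a single AND instruction where the remainder needs a division plus a sign correction.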

"Also, when automatic rehashing is performed, what exactly happens? Is the hash function altered too?"

It's not quite clear to me what you mean. The same hash codes are used (because they're just computed by calling hashCode on each key) but they'll be distributed differently within the table due to the table length changing. For example, when the table length is 16, hash codes of 5 and 21 both end up being stored in table entry 5. When the table length increases to 32, they will be in different entries.
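The example with 5 and 21 can be checked with a few lines of code. This is only a sketch: ResizeDemo is a made-up class name, and indexFor is copied from the snippet above rather than called on a real HashMap.

```java
final class ResizeDemo {
    // Same masking expression as HashMap's indexFor.
    static int indexFor(int h, int length) {
        return h & (length - 1);
    }

    public static void main(String[] args) {
        int[] hashCodes = {5, 21};
        for (int length : new int[] {16, 32}) {
            for (int h : hashCodes) {
                System.out.println("length " + length + ", hash " + h
                        + " -> index " + indexFor(h, length));
            }
        }
        // length 16, hash 5 -> index 5
        // length 16, hash 21 -> index 5
        // length 32, hash 5 -> index 5
        // length 32, hash 21 -> index 21
    }
}
```

Doubling the length adds one more bit to the mask, so an entry either keeps its index or moves up by the old table length (here 5 + 16 = 21).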

115

answered Sep 25 '22 06:09

Jon Skeet

The ideal situation is actually using prime number sizes for the backing array of a HashMap. That way your keys will be more naturally distributed across the array. However, this requires a modulo (remainder) operation, which is considerably slower than a bit mask. In a sense, a power of two is the worst table size you can imagine: only the low-order bits of the hash code select the bucket, so poor hashCode implementations are more likely to produce key collisions in the array.

Therefore you'll find another very important method in Java's HashMap implementation, hash(int), which compensates for poor hash codes by mixing the higher-order bits into the lower-order ones before the index is computed.
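A small sketch of why such a mixing step matters with power-of-two tables. Note the assumptions: SpreadDemo and spread are hypothetical names, and spread is a simplified stand-in (XOR the upper 16 bits into the lower 16), not the exact body of hash(int) from the HashMap version quoted in the question. The keys are simulated by hash codes that differ only in their high bits.

```java
import java.util.HashSet;
import java.util.Set;

final class SpreadDemo {
    // Same masking expression as HashMap's indexFor.
    static int indexFor(int h, int length) {
        return h & (length - 1);
    }

    // Illustrative stand-in for a supplemental hash: folds the high bits into the low bits.
    static int spread(int h) {
        return h ^ (h >>> 16);
    }

    // Counts how many of the 16 buckets are hit by 16 "poor" hash codes.
    static int distinctBuckets(boolean useSpread) {
        Set<Integer> buckets = new HashSet<>();
        for (int i = 0; i < 16; i++) {
            int h = i << 16; // poor hash codes: the low 16 bits are always zero
            buckets.add(indexFor(useSpread ? spread(h) : h, 16));
        }
        return buckets.size();
    }

    public static void main(String[] args) {
        System.out.println(distinctBuckets(false)); // 1: every key lands in bucket 0
        System.out.println(distinctBuckets(true));  // 16: one key per bucket
    }
}
```

Without the mixing step all sixteen keys collide in a single bucket, because the mask never looks at the bits in which they differ.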

answered Sep 23 '22 06:09

M Platvoet

Tags: java, hashmap, hashtable, hash
