This question is not about the well-known and documented fact that HashMap is not thread-safe, but about its specific failure modes in HotSpot and the JDK code. I am surprised by how readily this code fails with an NPE:
    public static void main(String[] args) {
        Map<Integer, Integer> m = new HashMap<>(0, 0.75f);
        IntStream.range(0, 5).parallel().peek(i -> m.put(i, i)).map(m::get).count();
    }
There is no mystery as to where the NPE comes from: the .map(m::get) step tries to unbox a null. It fails in about 4 out of 5 runs.
On my machine Runtime#availableProcessors() reports 8, so presumably the range of length 5 is split into 5 subtasks, each with just a single member. I also assume my code runs in interpreted mode. It might be calling into JIT-compiled HashMap or Stream methods, but the top level is interpreted, therefore precluding any variations where HashMap state is loaded into thread-local memory (registers/stack), thus delaying the observation of updates by another thread. If some of the five put operations don't literally execute at the same time on different cores, I don't expect them to destroy the HashMap's internal structure. The timing of the individual tasks must be extremely precise given the small amount of work.
Is it really the precise timing (commonPool's threads must be unparked), or is there another route to cause this to fail on Oracle/OpenJDK HotSpot? My current version is
    java version "1.8.0_72"
    Java(TM) SE Runtime Environment (build 1.8.0_72-b15)
    Java HotSpot(TM) 64-Bit Server VM (build 25.72-b15, mixed mode)
UPDATE: I find that even making just two insertions has a similarly high failure rate:
IntStream.range(0, 2).parallel().peek(i -> m.put(i, i)).map(m::get).count();
First, it’s not failing reliably. I managed to have some runs where no exception occurred. This, however, doesn’t imply that the resulting map is correct. It’s also possible that each thread witnesses its own value being successfully put, while the resulting map misses several mappings.
But indeed, failing with a NullPointerException happens quite often. I created the following debug code to illustrate the HashMap’s inner workings:
    static <K,V> void debugPut(HashMap<K,V> m, K k, V v) {
        if(m.isEmpty()) debug(m);
        m.put(k, v);
        debug(m);
    }
    private static <K, V> void debug(HashMap<K, V> m) {
        for(Field f: FIELDS) try {
            System.out.println(f.getName()+": "+f.get(m));
        } catch(ReflectiveOperationException ex) {
            throw new AssertionError(ex);
        }
        System.out.println();
    }
    static final Field[] FIELDS;
    static {
        String[] name = { "table", "size", "threshold" };
        Field[] f = new Field[name.length];
        for(int ix = 0; ix < name.length; ix++) try {
            f[ix] = HashMap.class.getDeclaredField(name[ix]);
        } catch(NoSuchFieldException ex) {
            throw new ExceptionInInitializerError(ex);
        }
        AccessibleObject.setAccessible(f, true);
        FIELDS = f;
    }
Using this with the simple sequential loop for(int i=0; i<5; i++) debugPut(m, i, i); printed:
    table: null
    size: 0
    threshold: 1

    table: [Ljava.util.HashMap$Node;@70dea4e
    size: 1
    threshold: 1

    table: [Ljava.util.HashMap$Node;@5c647e05
    size: 2
    threshold: 3

    table: [Ljava.util.HashMap$Node;@5c647e05
    size: 3
    threshold: 3

    table: [Ljava.util.HashMap$Node;@33909752
    size: 4
    threshold: 6

    table: [Ljava.util.HashMap$Node;@33909752
    size: 5
    threshold: 6
As you can see, due to the initial capacity of 0, three different backing arrays are created even during the sequential operation. Each time the capacity is increased, there is a higher chance that a racy concurrent put misses the array update and creates its own array.
This is especially relevant for the initial state of an empty map with several threads trying to put their first key, as all threads might encounter the initial state of a null table and create their own. Also, even when reading the state after a completed first put, a new array is created for the second put as well.
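That initial-table race can be sketched deterministically in a single thread (a hypothetical simulation, not the JDK's code; the local Node class and variable names merely stand in for HashMap's internals):

```java
public class InitialTableRace {
    // Stand-in for java.util.HashMap.Node (hypothetical, simplified)
    static class Node {
        int key, value;
        Node(int k, int v) { key = k; value = v; }
    }

    public static void main(String[] args) {
        Node[] table = null; // the shared field, initially null as with new HashMap<>(0, 0.75f)

        // Thread A reads the field, sees null, allocates its own array and inserts:
        Node[] seenByA = table;
        Node[] tableA = (seenByA == null) ? new Node[1] : seenByA;
        tableA[0] = new Node(0, 0);

        // Thread B reads the field before A's write becomes visible,
        // also sees null, and allocates a second, independent array:
        Node[] seenByB = table;
        Node[] tableB = (seenByB == null) ? new Node[1] : seenByB;
        tableB[0] = new Node(1, 1);

        // Whichever thread's store to the field lands last wins;
        // the other thread's entry is silently lost.
        table = tableA; // A publishes its array
        table = tableB; // B publishes later and overwrites it

        System.out.println(table == tableB); // true
        System.out.println(table[0].key);    // 1 — the mapping 0 -> 0 is gone
    }
}
```

A subsequent m.get(0) against such a table returns null, which is exactly what the unboxing in .map(m::get) turns into an NPE.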
But step-by-step debugging revealed even more chances of breaking:
Inside the method putVal, we see at the end:
    ++modCount;
    if (++size > threshold)
        resize();
    afterNodeInsertion(evict);
    return null;
In other words, after the successful insertion of a new key, the table will get resized if the new size exceeds the threshold. So on the first put, resize() is called at the beginning because the table is null, and since your specified initial capacity is 0, i.e. too low to store one mapping, the new capacity will be 1 and the new threshold will be 1 * loadFactor == 1 * 0.75f == 0.75f, truncated to 0. So right at the end of the first put, the new threshold is exceeded and another resize() operation is triggered. So with an initial capacity of 0, the first put already creates and populates two arrays, which gives much higher chances of breaking if multiple threads perform this action concurrently, all encountering the initial state.
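The threshold arithmetic can be checked directly with plain Java (the variable names below mirror the constants the paragraph describes; this is an illustration, not the JDK source):

```java
public class ThresholdMath {
    public static void main(String[] args) {
        float loadFactor = 0.75f;

        // First resize(): the table is null and the requested capacity 0
        // is bumped to 1; the int cast truncates 0.75f down to 0.
        int newCap = 1;
        int newThreshold = (int) (newCap * loadFactor);
        System.out.println(newThreshold); // 0

        // After inserting the first entry, size == 1 > threshold == 0,
        // so resize() runs again: the capacity doubles to 2.
        int secondCap = newCap << 1;
        int secondThreshold = (int) (secondCap * loadFactor); // 1.5f truncated to 1
        System.out.println(secondCap + " " + secondThreshold); // 2 1
    }
}
```

This matches the debug output above: after the first put, the threshold is already 1, and the second put triggers yet another resize.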
And there is another point. Looking into the resize() operation, we see the lines:
    @SuppressWarnings({"rawtypes","unchecked"})
    Node<K,V>[] newTab = (Node<K,V>[])new Node[newCap];
    table = newTab;
    if (oldTab != null) {
        … (transfer old contents to new array)
In other words, the new array reference is stored into the heap before it has been populated with the old entries, so even without reordering of reads and writes, there is a chance that another thread reads that reference without seeing the old entries, including the one it wrote itself previously. Actually, optimizations reducing the heap access may lower the chances of a thread not seeing its own update in an immediately following query.
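For comparison, the same pipeline over a ConcurrentHashMap (one possible fix; an externally synchronized map is another) completes without losing mappings or returning null. Note the terminal operation is sum() rather than count(), since on JDK 9+ count() may skip a size-preserving pipeline and never run the peek side effect:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.stream.IntStream;

public class SafeVariant {
    public static void main(String[] args) {
        Map<Integer, Integer> m = new ConcurrentHashMap<>();
        int sum = IntStream.range(0, 5)
                .parallel()
                .peek(i -> m.put(i, i))
                // Never null: each thread reads back the key it just put in the
                // same thread, and ConcurrentHashMap publishes entries safely.
                .map(m::get)
                .sum(); // forces the pipeline to run; 0+1+2+3+4
        System.out.println(m.size()); // 5
        System.out.println(sum);      // 10
    }
}
```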
Still, it must also be noted that the assumption that everything runs interpreted here is not founded. Since HashMap is used internally by the JRE as well, even before your application starts, there is also a chance of encountering already compiled code when using HashMap.