Java lambdas, stateless lambdas and parallel execution

Tags: java, lambda, parallel-processing, java-8

In trying to learn Java lambdas, I came across an article (linked below) in which, in a section on the limitations of the Stream API, the author states: "Stateful lambdas are usually not a problem when executing sequentially, but when the stream execution is parallelized, it breaks". He then gives this code as an example of problems due to execution order:

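List<String> ss = ...;
List<String> result = ...;

Stream<String> stream = ss.stream();

stream.map(s -> {
    // Stateful lambda: it reads and mutates the shared list 'result'.
    synchronized (result) {
        if (result.size() < 10) {
            result.add(s);
        }
    }
    return s; // map() needs a result; the element is passed through unchanged
})
.forEach(e -> { });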

I can see how this would be non-deterministic if it were parallelized, but what I can't see is how you would fix this with stateless lambdas. Isn't there something inherently non-deterministic about adding things to a list in a parallel fashion? An example that a six year old in a hat could understand, perhaps in C#, would be much appreciated.

Link to original article http://blog.hartveld.com/2013/03/jdk-8-33-stream-api.html

asked May 01 '14 15:05

John Powell

1 Answer

I think I see what you are getting at with your question, and I will do my best to explain.

Consider an input list consisting of 8 elements:

[1, 2, 3, 4, 5, 6, 7, 8]

Assume that streams parallelize it in the following way. In reality they do not; the exact splitting process is considerably more involved.
For now, though, assume that they halve the input until chunks of two elements remain.

The branching division would look like this:

First division:

[1, 2, 3, 4]
[5, 6, 7, 8]

Second division:

[1, 2]
[3, 4]
[5, 6]
[7, 8]

Now we have four chunks that will (in our theory) be processed by four different threads, which have no knowledge of each other.
A stateful lambda can indeed be made to work by synchronizing on some external resource, but then you lose most of the benefit of parallelization. So assume that we do not synchronize: the threads then have no guarantee of seeing what the other threads have done, and our result will be garbage.
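
To make the stateful case concrete, here is a minimal sketch of the same idea as the question's snippet, run on a parallel stream. The class and method names (StatefulLambdaDemo, firstTenStateful) are made up for illustration. With the lock in place the list always ends up with ten elements, but which ten, and in what order, may differ from run to run.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

class StatefulLambdaDemo {

    // Hypothetical helper: keeps at most ten elements using a stateful lambda,
    // mirroring the shape of the code in the question.
    static List<Integer> firstTenStateful(List<Integer> source) {
        List<Integer> result = new ArrayList<>();
        source.parallelStream().forEach(s -> {
            // The lock keeps the ArrayList consistent, but it also forces
            // the worker threads to take turns at this point.
            synchronized (result) {
                if (result.size() < 10) {
                    result.add(s);
                }
            }
        });
        return result;
    }

    public static void main(String[] args) {
        List<Integer> source = IntStream.rangeClosed(1, 1000)
                                        .boxed()
                                        .collect(Collectors.toList());
        // Always ten elements; their identity and order are not deterministic.
        System.out.println(firstTenStateful(source));
    }
}
```

Removing the synchronized block would make this a data race on a plain ArrayList, which is the "garbage" case described above.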

Now on to the part of the question about statelessness: how can the stream then be processed correctly in parallel? How can you add elements that are processed in parallel to a list in the correct order?

First assume a simple mapping function where you map with the lambda i -> i + 10, and then print each result with System.out::println in a forEach.

Now after the second division the following will occur:

[1, 2] -> [11, 12] -> { System.out.println(11); System.out.println(12); }
[3, 4] -> [13, 14] -> { System.out.println(13); System.out.println(14); }
[5, 6] -> [15, 16] -> { System.out.println(15); System.out.println(16); }
[7, 8] -> [17, 18] -> { System.out.println(17); System.out.println(18); }

There is no guarantee on the order across chunks. Elements handled by the same thread happen to be processed in order, but that is internal behavior and should not be relied upon.

If you want to process them in order, you need to use forEachOrdered, which performs the action on the elements in encounter order. You do not lose too much of the parallelization benefit, because the ordering applies only to the final step.
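
As a small sketch of the difference, the following maps in parallel and then visits the results with forEachOrdered. The class and method names are made up for illustration, and the ArrayList is only there to make the visiting order observable. Adding to it is safe here because forEachOrdered processes elements one at a time, with each action happening-before the next.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class ForEachOrderedDemo {

    // Maps each element with i -> i + 10 on a parallel stream, then records
    // the results in the order forEachOrdered visits them.
    static List<Integer> mapThenVisitInOrder(List<Integer> input) {
        List<Integer> seen = new ArrayList<>();
        input.parallelStream()
             .map(i -> i + 10)
             .forEachOrdered(seen::add);
        return seen;
    }

    public static void main(String[] args) {
        List<Integer> input = Arrays.asList(1, 2, 3, 4, 5, 6, 7, 8);

        // forEach on a parallel stream: the print order may vary between runs.
        input.parallelStream().map(i -> i + 10).forEach(System.out::println);

        // forEachOrdered: prints [11, 12, 13, 14, 15, 16, 17, 18]
        System.out.println(mapThenVisitInOrder(input));
    }
}
```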

To see how items can be added to a list in parallel, look at Collectors.toList(), which provides functions for:

Creating a new list.
Adding a value to the list.
Merging two lists.
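
Those three functions can be spelled out by hand with the three-argument Stream.collect, which takes a supplier, an accumulator and a combiner. This is only a sketch of the idea, not the actual source of Collectors.toList(), and the names newList, addValue and mergeLists are made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BiConsumer;
import java.util.function.Supplier;

class CollectDemo {

    static List<Integer> mapAndCollect(List<Integer> input) {
        // Creating a new list (called once per chunk).
        Supplier<List<Integer>> newList = ArrayList::new;
        // Adding a value to a chunk's own list.
        BiConsumer<List<Integer>, Integer> addValue = List::add;
        // Merging two lists: the right one is appended to the left one.
        BiConsumer<List<Integer>, List<Integer>> mergeLists = List::addAll;

        return input.parallelStream()
                    .map(i -> i + 10)   // stateless: depends only on i
                    .collect(newList, addValue, mergeLists);
    }

    public static void main(String[] args) {
        // prints [11, 12, 13, 14, 15, 16, 17, 18]
        System.out.println(mapAndCollect(Arrays.asList(1, 2, 3, 4, 5, 6, 7, 8)));
    }
}
```

None of the three functions touches state shared between threads: each chunk gets its own list, and lists only meet in the merge step.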

Now the following will happen after the second division:

Each of the four threads will do the following (only one thread is shown here):

We had [1, 2].
We map it to [11, 12].
We create an empty List<Integer>.
We add 11 to the list.
We add 12 to the list.

Now all threads have done this, and we have four lists of two elements.

Now the following merges occur in the specified order:

[11, 12] ++ [13, 14] = [11, 12, 13, 14]
[15, 16] ++ [17, 18] = [15, 16, 17, 18]
Finally [11, 12, 13, 14] ++ [15, 16, 17, 18] = [11, 12, 13, 14, 15, 16, 17, 18]
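
The split and merge steps above can be mimicked with a toy recursive function. To be clear, this is a sequential model of the simplified picture in this answer (halve until at most two elements remain, then merge neighbours left to right); it is not how the Stream library actually splits work, and the class name is made up.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class SplitAndMergeSketch {

    // Toy model: halve the input until a chunk has at most two elements,
    // build one list per chunk, then merge the partial lists in order.
    static List<Integer> process(List<Integer> chunk) {
        if (chunk.size() <= 2) {
            List<Integer> partial = new ArrayList<>(); // create an empty list
            for (Integer i : chunk) {
                partial.add(i + 10);                   // map, then add
            }
            return partial;
        }
        int mid = chunk.size() / 2;
        List<Integer> left = process(chunk.subList(0, mid));
        List<Integer> right = process(chunk.subList(mid, chunk.size()));
        left.addAll(right);                            // merge: left ++ right
        return left;
    }

    public static void main(String[] args) {
        // prints [11, 12, 13, 14, 15, 16, 17, 18]
        System.out.println(process(Arrays.asList(1, 2, 3, 4, 5, 6, 7, 8)));
    }
}
```

Because the left result always goes in front of the right result, the final order does not depend on which half finishes first.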

And thus the resulting list is in order and the mapping has been done in parallel. You should now also be able to see why parallelization needs a higher minimum chunk size than just two items: otherwise creating and merging the new lists becomes too expensive.

I hope you now understand why stream operations should be stateless to get the full benefits of parallelization.
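
Coming back to the code in the question, one possible stateless rewrite is sketched below. It assumes the intent was "keep the first ten elements"; the article does not say so explicitly, and the class and method names are made up. On an ordered stream, limit(10) followed by collect gives the first ten elements in encounter order, whether or not the stream is parallel.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;

class StatelessFirstTen {

    // No shared mutable state: the stream itself limits and collects.
    static List<String> firstTen(List<String> ss) {
        return ss.parallelStream()
                 .limit(10)
                 .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> ss = new ArrayList<>();
        for (int i = 0; i < 100; i++) {
            ss.add("s" + i);
        }
        // prints [s0, s1, s2, s3, s4, s5, s6, s7, s8, s9]
        System.out.println(firstTen(ss));
    }
}
```

Note that limit can be relatively expensive on ordered parallel pipelines, so this shows correctness rather than speed.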

answered Oct 13 '22 19:10

skiwi
