Running the following stream example in Java8:
System.out.println(Stream.of("a", "b", "c", "d", "e", "f")
        .reduce("", (s1, s2) -> s1 + "/" + s2));
yields:
/a/b/c/d/e/f
Which is, of course, no surprise. According to http://docs.oracle.com/javase/8/docs/api/index.html?overview-summary.html it shouldn't matter whether the stream is executed sequentially or in parallel:
Except for operations identified as explicitly nondeterministic, such as findAny(), whether a stream executes sequentially or in parallel should not change the result of the computation.
AFAIK reduce() is deterministic and (s1, s2) -> s1 + "/" + s2 is associative, so adding parallel() should yield the same result:
System.out.println(Stream.of("a", "b", "c", "d", "e", "f")
        .parallel()
        .reduce("", (s1, s2) -> s1 + "/" + s2));
However the result on my machine is:
/a//b//c//d//e//f
What's wrong here?
BTW: using (the preferred) .collect(Collectors.joining("/")) instead of reduce(...) yields the same result, a/b/c/d/e/f, for both sequential and parallel execution.
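To make that comparison concrete, here is a minimal, self-contained sketch (the class name is just for illustration) showing that Collectors.joining produces the same output sequentially and in parallel:

```java
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class JoiningDemo {
    public static void main(String[] args) {
        // Sequential: joining inserts the separator only *between* elements.
        String sequential = Stream.of("a", "b", "c", "d", "e", "f")
                .collect(Collectors.joining("/"));

        // Parallel: joining merges partial results correctly,
        // so the output is identical.
        String parallel = Stream.of("a", "b", "c", "d", "e", "f")
                .parallel()
                .collect(Collectors.joining("/"));

        System.out.println(sequential); // a/b/c/d/e/f
        System.out.println(parallel);   // a/b/c/d/e/f
    }
}
```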
JVM details:
java.specification.version: 1.8
java.version: 1.8.0_31
java.vm.version: 25.31-b07
java.runtime.version: 1.8.0_31-b13
Parallel streams divide the provided task into many subtasks and run them in different threads, utilizing multiple cores of the machine. Sequential streams, on the other hand, work like a for-loop on a single core.
That said, parallel streams can actually slow you down. A parallel stream has much higher overhead than a sequential one: coordinating the threads takes a significant amount of time. Sequential streams are the sensible default unless there is a measured performance problem to be addressed.
From reduce's documentation:
The identity value must be an identity for the accumulator function. This means that for all t, accumulator.apply(identity, t) is equal to t.
Which is not true in your case: accumulator.apply("", "a") returns "/a", not "a".
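A minimal sketch (class name is just for illustration) that checks the identity law directly for this accumulator:

```java
import java.util.function.BinaryOperator;

public class IdentityCheck {
    public static void main(String[] args) {
        BinaryOperator<String> accumulator = (s1, s2) -> s1 + "/" + s2;

        // The identity law requires accumulator.apply(identity, t) to equal t
        // for every t. With identity "" and t = "a" it does not hold:
        String result = accumulator.apply("", "a");
        System.out.println(result);              // /a
        System.out.println(result.equals("a"));  // false -> "" is not an identity
    }
}
```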
I have extracted the accumulator function and added a printout to show what happens:
BinaryOperator<String> accumulator = (s1, s2) -> {
    System.out.println("joining \"" + s1 + "\" and \"" + s2 + "\"");
    return s1 + "/" + s2;
};
System.out.println(Stream.of("a", "b", "c", "d", "e", "f")
        .parallel()
        .reduce("", accumulator));
This is example output (it differs between runs):
joining "" and "d"
joining "" and "f"
joining "" and "b"
joining "" and "a"
joining "" and "c"
joining "" and "e"
joining "/b" and "/c"
joining "/e" and "/f"
joining "/a" and "/b//c"
joining "/d" and "/e//f"
joining "/a//b//c" and "/d//e//f"
/a//b//c//d//e//f
You can add an if statement to your function to handle empty string separately:
System.out.println(Stream.of("a", "b", "c", "d", "e", "f")
        .parallel()
        .reduce("", (s1, s2) -> s1.isEmpty() ? s2 : s1 + "/" + s2));
As Marko Topolnik noticed, checking s2 is not required, since the accumulator doesn't have to be a commutative function.
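An alternative sketch (class name is just for illustration) that sidesteps the identity requirement entirely: the Optional-returning overload of reduce() takes no identity element, so only actual stream elements are ever combined, and the associative accumulator is then safe in parallel as-is:

```java
import java.util.stream.Stream;

public class NoIdentityReduce {
    public static void main(String[] args) {
        // reduce(accumulator) without an identity returns an Optional and
        // never feeds an empty string into the accumulation.
        String joined = Stream.of("a", "b", "c", "d", "e", "f")
                .parallel()
                .reduce((s1, s2) -> s1 + "/" + s2)
                .orElse("");
        System.out.println(joined); // a/b/c/d/e/f
    }
}
```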
To add to the other answer: you might want to use mutable reduction. The docs specify that doing something like
String concatenated = strings.reduce("", String::concat)
will give poor performance:
We would get the desired result, and it would even work in parallel. However, we might not be happy about the performance! Such an implementation would do a great deal of string copying, and the run time would be O(n^2) in the number of characters. A more performant approach would be to accumulate the results into a StringBuilder, which is a mutable container for accumulating strings. We can use the same technique to parallelize mutable reduction as we do with ordinary reduction.
So you should use a StringBuilder instead.