What does the Java 8 Collector UNORDERED characteristic mean?

2 Answers

In the absence of special pleading, stream operations must behave as if the elements are processed in the encounter order of the source. For some operations -- such as reduction with an associative operation -- one can obey this constraint and still get efficient parallel execution. For others, though, this constraint is very limiting. And, for some problems, this constraint isn't meaningful to the user. Consider the following stream pipeline:

people.stream()
      .collect(groupingBy(Person::getLastName, 
                          mapping(Person::getFirstName));

Is it important that the list of first names associated with "Smith" appear in the map in the order they appeared in the initial stream? For some problems, yes, for some no -- we don't want the stream library guessing for us. An unordered collector says that it's OK to insert the first names into the list in an order inconsistent with the order in which Smith-surnamed people appear in the input source. By relaxing this constraint, sometimes (not always), the stream library can give a more efficient execution.

For example, if you didn't care about this order preservation, you could execute it as:

people.parallelStream()
      .collect(groupingByConcurrent(Person::getLastName, 
                                    mapping(Person::getFirstName));

The concurrent collector is unordered, which permits the optimization of sharing an underlying ConcurrentMap, rather than having O(log n) map-merge steps. Relaxing the ordering constraint enables a real algorithmic advantage -- but we can't assume the constraint doesn't matter, we need for the user to tell us this. Using an UNORDERED collector is one way to tell the stream library that these optimizations are fair game.

answered Sep 22 '22 16:09

Brian Goetz

UNORDERED essentially means that the collector is both associative (required by the spec) and commutative (not required).

Associativity allows splitting the computation into subparts and then combining them into the full result, but requires the combining step to be strictly ordered. Examine this snippet from the docs:

 A a2 = supplier.get();
 accumulator.accept(a2, t1);
 A a3 = supplier.get();
 accumulator.accept(a3, t2);
 R r2 = finisher.apply(combiner.apply(a2, a3));  // result with splitting

In the last step, combiner.apply(a2, a3), the arguments must appear in exactly this order, which means that the entire computation pipeline must track the order and respect it in the end.

Another way of saying this is that the tree we get from recursive splitting must be ordered.

On the other hand, if the combining operation is commutative, we can combine any subpart with any other, in no particular order, and always obtain the same result. Clearly this leads to many optimization opportunities in both space and time dimensions.

It should be noted that there are UNORDERED collectors in the JDK which don't guarantee commutativity. The main category are the "higher-order" collectors which are composed with other downstream collectors, but they don't enforce the UNORDERED property on them.

answered Sep 20 '22 16:09

Marko Topolnik

Related questions
                            
                                Creating a random 4 digit number, and storing it to a string [duplicate]
                            
                                Difference between this.variable and variable in Java [duplicate]
                            
                                Two Maven Dependency for latest and old version conflicts
                            
                                Encode image from URL in Base64 in Java
                            
                                BigInteger::intValueExact() - what's the point?
                            
                                How to find button element with webdriver?
                            
                                How to use labels in java code?
                            
                                Thread join() does not wait
                            
                                Accessing appView from Cordova 5.0.0
                            
                                How to create 'testng.xml' using Eclipse
                            
                                Perform action inside stream of operation in Java 8
                            
                                Convert java.util.Calendar ISO 8601 format to java.sql.Timestamp
                            
                                Error:cannot find symbol class RecyclerView
                            
                                Difference between using fully qualified name and import in Java
                            
                                String sorting null values
                            
                                Oracle Sql developer error: could not install some modules
                            
                                Execute jar file as Administrator in Windows
                            
                                Spring Boot and Thymeleaf 3.0.0.RELEASE integration
                            
                                EventBus - @Subscribe annotated method is never used
                            
                                Cannot resolve symbol 'menu' errors in Android Studio

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What does the Java 8 Collector UNORDERED characteristic mean?

Tags:

java

java-8

java-stream

collectors

csharpfolk

People also ask

2 Answers

Brian Goetz

Marko Topolnik

Recent Activity

Donate For Us