Why is the Java 8 'Collector' class designed in this way?

Tags: java, java-8, java-stream, collectors

Java 8 introduced a new Stream API, and java.util.stream.Collector is the interface that defines how to aggregate/collect the elements of a stream.

However, the Collector interface is designed like this:

    public interface Collector<T, A, R> {
        Supplier<A> supplier();
        BiConsumer<A, T> accumulator();
        BinaryOperator<A> combiner();
        Function<A, R> finisher();
    }

Why is it not designed like the following?

    public interface Collector<T, A, R> {
        A supply();
        void accumulate(A accumulator, T value);
        A combine(A left, A right);
        R finish(A accumulator);
    }

The latter is much easier to implement. What were the considerations behind choosing the former design?


asked Apr 29 '16 03:04

popcorny

1 Answer

Actually, it was originally designed much like what you propose. See the early implementation in the project lambda repository (makeResult is now supplier). It was later updated to the current design. I believe the rationale for this update was to simplify collector combinators. I did not find any specific discussion on this topic, but my guess is supported by the fact that the mapping collector appeared in the same changeset. Consider the implementation of Collectors.mapping:

    public static <T, U, A, R>
    Collector<T, ?, R> mapping(Function<? super T, ? extends U> mapper,
                               Collector<? super U, A, R> downstream) {
        BiConsumer<A, ? super U> downstreamAccumulator = downstream.accumulator();
        return new CollectorImpl<>(downstream.supplier(),
                                   (r, t) -> downstreamAccumulator.accept(r, mapper.apply(t)),
                                   downstream.combiner(), downstream.finisher(),
                                   downstream.characteristics());
    }

This implementation needs to redefine only the accumulator function, leaving supplier, combiner and finisher as they are, so there is no additional indirection when calling supplier, combiner or finisher: you directly call the functions returned by the original collector. This matters even more with collectingAndThen:

    public static<T,A,R,RR> Collector<T,A,RR> collectingAndThen(Collector<T,A,R> downstream,
                                                                Function<R,RR> finisher) {
        // ... some characteristics transformations ...
        return new CollectorImpl<>(downstream.supplier(),
                                   downstream.accumulator(),
                                   downstream.combiner(),
                                   downstream.finisher().andThen(finisher),
                                   characteristics);
    }

Here only the finisher is changed, while the original supplier, accumulator and combiner are reused as they are. Since the accumulator is called for every element, avoiding an extra level of indirection there could be pretty important. Try to rewrite mapping and collectingAndThen with your proposed design and you will see the problem. The new JDK 9 collectors, such as filtering and flatMapping, also benefit from the current design.
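To make the suggested rewrite concrete, here is a minimal sketch of what mapping might look like under the method-based design from the question. The names MethodCollector, toList, collect and lengthsOf are hypothetical and exist only for this illustration; none of them are JDK APIs, and collect is just a simple sequential stand-in for Stream.collect.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

public class MethodStyleCollectorDemo {

    // Hypothetical interface: the method-based design proposed in the question.
    public interface MethodCollector<T, A, R> {
        A supply();
        void accumulate(A accumulator, T value);
        A combine(A left, A right);
        R finish(A accumulator);
    }

    // Hypothetical toList() collector written in the method-based style.
    public static <T> MethodCollector<T, List<T>, List<T>> toList() {
        return new MethodCollector<T, List<T>, List<T>>() {
            @Override public List<T> supply() { return new ArrayList<>(); }
            @Override public void accumulate(List<T> acc, T value) { acc.add(value); }
            @Override public List<T> combine(List<T> left, List<T> right) {
                left.addAll(right);
                return left;
            }
            @Override public List<T> finish(List<T> acc) { return acc; }
        };
    }

    // mapping() in the method-based style: only accumulate() really changes,
    // yet supply(), combine() and finish() each need a forwarding method,
    // so every call goes through the wrapper before reaching the downstream.
    public static <T, U, A, R> MethodCollector<T, A, R> mapping(
            Function<? super T, ? extends U> mapper,
            MethodCollector<? super U, A, R> downstream) {
        return new MethodCollector<T, A, R>() {
            @Override public A supply() { return downstream.supply(); }
            @Override public void accumulate(A acc, T value) {
                downstream.accumulate(acc, mapper.apply(value));
            }
            @Override public A combine(A left, A right) {
                return downstream.combine(left, right);
            }
            @Override public R finish(A acc) { return downstream.finish(acc); }
        };
    }

    // Minimal sequential driver standing in for Stream.collect().
    public static <T, A, R> R collect(Iterable<T> source,
                                      MethodCollector<T, A, R> collector) {
        A acc = collector.supply();
        for (T value : source) {
            collector.accumulate(acc, value);
        }
        return collector.finish(acc);
    }

    // Small helper so the example has a single entry point to call.
    public static List<Integer> lengthsOf(List<String> words) {
        return collect(words, mapping(String::length, toList()));
    }

    public static void main(String[] args) {
        System.out.println(lengthsOf(Arrays.asList("a", "bb", "ccc"))); // [1, 2, 3]
    }
}
```

In this sketch, three of the four methods in mapping are pure forwarding code, and stacking several combinators would stack those forwarding layers as well. With the function-returning design, the wrapper can hand back the downstream's own supplier, combiner and finisher objects unchanged.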

answered Oct 14 '22 15:10

Tagir Valeev
