This is a question about API desing. When extension methods were added in C#, <code>IEnumerable</code> got all the methods that enabled using lambda expression directly on all Collections. With the advent of lambdas and default methods in Java, I would expect that <code>Collection</code> would implement <code>Stream</code> and provide default implementations for all its methods. This way, we would not need to call <code>stream()</code> in order to leverage the power it provides. What is the reason the library architects opted for the less convenient approach?

<ol> <li>A Collection is an object model </li> <li>A Stream is a subject model</li> </ol> Collection definition in doc : <blockquote> A collection represents a group of objects, known as its elements. </blockquote> Stream definition in doc : <blockquote> A sequence of elements supporting sequential and parallel aggregate operations </blockquote> Seen this way, a stream is a specific collection. Not the way around. Thus Collection should not Implement Stream, regardless of backward compatibility. So why doesnt <code>Stream<T></code> implement <code>Collection<T></code> ? Because It is another way of looking at a bunch of objects. Not as a group of elements, but by the operations you can perform on it. Thus this is why I say a Collection is an object model while a Stream is a subject model

Why doesn't Collection<T> Implement Stream<T>? [duplicate]

Tags:

java

lambda

java-8

api-design

This is a question about API desing. When extension methods were added in C#, IEnumerable got all the methods that enabled using lambda expression directly on all Collections.

With the advent of lambdas and default methods in Java, I would expect that Collection would implement Stream and provide default implementations for all its methods. This way, we would not need to call stream() in order to leverage the power it provides.

What is the reason the library architects opted for the less convenient approach?

891

asked Feb 11 '15 16:02

Vitaliy

2 Answers

From Maurice Naftalin's Lambda FAQ:

Why are Stream operations not defined directly on Collection?

Early drafts of the API exposed methods like filter, map, and reduce on Collection or Iterable. However, user experience with this design led to a more formal separation of the “stream” methods into their own abstraction. Reasons included:
Methods on Collection such as removeAll make in-place modifications, in contrast to the new methods which are more functional in nature. Mixing two different kinds of methods on the same abstraction forces the user to keep track of which are which. For example, given the declaration
Collection strings;
the two very similar-looking method calls
strings.removeAll(s -> s.length() == 0);
strings.filter(s -> s.length() == 0);          // not supported in the current API
would have surprisingly different results; the first would remove all empty String objects from the collection, whereas the second would return a stream containing all the non-empty Strings, while having no effect on the collection.

Instead, the current design ensures that only an explicitly-obtained stream can be filtered:
strings.stream().filter(s.length() == 0)...;
where the ellipsis represents further stream operations, ending with a terminating operation. This gives the reader a much clearer intuition about the action of filter;
With lazy methods added to Collection, users were confused by a perceived—but erroneous—need to reason about whether the collection was in “lazy mode” or “eager mode”. Rather than burdening Collection with new and different functionality, it is cleaner to provide a Stream view with the new functionality;

The more methods added to Collection, the greater the chance of name collisions with existing third-party implementations. By only adding a few methods (stream, parallel) the chance for conflict is greatly reduced;
A view transformation is still needed to access a parallel view; the asymmetry between the sequential and the parallel stream views was unnatural. Compare, for example
coll.filter(...).map(...).reduce(...);
with
coll.parallel().filter(...).map(...).reduce(...);
This asymmetry would be particularly obvious in the API documentation, where Collection would have many new methods to produce sequential streams, but only one to produce parallel streams, which would then have all the same methods as Collection. Factoring these into a separate interface, StreamOps say, would not help; that would still, counterintuitively, need to be implemented by both Stream and Collection;
A uniform treatment of views also leaves room for other additional views in the future.

127

answered Nov 10 '22 06:11

John Kugelman

A Collection is an object model
A Stream is a subject model

Collection definition in doc :

A collection represents a group of objects, known as its elements.

Stream definition in doc :

A sequence of elements supporting sequential and parallel aggregate operations

Seen this way, a stream is a specific collection. Not the way around. Thus Collection should not Implement Stream, regardless of backward compatibility.

So why doesnt Stream<T> implement Collection<T> ? Because It is another way of looking at a bunch of objects. Not as a group of elements, but by the operations you can perform on it. Thus this is why I say a Collection is an object model while a Stream is a subject model

answered Nov 10 '22 08:11

UmNyobe

Related questions
                            
                                How to add Profiling And Logging perspective to Eclipse Luna?
                            
                                Who loads java.lang.ClassLoader?
                            
                                jtable how to use rs2xml
                            
                                How many strings are in jvm string pool intern
                            
                                Gradle: Copy subproject resources
                            
                                readFully not defined with Java Nashorn Javascript Engine
                            
                                Internal graphics not initialized yet: javafx
                            
                                Nested wildcards
                            
                                how to get table structure for h2 database using metadata
                            
                                Convert String to Joda LocalTime format (HH:mm:ss) and Remove milliseconds
                            
                                Java: how to require subclasses to call super() when super's argument is a vararg
                            
                                Scanner will not scan negative numbers
                            
                                Java 8 Compiler Confusion With overloaded methods
                            
                                Bean Transaction Timeout in WebSphere using EJB Timer
                            
                                Accessing main arguments from a static initializer
                            
                                NPE in Spring Data JPA with IN clause
                            
                                Recursive Karatsuba multiplication not working?
                            
                                Fatal exception handling in Java
                            
                                What should the type of parameter be in Java when it is a "timestamp without time zone" in postgresql?
                            
                                Why the "cannot select from a type variable"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With