Spliterator trySplit return type

Tags: java, lambda, java-8, spliterator

I have stumbled upon an interesting detail in java.util.Spliterator (Java 8).

The trySplit() method is supposed to return an instance of Spliterator, or null if the spliterator can't be split. The Javadoc says the following:

 * @return a {@code Spliterator} covering some portion of the
 * elements, or {@code null} if this spliterator cannot be split.

This looks to me like a perfect place to use java.util.Optional. As per its Javadoc:

 * A container object which may or may not contain a non-null value.

Is there any reason why Optional was not used?

Googling did not help much; all I found was this question on the lambda-dev mailing list, which was never answered.

asked May 12 '15 14:05

Andrew

2 Answers

There are a couple of reasons it's the way it is. Of course, conceptually, trySplit could return Optional<Spliterator<T>>, but there are some design forces that pushed away from this.

One reason is that there's a difference between methods such as findFirst that return Optional vs. methods such as trySplit that return value-or-null.

Methods like findFirst are called by and return values to application code.
Methods like trySplit are called by and return values to library code.

A design aspect of the JDK class libraries is that the library APIs are (or should be) designed to make things easier for application code, and library code will often take on more complexity in order to make things simpler for applications.

One of the main reasons for Optional is to avoid passing nulls from the library to application code, because improper null handling is a common source of NullPointerExceptions. Instead of null, APIs like findFirst will return an empty Optional, which is supported by a rich set of methods such as orElse, map, filter, flatMap, etc. that provide a great deal of flexibility to applications for dealing with the not-found case.
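To make the application-side view concrete, here is a minimal sketch of consuming the Optional returned by findFirst. The class name FindFirstDemo and the helper firstLongWord are made up for illustration; only the stream and Optional calls are real JDK API.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical demo class, not part of the JDK.
final class FindFirstDemo {
    // Returns the first word longer than minLength in upper case,
    // or the string "none" when no word qualifies.
    static String firstLongWord(List<String> words, int minLength) {
        Optional<String> first = words.stream()
                .filter(w -> w.length() > minLength)
                .findFirst(); // empty Optional, never null, when nothing matches
        return first.map(String::toUpperCase).orElse("none");
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("to", "split", "or", "not");
        System.out.println(firstLongWord(words, 3));  // prints SPLIT
        System.out.println(firstLongWord(words, 10)); // prints none
    }
}
```

The application never sees a null here, so it cannot forget a null check; the not-found case is handled by orElse.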

Note that the nullable return value from trySplit is going in the opposite direction: from the application to the library.
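As an illustration of that direction, here is a rough sketch of an application-written Spliterator whose trySplit hands null back to the library. RangeSpliterator and sumRange are hypothetical names invented for this example, and the split policy (halve until fewer than two elements remain) is just one simple choice.

```java
import java.util.Spliterator;
import java.util.function.Consumer;
import java.util.stream.StreamSupport;

// Hypothetical example class: iterates the ints in [start, end).
final class RangeSpliterator implements Spliterator<Integer> {
    private int next;
    private final int end; // exclusive

    RangeSpliterator(int start, int end) {
        this.next = start;
        this.end = Math.max(start, end); // treat a reversed range as empty
    }

    @Override
    public boolean tryAdvance(Consumer<? super Integer> action) {
        if (next >= end) return false;
        action.accept(next++);
        return true;
    }

    @Override
    public Spliterator<Integer> trySplit() {
        int remaining = end - next;
        if (remaining < 2) return null; // cannot split: the library receives null
        int mid = next + remaining / 2;
        Spliterator<Integer> prefix = new RangeSpliterator(next, mid);
        next = mid; // this spliterator keeps the second half
        return prefix;
    }

    @Override
    public long estimateSize() {
        return end - next;
    }

    @Override
    public int characteristics() {
        return ORDERED | SIZED | SUBSIZED | NONNULL | IMMUTABLE;
    }

    // Lets the stream library drive the splitting in a parallel pipeline.
    static long sumRange(int start, int end) {
        return StreamSupport.stream(new RangeSpliterator(start, end), true)
                .mapToLong(Integer::longValue)
                .sum();
    }

    public static void main(String[] args) {
        System.out.println(sumRange(0, 100)); // prints 4950
    }
}
```

The only place null appears is the return statement in trySplit. The application code never dereferences it, so it cannot cause an NPE on the application side.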

Having application code pass or return a nullable value to the library is considerably less error-prone for the application than having it receive a nullable value from the library. If you're writing an application and the API says that you should pass or return a null to the library, there's no possibility that this will generate an NPE in your code. Indeed, there are a variety of places in the APIs (List.sort(null) comes to mind) where null has particular semantics in the API.
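For the List.sort(null) case, a small sketch (SortNullDemo and sortedNaturally are illustrative names): List.sort is documented to use the elements' natural ordering when the comparator is null.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical demo class, not part of the JDK.
final class SortNullDemo {
    // Returns a sorted copy; the null comparator requests natural ordering.
    static List<Integer> sortedNaturally(List<Integer> input) {
        List<Integer> copy = new ArrayList<>(input);
        copy.sort(null); // the application passes null to the library
        return copy;
    }

    public static void main(String[] args) {
        System.out.println(sortedNaturally(Arrays.asList(3, 1, 2))); // prints [1, 2, 3]
    }
}
```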

trySplit is called from relatively few places in the library, and the library maintainers are taking on the burden of dealing properly with null in all of those cases.
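On the library side, the null check looks roughly like the following. This is a simplified, sequential sketch and not the actual JDK code (the real stream implementation splits inside fork/join tasks); SplitDemo and sum are names invented for this example.

```java
import java.util.Arrays;
import java.util.Spliterator;

// Hypothetical demo class showing a caller of trySplit.
final class SplitDemo {
    // Splits recursively until trySplit() returns null, then sums that leaf.
    static long sum(Spliterator<Integer> s) {
        Spliterator<Integer> prefix = s.trySplit();
        if (prefix == null) {
            // Unsplittable leaf: process the remaining elements in place.
            long[] total = {0};
            s.forEachRemaining(x -> total[0] += x);
            return total[0];
        }
        // After a successful split, s covers whatever prefix does not.
        return sum(prefix) + sum(s);
    }

    public static void main(String[] args) {
        Spliterator<Integer> s =
                Arrays.asList(1, 2, 3, 4, 5, 6, 7, 8, 9, 10).spliterator();
        System.out.println(sum(s)); // prints 55
    }
}
```

The single null check sits in one place in the calling code, which is the burden the library maintainers accept.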

Another prime consideration is performance. Splitting is in the critical path of setting up a parallel pipeline. It's performed sequentially, before work is handed off to different threads to be executed in parallel. Per Amdahl's Law, in order to make parallelism as efficient as possible, you want to minimize the sequential setup overhead.

The fact is that an Optional is a box, and there is a cost to boxing and unboxing a value to and from an Optional. The JIT compiler might be able to optimize this away in some cases, but it might not. Even if it does, there's a period of time where the code is running but the Optional hasn't yet been optimized away. That's additional overhead. Since the library code is willing to bear the burden of handling null properly, we can guarantee there's no boxing overhead simply by not using Optional at all in this case.

answered Sep 28 '22 09:09

Stuart Marks

Spliterator is part of the internal stream implementation. It is not meant to be used in business logic, where Optional would be convenient. It's quite a low-level interface whose main goal is to be fast, so there's no reason for Optional there.

You might argue that Optional can usually be eliminated by the JIT compiler. However, that's not always the case. For example, the default maximum call depth for inlining in the HotSpot JIT compiler (MaxInlineLevel) is 9 in Java 8, and typical stream processing has more stack frames than that, so even one additional stack frame may prevent the optimization.

answered Sep 28 '22 08:09

Tagir Valeev
