Unsplittable Spliterators

Tags: java, java-8, spliterator

I'm trying to understand how Spliterator works, and how spliterators are designed. I recognize that trySplit() is likely one of the more important methods of Spliterator, but when I see some third-party Spliterator implementations, sometimes I see that their spliterators return null for trySplit() unconditionally.

The questions:

1. Is there a difference between an ordinary iterator and a Spliterator that returns null unconditionally? It seems like such a spliterator defeats the point of, well, splitting.
2. Of course, there are legitimate use cases of spliterators that conditionally return null from trySplit(), but is there a legitimate use case of a spliterator that unconditionally returns null?

597

asked Mar 05 '15 02:03

Kelvin Chung

2 Answers

While the main advantage of Spliterator over Iterator is, as you said, its trySplit() method, which allows traversal to be parallelized, there are other significant advantages. Quoting the Spliterator Javadoc (http://docs.oracle.com/javase/8/docs/api/java/util/Spliterator.html):

The Spliterator API was designed to support efficient parallel traversal in addition to sequential traversal, by supporting decomposition as well as single-element iteration. In addition, the protocol for accessing elements via a Spliterator is designed to impose smaller per-element overhead than Iterator, and to avoid the inherent race involved in having separate methods for hasNext() and next().
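To make the protocol difference in that quote concrete, here is a minimal sketch that drains the same list both ways. The class and method names (TraversalProtocols, drainWithIterator, drainWithSpliterator) are made up for this illustration; only Iterator and Spliterator themselves come from the JDK.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.Spliterator;

// Hypothetical demo class, not part of any library.
class TraversalProtocols {

    // Iterator protocol: two calls per element (hasNext, then next).
    static List<String> drainWithIterator(List<String> source) {
        List<String> out = new ArrayList<>();
        Iterator<String> it = source.iterator();
        while (it.hasNext()) {
            out.add(it.next());
        }
        return out;
    }

    // Spliterator protocol: a single tryAdvance call both checks for an
    // element and, if one exists, hands it to the consumer.
    static List<String> drainWithSpliterator(List<String> source) {
        List<String> out = new ArrayList<>();
        Spliterator<String> sp = source.spliterator();
        while (sp.tryAdvance(out::add)) {
            // nothing to do here: the element was already passed to out::add
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> data = Arrays.asList("a", "b", "c");
        System.out.println(drainWithIterator(data));    // [a, b, c]
        System.out.println(drainWithSpliterator(data)); // [a, b, c]
    }
}
```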

Furthermore, a Spliterator can be converted directly to a Stream with StreamSupport.stream, so it can be used with Java 8's Stream API.
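A small sketch of that conversion, assuming the Spliterator comes from an ordinary List. The class SpliteratorToStream and its upperCase method are hypothetical names for this example; StreamSupport.stream(Spliterator, boolean) is the real JDK call, and the boolean selects a parallel (true) or sequential (false) stream.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Spliterator;
import java.util.stream.Collectors;
import java.util.stream.Stream;
import java.util.stream.StreamSupport;

// Hypothetical demo class, not part of any library.
class SpliteratorToStream {

    static List<String> upperCase(Spliterator<String> sp) {
        // false = build a sequential stream on top of the spliterator
        Stream<String> stream = StreamSupport.stream(sp, false);
        return stream.map(String::toUpperCase).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("split", "iterate");
        System.out.println(upperCase(words.spliterator())); // [SPLIT, ITERATE]
    }
}
```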

167

answered Sep 25 '22 06:09

Adrian Leonhard

One of the purposes of a Spliterator is to be able to split, but that's not the only purpose. The other main purpose is as a support class for creating your own Stream source. One way to create a Stream source is to implement your own Spliterator and pass it to StreamSupport.stream. The simplest thing to do is often to write a Spliterator that can't split. Doing so forces the stream to execute sequentially, but that might be acceptable for whatever you're trying to do.
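As a rough sketch of that idea, here is a Spliterator that cannot split, used as a Stream source. CountdownSpliterator and CountdownDemo are invented for this illustration and are not taken from any library; the point is only that trySplit() returns null unconditionally while the rest of the contract is still honoured.

```java
import java.util.List;
import java.util.Spliterator;
import java.util.function.Consumer;
import java.util.stream.Collectors;
import java.util.stream.StreamSupport;

// Hypothetical source: yields start, start - 1, ..., 1.
class CountdownSpliterator implements Spliterator<Integer> {
    private int next;

    CountdownSpliterator(int start) {
        this.next = start;
    }

    @Override
    public boolean tryAdvance(Consumer<? super Integer> action) {
        if (next <= 0) {
            return false;          // exhausted
        }
        action.accept(next--);     // emit one element, then step down
        return true;
    }

    @Override
    public Spliterator<Integer> trySplit() {
        return null;               // never splits
    }

    @Override
    public long estimateSize() {
        return Math.max(next, 0);  // exact count of remaining elements
    }

    @Override
    public int characteristics() {
        return ORDERED | SIZED | NONNULL;
    }
}

class CountdownDemo {
    static List<Integer> collect(int start) {
        return StreamSupport.stream(new CountdownSpliterator(start), false)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(collect(3)); // [3, 2, 1]
    }
}
```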

There are other cases where writing a non-splittable Spliterator makes sense. For example, in OpenJDK, there are implementations such as EmptySpliterator that contain no elements. Of course it can't be split. A similar case is a singleton spliterator that contains exactly one element. It can't be split either. Both implementations return null unconditionally from trySplit.
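Those two cases can be observed through public API without touching the internal classes. This sketch uses Spliterators.emptySpliterator() and the spliterator of Collections.singleton(...); the class TrivialSpliterators is a hypothetical name for the example.

```java
import java.util.Collections;
import java.util.Spliterator;
import java.util.Spliterators;

// Hypothetical demo class, not part of any library.
class TrivialSpliterators {

    // An empty spliterator has nothing to hand off, so trySplit() yields null.
    static boolean emptyIsUnsplittable() {
        Spliterator<String> empty = Spliterators.emptySpliterator();
        return empty.trySplit() == null;
    }

    // A one-element source cannot be divided either.
    static boolean singletonIsUnsplittable() {
        Spliterator<String> one = Collections.singleton("only").spliterator();
        return one.trySplit() == null;
    }

    public static void main(String[] args) {
        System.out.println(emptyIsUnsplittable());     // true
        System.out.println(singletonIsUnsplittable()); // true
    }
}
```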

Another case is where writing a non-splittable Spliterator is easy and effective, and the amount of code necessary to implement a splittable one is prohibitive. (At least, not worth the effort of writing one into a Stack Overflow answer.) For example, see the example Spliterator from this answer. The case here is that the Spliterator implementation wants to wrap another Spliterator and do something special, in this case check to see if it's not empty. Otherwise it just delegates everything to the wrapped Spliterator. Doing this with a non-splittable Spliterator is pretty easy.
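The following is only a sketch of what such a wrapper might look like; it is not the code from the linked answer. NonEmptySpliterator and NonEmptyDemo are hypothetical names, and throwing NoSuchElementException on an empty source is an assumption made for the example. Everything except the emptiness check is delegated, and trySplit() returns null so the check never has to be coordinated across splits.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.NoSuchElementException;
import java.util.Spliterator;
import java.util.function.Consumer;
import java.util.stream.Collectors;
import java.util.stream.StreamSupport;

// Hypothetical wrapper: fails if the wrapped source turns out to be empty.
class NonEmptySpliterator<T> implements Spliterator<T> {
    private final Spliterator<T> source;
    private boolean seenElement;

    NonEmptySpliterator(Spliterator<T> source) {
        this.source = source;
    }

    @Override
    public boolean tryAdvance(Consumer<? super T> action) {
        boolean advanced = source.tryAdvance(action);
        if (advanced) {
            seenElement = true;
        } else if (!seenElement) {
            // Assumed policy for this sketch: an empty source is an error.
            throw new NoSuchElementException("source was empty");
        }
        return advanced;
    }

    @Override
    public Spliterator<T> trySplit() {
        return null; // unsplittable, which keeps the check trivial
    }

    @Override
    public long estimateSize() {
        return source.estimateSize();
    }

    @Override
    public int characteristics() {
        return source.characteristics();
    }

    @Override
    public Comparator<? super T> getComparator() {
        return source.getComparator(); // needed if the source reports SORTED
    }
}

class NonEmptyDemo {
    static <T> List<T> collect(List<T> input) {
        return StreamSupport.stream(new NonEmptySpliterator<>(input.spliterator()), false)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(collect(Arrays.asList("x", "y"))); // [x, y]
    }
}
```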

Notice that there's discussion in that answer, the comment on that answer, in my answer to the same question, and the comment thread on my answer, about how one would make a splittable (i.e., parallel-ready) Spliterator. But nobody actually wrote out the code to do the splitting. :-) Depending upon how much laziness you want to preserve from the original stream, and how much parallel efficiency you want, writing a splittable Spliterator can get pretty complicated.

In my estimation it's somewhat easier to do this sort of stuff by writing an Iterator instead of a Spliterator (as in my answer noted above). It turns out that Spliterators.spliteratorUnknownSize can provide a limited amount of parallelism, even from an Iterator, which is ostensibly a purely sequential construct. It does so within IteratorSpliterator, which pulls multiple elements from the Iterator and processes them in batches. Unfortunately the batch size is hardcoded, but at least this gives the opportunity for processing elements pulled from an Iterator in parallel in certain cases.
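A minimal sketch of that batching behaviour: wrap an Iterator with Spliterators.spliteratorUnknownSize, call trySplit() once, and count how many elements ended up in the split-off batch versus the remainder. IteratorBatching, splitOnce and count are hypothetical names for this example, and the exact batch size is an implementation detail of the JDK, so the code deliberately does not assume a particular value.

```java
import java.util.Iterator;
import java.util.Spliterator;
import java.util.Spliterators;
import java.util.stream.IntStream;

// Hypothetical demo class, not part of any library.
class IteratorBatching {

    // Returns { elements in the first split-off batch, elements left over }.
    static long[] splitOnce(int total) {
        Iterator<Integer> it = IntStream.range(0, total).boxed().iterator();
        Spliterator<Integer> rest =
                Spliterators.spliteratorUnknownSize(it, Spliterator.ORDERED);

        // Non-null as long as the iterator still has elements: the wrapper
        // copies a batch of them into an array-backed spliterator.
        Spliterator<Integer> batch = rest.trySplit();

        long inBatch = (batch == null) ? 0 : count(batch);
        long inRest = count(rest);
        return new long[] { inBatch, inRest };
    }

    static <T> long count(Spliterator<T> sp) {
        long[] n = { 0 };
        sp.forEachRemaining(x -> n[0]++);
        return n[0];
    }

    public static void main(String[] args) {
        long[] sizes = splitOnce(5000);
        System.out.println("batch=" + sizes[0] + ", rest=" + sizes[1]);
    }
}
```

Passing the same spliterator to StreamSupport.stream(spliterator, true) lets the stream framework perform these splits itself, which is where the limited parallelism comes from.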

answered Sep 26 '22 06:09

Stuart Marks
