To take advantage of the wide range of query methods included in <code>java.util.stream</code> of Jdk 8 I am attempted to design domain models where getters of relationship with <code>*</code> multiplicity (with zero or more instances ) return a <code>Stream<T></code>, instead of an <code>Iterable<T></code> or <code>Iterator<T></code>. My doubt is if there is any additional overhead incurred by the <code>Stream<T></code> in comparison to the <code>Iterator<T></code>? So, is there any disadvantage of compromising my domain model with a <code>Stream<T></code>? Or instead, should I always return an <code>Iterator<T></code> or <code>Iterable<T></code>, and leave to the end-user the decision of choosing whether to use a stream, or not, by converting that iterator with the <code>StreamUtils</code>? Note that returning a <code>Collection</code> is not a valid option because in this case most of the relationships are lazy and with unknown size.

There's lots of performance advice here, but sadly much of it is guesswork, and little of it points to the real performance considerations. @Holger gets it right by pointing out that we should resist the seemingly overwhelming tendency to let the performance tail wag the API design dog. While there are a zillion considerations that can make a stream slower than, the same as, or faster than some other form of traversal in any given case, there are some factors that point to streams haven a performance advantage where it counts -- on big data sets. There is some additional fixed startup overhead of creating a <code>Stream</code> compared to creating an <code>Iterator</code> -- a few more objects before you start calculating. If your data set is large, it doesn't matter; it's a small startup cost amortized over a lot of computation. (And if your data set is small, it probably also doesn't matter -- because if your program is operating on small data sets, performance is generally not your #1 concern either.) Where this does matter is when going parallel; any time spent setting up the pipeline goes into the serial fraction of Amdahl's law; if you look at the implementation, we work hard to keep the object count down during stream setup, but I'd be happy to find ways to reduce it as that has a direct effect on the breakeven data set size where parallel starts to win over sequential. But, more important than the fixed startup cost is the per-element access cost. Here, streams actually win -- and often win big -- which some may find surprising. (In our performance tests, we routinely see stream pipelines which can outperform their for-loop over <code>Collection</code> counterparts.) And, there's a simple explanation for this: <code>Spliterator</code> has fundamentally lower per-element access costs than <code>Iterator</code>, even sequentially. There are several reasons for this. <ol> <li>The Iterator protocol is fundamentally less efficient. It requires calling two methods to get each element. Further, because Iterators must be robust to things like calling <code>next()</code> without <code>hasNext()</code>, or <code>hasNext()</code> multiple times without <code>next()</code>, both of these methods generally have to do some defensive coding (and generally more statefulness and branching), which adds to inefficiency. On the other hand, even the slow way to traverse a spliterator (<code>tryAdvance</code>) doesn't have this burden. (It's even worse for concurrent data structures, because the <code>next</code>/<code>hasNext</code> duality is fundamentally racy, and <code>Iterator</code> implementations have to do more work to defend against concurrent modifications than do <code>Spliterator</code> implementations.)</li> <li><code>Spliterator</code> further offers a "fast-path" iteration -- <code>forEachRemaining</code> -- which can be used most of the time (reduction, forEach), further reducing the overhead of the iteration code that mediates access to the data structure internals. This also tends to inline very well, which in turn increases the effectiveness of other optimizations such as code motion, bounds check elimination, etc. </li> <li>Further, traversal via <code>Spliterator</code> tend to have many fewer heap writes than with <code>Iterator</code>. With <code>Iterator</code>, every element causes one or more heap writes (unless the <code>Iterator</code> can be scalarized via escape analysis and its fields hoisted into registers.) Among other issues, this causes GC card mark activity, leading to cache line contention for the card marks. On the other hand, <code>Spliterators</code> tend to have less state, and industrial-strength <code>forEachRemaining</code> implementations tend to defer writing anything to the heap until the end of the traversal, instead storing its iteration state in locals which naturally map to registers, resulting in reduced memory bus activity.</li> </ol> Summary: don't worry, be happy. <code>Spliterator</code> is a better <code>Iterator</code>, even without parallelism. (They're also generally just easier to write and harder to get wrong.)

Iterator versus Stream of Java 8

Tags:

java

java-8

java-stream

domain-driven-design

domain-model

To take advantage of the wide range of query methods included in java.util.stream of Jdk 8 I am attempted to design domain models where getters of relationship with * multiplicity (with zero or more instances ) return a Stream<T>, instead of an Iterable<T> or Iterator<T>.

My doubt is if there is any additional overhead incurred by the Stream<T> in comparison to the Iterator<T>?

So, is there any disadvantage of compromising my domain model with a Stream<T>?

Or instead, should I always return an Iterator<T> or Iterable<T>, and leave to the end-user the decision of choosing whether to use a stream, or not, by converting that iterator with the StreamUtils?

Note that returning a Collection is not a valid option because in this case most of the relationships are lazy and with unknown size.

544

asked Jul 03 '15 16:07

Miguel Gamboa

1 Answers

There's lots of performance advice here, but sadly much of it is guesswork, and little of it points to the real performance considerations.

@Holger gets it right by pointing out that we should resist the seemingly overwhelming tendency to let the performance tail wag the API design dog.

While there are a zillion considerations that can make a stream slower than, the same as, or faster than some other form of traversal in any given case, there are some factors that point to streams haven a performance advantage where it counts -- on big data sets.

There is some additional fixed startup overhead of creating a Stream compared to creating an Iterator -- a few more objects before you start calculating. If your data set is large, it doesn't matter; it's a small startup cost amortized over a lot of computation. (And if your data set is small, it probably also doesn't matter -- because if your program is operating on small data sets, performance is generally not your #1 concern either.) Where this does matter is when going parallel; any time spent setting up the pipeline goes into the serial fraction of Amdahl's law; if you look at the implementation, we work hard to keep the object count down during stream setup, but I'd be happy to find ways to reduce it as that has a direct effect on the breakeven data set size where parallel starts to win over sequential.

But, more important than the fixed startup cost is the per-element access cost. Here, streams actually win -- and often win big -- which some may find surprising. (In our performance tests, we routinely see stream pipelines which can outperform their for-loop over Collection counterparts.) And, there's a simple explanation for this: Spliterator has fundamentally lower per-element access costs than Iterator, even sequentially. There are several reasons for this.

The Iterator protocol is fundamentally less efficient. It requires calling two methods to get each element. Further, because Iterators must be robust to things like calling next() without hasNext(), or hasNext() multiple times without next(), both of these methods generally have to do some defensive coding (and generally more statefulness and branching), which adds to inefficiency. On the other hand, even the slow way to traverse a spliterator (tryAdvance) doesn't have this burden. (It's even worse for concurrent data structures, because the next/hasNext duality is fundamentally racy, and Iterator implementations have to do more work to defend against concurrent modifications than do Spliterator implementations.)
Spliterator further offers a "fast-path" iteration -- forEachRemaining -- which can be used most of the time (reduction, forEach), further reducing the overhead of the iteration code that mediates access to the data structure internals. This also tends to inline very well, which in turn increases the effectiveness of other optimizations such as code motion, bounds check elimination, etc.
Further, traversal via Spliterator tend to have many fewer heap writes than with Iterator. With Iterator, every element causes one or more heap writes (unless the Iterator can be scalarized via escape analysis and its fields hoisted into registers.) Among other issues, this causes GC card mark activity, leading to cache line contention for the card marks. On the other hand, Spliterators tend to have less state, and industrial-strength forEachRemaining implementations tend to defer writing anything to the heap until the end of the traversal, instead storing its iteration state in locals which naturally map to registers, resulting in reduced memory bus activity.

Summary: don't worry, be happy. Spliterator is a better Iterator, even without parallelism. (They're also generally just easier to write and harder to get wrong.)

153

answered Sep 21 '22 17:09

Brian Goetz

Related questions
                            
                                How to generate JAXB classes from just XML
                            
                                HMAC-SHA1: How to do it properly in Java?
                            
                                How to add element at specific index/position in LinkedHashMap?
                            
                                How do you create a Spring MVC project in Eclipse?
                            
                                Why does Java's BigInteger have TEN and ONE as constants? Any Practical use? [closed]
                            
                                Should I close the servlet outputstream? [duplicate]
                            
                                Efficiently compute Intersection of two Sets in Java?
                            
                                How can CopyOnWriteArrayList be thread-safe?
                            
                                How to roll back migrations using Flyway?
                            
                                Resume http file download in java
                            
                                How to wait until an element is present in Selenium?
                            
                                What is @Override for in Java? [duplicate]
                            
                                Maven adding mainClass in pom.xml with the right folder path
                            
                                Addition assignment += behavior in expression
                            
                                Where to put @Transactional? In interface specification or implementation? [duplicate]
                            
                                Stop Java Coffee Cup icon from appearing in the Dock on Mac OSX
                            
                                Get value from one Optional or another
                            
                                Asynchronous IO in Java?
                            
                                How to do an array of hashmaps?
                            
                                How does Keep-alive work with ThreadPoolExecutor?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With