Is there a preferred way to collect a stream of lists into a flat list?

Tags:

java

java-8

java-stream

I was wondering whether there is a preferred way to get from a stream of lists to a collection containing the elements of all the lists in the stream. I can think of two ways to get there:

final Stream<List<Integer>> stream = Stream.empty();
// Note: a stream can be consumed only once, so the two lines below are alternatives, not consecutive statements.
final List<Integer> one = stream.collect(ArrayList::new, ArrayList::addAll, ArrayList::addAll);
final List<Integer> two = stream.flatMap(List::stream).collect(Collectors.toList());

The second option looks much nicer to me, but I guess the first one is more efficient in parallel streams. Are there further arguments for or against one of the two methods?

327

asked Sep 02 '14 14:09

muued

2 Answers

The main difference is that flatMap is an intermediate operation, while collect is a terminal operation.

So flatMap is the only option if you want to apply further operations to the flattened stream items rather than collecting them immediately.
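To illustrate the point about intermediate operations, here is a minimal sketch. The class name FlatMapPipeline, the method evenSquares, and the sample data are made up for this example and are not part of the question.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class FlatMapPipeline {
    // Hypothetical helper: flattens the nested lists, then keeps processing
    // the flattened elements before finally collecting them.
    static List<String> evenSquares(Stream<List<Integer>> nested) {
        return nested
                .flatMap(List::stream)          // intermediate: yields a Stream<Integer>
                .filter(n -> n % 2 == 0)        // further intermediate operations are possible
                .map(n -> "sq=" + n * n)
                .collect(Collectors.toList());  // terminal operation ends the pipeline
    }

    public static void main(String[] args) {
        Stream<List<Integer>> nested = Stream.of(
                Arrays.asList(1, 2), Arrays.asList(3, 4), Arrays.asList(5, 6));
        System.out.println(evenSquares(nested)); // prints [sq=4, sq=16, sq=36]
    }
}
```

With the three-argument collect, the same filtering and mapping would have to happen on the resulting list afterwards, or inside the accumulator.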

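Furthermore, collect(ArrayList::new, ArrayList::addAll, ArrayList::addAll) is very hard to read, because the two identical method references ArrayList::addAll have completely different semantics: the first is the accumulator, which adds one stream element (a sub-list) to a result list, and the second is the combiner, which merges two partial result lists.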
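The different roles become visible when the method references are written out as lambdas. This is only a sketch; the class name ThreeArgCollect, the method flatten, and the lambda parameter names are chosen here for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Stream;

class ThreeArgCollect {
    // Hypothetical helper spelling out the three arguments of collect.
    static List<Integer> flatten(Stream<List<Integer>> nested) {
        return nested.collect(
                () -> new ArrayList<Integer>(),               // supplier: creates a fresh result container
                (result, subList) -> result.addAll(subList),  // accumulator: folds one stream element (a sub-list) in
                (left, right) -> left.addAll(right));         // combiner: merges two partial results (needed for parallel runs)
    }

    public static void main(String[] args) {
        Stream<List<Integer>> nested = Stream.of(Arrays.asList(1, 2), Arrays.asList(3));
        System.out.println(flatten(nested)); // prints [1, 2, 3]
    }
}
```

Both lambdas call addAll, but the second argument is a sub-list from the stream in one case and a partial result list in the other.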

Regarding parallel processing, your guess is wrong. The first one has fewer opportunities for parallel processing, as it relies on ArrayList.addAll applied to the stream items (sub-lists), which can’t be broken into parallel sub-steps. In contrast, Collectors.toList() applied to a flatMap can, in principle, process sub-list items in parallel if the particular Lists encountered in the stream support it, though whether that actually happens depends on the stream implementation. But this will be relevant only if you have a rather small stream of rather big sub-lists.
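For anyone who wants to experiment with this, the following sketch runs both variants on a parallel stream. It does not measure performance; it only confirms that both yield the same list in encounter order. The class name ParallelFlatten, the method names, and the sample data are assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

class ParallelFlatten {
    // Variant one: three-argument collect; each sub-list is added as a whole by addAll.
    static List<Integer> viaCollect(List<List<Integer>> lists) {
        return lists.parallelStream()
                .collect(ArrayList::new, ArrayList::addAll, ArrayList::addAll);
    }

    // Variant two: flatten first, then collect the individual elements.
    static List<Integer> viaFlatMap(List<List<Integer>> lists) {
        return lists.parallelStream()
                .flatMap(List::stream)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<List<Integer>> lists = Arrays.asList(
                Arrays.asList(1, 2, 3), Arrays.asList(4), Arrays.asList(5, 6));
        System.out.println(viaCollect(lists)); // prints [1, 2, 3, 4, 5, 6]
        System.out.println(viaFlatMap(lists)); // prints [1, 2, 3, 4, 5, 6]
    }
}
```

To compare actual throughput, a benchmark harness and realistic data sizes would be needed; this sketch makes no claim about which variant is faster.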

The only drawback of flatMap is that it creates an intermediate stream for each sub-list, which adds overhead when you have a lot of very small sub-lists.

But in your example, the stream is empty so it doesn’t matter (scnr).

answered Nov 15 '22 19:11

Holger

I think the intent of option two is much clearer than that of option one. It took me a few seconds to work out what was happening with the first one; it doesn't look "right", although it seems valid. Option two was more obvious to me.

Essentially, what you are doing is a flat-map operation. If that's the case, I'd expect to see flatMap used rather than addAll().

answered Nov 15 '22 18:11

Ian Fairman
