Should I ever prefer Enum to Stream in Elixir?

Tags: performance, stream, enums, elixir

This may sound like I'm begging to start a flame war, but hear me out.

In some languages laziness is expensive. For example, in Ruby, where I have the most recent experience, laziness is slow because it's achieved using fibers, so it's only attractive when:

- you must trade off CPU for memory (think paging through a large data set)
- the performance penalty is worth it to hide details (yielding to fibers is a great way to abstract away complexity instead of passing down blocks to run in mysterious places)

Otherwise you'll definitely want to use the normal, eager methods.

My initial investigation suggests that the overhead for laziness in Elixir is much lower (this thread on Reddit backs me up), so there seems to be little reason to ever use Enum instead of Stream for the things Stream can do.

Is there something I'm missing? I assume Enum exists for a reason, and it implements some of the same functions as Stream. In what cases, if any, would I want to use Enum instead of Stream when I could use Stream?

asked Oct 31 '16 19:10

G Gordon Worley III

2 Answers

The functions in Stream essentially build a "recipe list" of transformations over your data, while the functions in Enum actually carry those transformations out. So you will eventually have to call an Enum function to resolve your data transformation, even if everything else is a Stream.
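A minimal sketch of that distinction, assuming a small list and an `IO.puts` call that is added here only to make the evaluation order visible:

```elixir
# Building the stream does no work: no element is touched yet.
recipe =
  [1, 2, 3]
  |> Stream.map(fn x ->
    IO.puts("doubling #{x}")
    x * 2
  end)
  |> Stream.filter(fn x -> x > 2 end)

# `recipe` is a lazy enumerable, not a list.
is_list(recipe)
# => false

# An Enum function resolves the recipe; the "doubling" lines print only now.
result = Enum.to_list(recipe)
# => [4, 6]
```

Swapping `Stream.map` and `Stream.filter` for their Enum counterparts would give the same final list, but each step would run immediately and build an intermediate list.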

Also, some operations, namely reduce, have no lazy counterpart: collapsing a collection into a single value means consuming it, so you must use Enum.
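As a rough illustration (the range and the sum-of-squares task are made up for this example), a stream can describe the per-element work, but the fold into one value goes through `Enum.reduce/3`:

```elixir
# Stream describes the per-element step lazily...
squares = Stream.map(1..4, fn x -> x * x end)

# ...and Enum.reduce/3 consumes it, folding everything into one value.
sum_of_squares = Enum.reduce(squares, 0, fn x, acc -> x + acc end)
# => 30
```

The closest lazy relative is `Stream.scan/3`, which emits the running accumulator for each element rather than a single final result.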

As for performance: if you are chaining a series of transformations, working with a possibly infinite stream of data, or reading a file, use Stream. If you have just one transformation over a finite enumerable, or you need to resolve a Stream, use Enum.
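The infinite case is the easiest one to sketch. Assuming an illustrative sequence of natural numbers, the pipeline below only terminates because every step before `Enum.take/2` is lazy:

```elixir
# Stream.iterate/2 describes an infinite sequence: 1, 2, 3, ...
naturals = Stream.iterate(1, fn n -> n + 1 end)

first_even_squares =
  naturals
  |> Stream.map(fn n -> n * n end)
  |> Stream.filter(fn n -> rem(n, 2) == 0 end)
  |> Enum.take(3)
# => [4, 16, 36]
```

Replacing `Stream.map` with `Enum.map` here would never return, because the eager version tries to walk the whole sequence first. The same shape applies to reading a file lazily with `File.stream!/1`.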

answered Oct 07 '22 05:10

greggreg

For short lists, Stream will be slower than simply using Enum, but there's no clear rule without benchmarking exactly what you are doing. There are also some functions that exist in Enum but have no corresponding function in Stream (for example, Enum.reverse).
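A crude way to check this for your own case is sketched below, using `:timer.tc/1` from Erlang's standard library. The ten-element list and the two transformations are placeholders, and a single timed run is noisy, so treat the numbers as a hint rather than a measurement:

```elixir
list = Enum.to_list(1..10)

# Eager version: each Enum call builds a new list.
eager = fn ->
  list
  |> Enum.map(&(&1 * 2))
  |> Enum.filter(&(rem(&1, 4) == 0))
end

# Lazy version: one pass, resolved by Enum.to_list/1 at the end.
lazy = fn ->
  list
  |> Stream.map(&(&1 * 2))
  |> Stream.filter(&(rem(&1, 4) == 0))
  |> Enum.to_list()
end

# :timer.tc/1 returns {elapsed_microseconds, result}.
{eager_us, eager_result} = :timer.tc(eager)
{lazy_us, lazy_result} = :timer.tc(lazy)

IO.puts("Enum: #{eager_us} us, Stream: #{lazy_us} us")
```

Both versions produce the same list, so only the timings differ. For anything you intend to act on, a dedicated benchmarking library such as Benchee, run over realistic input sizes, would be a safer basis.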

The real reason you need both is that a Stream is just a composition of functions. Every pipeline that needs results, rather than only side effects, has to end in an Enum function to get the pipeline to run.
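A small sketch of both endings, with made-up input strings: `Enum.to_list/1` when the results matter, and `Stream.run/1` when only the side effects do.

```elixir
# Results wanted: finish with an Enum function.
lengths =
  ["a", "bb", "ccc"]
  |> Stream.map(&String.length/1)
  |> Enum.to_list()
# => [1, 2, 3]

# Only side effects wanted: Stream.run/1 forces the pipeline and returns :ok.
outcome =
  ["a", "bb", "ccc"]
  |> Stream.each(&IO.puts/1)
  |> Stream.run()
# => :ok
```

Without either final call, neither pipeline would do anything.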

They go hand in hand; Stream couldn't stand alone. What Stream largely does is give you a very handy abstraction for building very complex reduce functions.
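To make the "complex reduce" point concrete, here is a hedged comparison with an arbitrary range and arbitrary steps: a hand-written `Enum.reduce/3` that maps and filters in one pass, next to the Stream pipeline that expresses the same pass step by step.

```elixir
# Hand-written: one pass with Enum.reduce/3, mapping and filtering inline.
by_hand =
  1..10
  |> Enum.reduce([], fn x, acc ->
    doubled = x * 2
    if rem(doubled, 3) == 0, do: [doubled | acc], else: acc
  end)
  |> Enum.reverse()

# Composed: the same single pass, described one step at a time with Stream.
composed =
  1..10
  |> Stream.map(&(&1 * 2))
  |> Stream.filter(&(rem(&1, 3) == 0))
  |> Enum.to_list()

by_hand == composed
# => true
```

The Stream version keeps each step separate and reorderable, which is where the abstraction pays off as pipelines grow.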

answered Oct 07 '22 05:10

Fred the Magic Wonder Dog
