What is the difference between the following two definitions?
cp [] = [[]]
cp (xs:xss) = [x:ys | x <- xs, ys <- cp xss]
----------------------------------------------
cp [] = [[]]
cp (xs:xss) = [x:ys | x <- xs, ys <- yss]
  where yss = cp xss
Sample output: cp [[1,2,3],[4,5]] => [[1,4],[1,5],[2,4],[2,5],[3,4],[3,5]]
According to Thinking Functionally With Haskell (p. 92), the second version is "a more efficient definition...[which] guarantees that cp xss is computed just once," though the author never explains why. I would have thought they were equivalent.
The two definitions are equivalent in the sense that they denote the same value, of course.
Operationally, they differ in their sharing behavior under call-by-need evaluation. jcast already explained why, but I want to add a shortcut that does not require explicitly desugaring the list comprehension. The rule is: any expression that is syntactically in a position where it could depend on a variable x will be recomputed each time x is bound to a value, even if the expression does not actually depend on x.
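To see where that "position" comes from, here is a sketch of how a comprehension with two generators desugars into nested concatMaps (demo and demoDesugared are names I've introduced for illustration; GHC's actual translation differs in detail but has the same scoping):

```haskell
-- A comprehension [e | x <- xs, y <- ys] desugars (roughly) into
-- nested concatMaps. Everything to the right of `x <- xs` ends up
-- under the lambda \x -> ..., i.e. in a position that *could*
-- depend on x.
demo :: [Int] -> [Int] -> [Int]
demo xs ys = [x * y | x <- xs, y <- ys]

demoDesugared :: [Int] -> [Int] -> [Int]
demoDesugared xs ys = concatMap (\x -> concatMap (\y -> [x * y]) ys) xs
```

In the desugared form it is visible that the second generator's source sits inside \x -> ..., which is exactly why call-by-need re-evaluates it for each binding of x.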
In your case, in the first definition, x is in scope in the position where cp xss appears, so cp xss will be re-evaluated for each element x of xs. In the second definition, cp xss appears outside the scope of x, so it is computed just once.
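Desugaring both definitions makes the scope difference explicit (a sketch; cp1 and cp2 are names I've introduced for the two versions):

```haskell
-- First definition, desugared: cp1 xss appears under \x -> ...,
-- so it is re-evaluated for each element x of xs.
cp1 :: [[a]] -> [[a]]
cp1 []       = [[]]
cp1 (xs:xss) = concatMap (\x -> map (x:) (cp1 xss)) xs

-- Second definition, desugared: yss is bound outside the lambda,
-- so under call-by-need a single evaluation of cp2 xss is shared
-- across every x.
cp2 :: [[a]] -> [[a]]
cp2 []       = [[]]
cp2 (xs:xss) = concatMap (\x -> map (x:) yss) xs
  where yss = cp2 xss
```

Both agree on the sample input: cp1 [[1,2,3],[4,5]] and cp2 [[1,2,3],[4,5]] each give [[1,4],[1,5],[2,4],[2,5],[3,4],[3,5]]; only the amount of recomputation differs.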
Then the usual disclaimers apply, namely:
The compiler is not required to adhere to the operational semantics of call-by-need evaluation, only to the denotational semantics. So it might compute things fewer times (floating out) or more times (floating in) than you would expect based on the above rule.
It's not true in general that more sharing is better. In this case, for example, it's probably not better, because the size of cp xss grows as quickly as the amount of work it took to compute it in the first place. In that situation, the cost of reading the value back from memory can exceed that of recomputing it (due to the cache hierarchy and the GC).