This is the usual definition of the fixed-point combinator in Haskell: <pre class="prettyprint"><code>fix :: (a -> a) -> a fix f = let x = f x in x </code></pre> On https://wiki.haskell.org/Prime_numbers, they define a different fixed-point combinator: <pre class="prettyprint"><code>_Y :: (t -> t) -> t _Y g = g (_Y g) -- multistage, non-sharing, g (g (g (g ...))) -- g (let x = g x in x) -- two g stages, sharing </code></pre> <blockquote> <code>_Y</code> is a non-sharing fixpoint combinator, here arranging for a recursive "telescoping" multistage primes production (a tower of producers). </blockquote> What exactly does this mean? What is "sharing" vs. "non-sharing" in that context? How does <code>_Y</code> differ from <code>fix</code>?

<code>_Y</code> is translated to the following STG: <pre class="prettyprint"><code>_Y f = let x = _Y f in f x </code></pre> <code>fix</code> is translated identically to the Haskell source: <pre class="prettyprint"><code>fix f = let x = f x in x </code></pre> So <code>fix f</code> sets up a recursive thunk <code>x</code> and returns it, while <code>_Y</code> is a recursive function, and importantly it’s not tail-recursive. Forcing <code>_Y f</code> enters <code>f</code>, passing a new call to <code>_Y f</code> as an argument, so each recursive call sets up a new thunk; forcing the <code>x</code> returned by <code>fix f</code> enters <code>f</code>, passing <code>x</code> itself as an argument, so each recursive call is into the same thunk—this is what’s meant by “sharing”. The sharing version usually has better memory usage, and also lets the GHC RTS detect some kinds of infinite loop. When a thunk is forced, before evaluation starts, it’s replaced with a “black hole”; if at any point during evaluation of a thunk a black hole is reached from the same thread, then we know we have an infinite loop and can throw an exception (which you may have seen displayed as <code>Exception: <<loop>></code>).

Sharing vs. non-sharing fixed-point combinator

Tags:

haskell

fixpoint-combinators

letrec

y-combinator

This is the usual definition of the fixed-point combinator in Haskell:

fix :: (a -> a) -> a
fix f = let x = f x in x

On https://wiki.haskell.org/Prime_numbers, they define a different fixed-point combinator:

_Y   :: (t -> t) -> t
_Y g = g (_Y g)                -- multistage, non-sharing,  g (g (g (g ...)))
    -- g (let x = g x in x)    -- two g stages, sharing

_Y is a non-sharing fixpoint combinator, here arranging for a recursive "telescoping" multistage primes production (a tower of producers).

What exactly does this mean? What is "sharing" vs. "non-sharing" in that context? How does _Y differ from fix?

628

asked Dec 11 '18 00:12

Joseph Sible-Reinstate Monica

2 Answers

"Sharing" means f x re-uses the x that it creates; but with _Y g = g . g . g . g . ..., each g calculates its output anew (cf. this and this).

In that context, the sharing version has much worse memory usage, leads to a space leak.¹

The definition of _Y mirrors the usual lambda calculus definition's effect for the Y combinator, which emulates recursion by duplication, while true recursion refers to the same (hence, shared) entity.

    x      = f x
    (_Y g) = g (_Y g)

both xs refer to the same entity, but each of (_Y g)s refer to equivalent, but separate, entity. That's the intention of it, anyway.

Of course thanks to referential transparency there's no guarantee in Haskell the language for any of this. But GHC the compiler does behave this way.

_Y g is a common sub-expression and it could be "eliminated" by a compiler by giving it a name and reusing that named entity, subverting the whole purpose of it. That's why the GHC has the "no common sub-expressions elimination" -fno-cse flag which prevents this explicitly. It used to be that you had to use this flag to achieve the desired behaviour here, but not anymore. GHC won't be as aggressive at common sub-expressions elimination anymore, with the more recent (read: several years now) versions.

disclaimer: I'm the author of that part of the page you're referring to. Was hoping for the back-and-forth that's usual on wiki pages, but it never came, so my work didn't get reviewed like that. Either no-one bothered, or it is passable (lacking major errors). The wiki seems to be largely abandoned for many years now.

¹ The g function involved,

(3:) . minus [5,7..] . foldr (\ (x:xs) ⟶ (x:) . union xs) [] 
                      . map (\ p ⟶ [p², p² + 2p..])

produces an increasing stream of all odd primes given an increasing stream of all odd primes. To produce a prime N in value, it consumes its input stream up to the first prime above sqrt(N) in value, at least. Thus the production points are given roughly by repeated squaring, and there are ~ log (log N) of such g functions in total in the chain (or "tower") of these primes producers, each immediately garbage collectible, the lowest one producing its primes given just the first odd prime, 3, known a priori.

And with the two-staged _Y2 g = g x where { x = g x } there would be only two of them in the chain, but only the top one would be immediately garbage collectible, as discussed at the referenced link above.

187

answered Sep 25 '22 14:09

Will Ness

_Y is translated to the following STG:

_Y f = let x = _Y f in f x

fix is translated identically to the Haskell source:

fix f = let x = f x in x

So fix f sets up a recursive thunk x and returns it, while _Y is a recursive function, and importantly it’s not tail-recursive. Forcing _Y f enters f, passing a new call to _Y f as an argument, so each recursive call sets up a new thunk; forcing the x returned by fix f enters f, passing x itself as an argument, so each recursive call is into the same thunk—this is what’s meant by “sharing”.

The sharing version usually has better memory usage, and also lets the GHC RTS detect some kinds of infinite loop. When a thunk is forced, before evaluation starts, it’s replaced with a “black hole”; if at any point during evaluation of a thunk a black hole is reached from the same thread, then we know we have an infinite loop and can throw an exception (which you may have seen displayed as Exception: <<loop>>).

answered Sep 23 '22 14:09

Jon Purdy

Related questions
                            
                                Haskell's Either v. Error
                            
                                The type variable ‘a0’ is ambiguous
                            
                                Haskell function that tests if a list has repeated (duplicate) elements
                            
                                Installing Cabal has missing modules
                            
                                Helping GHC unbox an Int in equality on a constant
                            
                                Is polykinded type application injective?
                            
                                Perform simple IO in Haskeline, inside InputT monad, without having to resort to unsafePerformIO
                            
                                Infinite type error when defining zip with foldr only; can it be fixed?
                            
                                Implementing Backtracking on Haskell
                            
                                Recreating Lisp's `apply` in Haskell using GADTs
                            
                                Recursion-schemes generalization of `tails`
                            
                                Applicative functor that reverses order of effects
                            
                                Can't find inerface-file declaration for variable
                            
                                Function Composition Do Notation
                            
                                Are Float and Double instances of Fractional?
                            
                                Idiomatic way to run test-suites in stack
                            
                                Is print in Haskell a pure function?
                            
                                type "name" = Int - is it worth declaring?
                            
                                What is the meaning for the convention "runSomething" in the Haskell community?
                            
                                How to write this polyvariadic composition function in Haskell?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With