I have little experience with algebraic data types, because I work in a language without native support. Usually one can use continuation passing style to get a remotely similar experience, but the handling of CPS encoded types is less comfortable. Considering this why would a library like Parsec use CPS? <pre class="prettyprint"><code>newtype ParsecT s u m a = ParsecT {unParser :: forall b . State s u -> (a -> State s u -> ParseError -> m b) -- consumed ok -> (ParseError -> m b) -- consumed err -> (a -> State s u -> ParseError -> m b) -- empty ok -> (ParseError -> m b) -- empty err -> m b } </code></pre> One clue would be the <code>try</code> parser, which rules out the consumed error case by passing the empty error continuation in both cases: <pre class="prettyprint"><code>try :: ParsecT s u m a -> ParsecT s u m a try p = ParsecT $ \s cok _ eok eerr -> unParser p s cok eerr eok eerr -- ^^^^ ^^^^ </code></pre> This is possible, because both continuations <code>cerr</code> and <code>eerr</code> have the same type and only differ in their position, which reminds me of structural typing. While I think you cannot do this with ADTs, there is probably a way to implement the same behavior with them. Apart from this, a single combinator wouldn't justify the far-reachuing decision to rely on CPS. So why was this decision made?

One big issue is that <code>ParsecT</code> is a monad transformer, and monad transformers defined using ADTs block optimizations more than monad transformers using CPS. The expression <code>pure x >>= k :: ExceptT e m a</code> gives a minimal illustrative example. <ol> <li> With <code>ExceptT e m a</code> defined as <code>m (Either e a)</code>, that expressions doesn't simplify well, because it involves <code>(>>=)</code> of the underlying monad <code>m</code> which is abstract. </li> <li> With <code>ExceptT e m a</code> defined as <code>forall r. (Either e a -> m r) -> m r</code>, <code>pure x >>= k</code> basically beta-reduces to <code>k x</code>, without making any assumption about <code>m</code>. </li> </ol> You need to specialize <code>m</code> to optimize a term of type <code>m (Either e a)</code> at all, whereas the continuation-based variant can go a long way without specialization. However, CPS is also not the ideal representation in Haskell, because the continuations are functions which get allocated on the heap. Functions are cheap but not zero cost. At the end of the day the abstractness of <code>m</code> in monad transformers really gets in the way, to optimize, you have to specialize, i.e., break modularity. Thus a more promising approach is to use some special primitive monad (<code>IO</code>) with dedicated support from the run-time system, as the basis of an effect system. <hr> See also the talk Effects for Less, by Alexis King, and the related library eff.

What are the requirements to prefer CPS over algebraic data types?

Tags:

functional-programming

haskell

algebraic-data-types

continuation-passing

I have little experience with algebraic data types, because I work in a language without native support. Usually one can use continuation passing style to get a remotely similar experience, but the handling of CPS encoded types is less comfortable.

Considering this why would a library like Parsec use CPS?

newtype ParsecT s u m a
    = ParsecT {unParser :: forall b .
                 State s u
              -> (a -> State s u -> ParseError -> m b) -- consumed ok
              -> (ParseError -> m b)                   -- consumed err
              -> (a -> State s u -> ParseError -> m b) -- empty ok
              -> (ParseError -> m b)                   -- empty err
              -> m b
             }

One clue would be the try parser, which rules out the consumed error case by passing the empty error continuation in both cases:

try :: ParsecT s u m a -> ParsecT s u m a
try p =
    ParsecT $ \s cok _ eok eerr ->
    unParser p s cok eerr eok eerr
--                   ^^^^     ^^^^

This is possible, because both continuations cerr and eerr have the same type and only differ in their position, which reminds me of structural typing. While I think you cannot do this with ADTs, there is probably a way to implement the same behavior with them. Apart from this, a single combinator wouldn't justify the far-reachuing decision to rely on CPS. So why was this decision made?

600

asked Jan 18 '21 10:01

Iven Marquardt

Video Answer

2 Answers

One big issue is that ParsecT is a monad transformer, and monad transformers defined using ADTs block optimizations more than monad transformers using CPS.

The expression pure x >>= k :: ExceptT e m a gives a minimal illustrative example.

With ExceptT e m a defined as m (Either e a), that expressions doesn't simplify well, because it involves (>>=) of the underlying monad m which is abstract.
With ExceptT e m a defined as forall r. (Either e a -> m r) -> m r, pure x >>= k basically beta-reduces to k x, without making any assumption about m.

You need to specialize m to optimize a term of type m (Either e a) at all, whereas the continuation-based variant can go a long way without specialization.

However, CPS is also not the ideal representation in Haskell, because the continuations are functions which get allocated on the heap. Functions are cheap but not zero cost.

At the end of the day the abstractness of m in monad transformers really gets in the way, to optimize, you have to specialize, i.e., break modularity. Thus a more promising approach is to use some special primitive monad (IO) with dedicated support from the run-time system, as the basis of an effect system.

See also the talk Effects for Less, by Alexis King, and the related library eff.

138

answered Sep 18 '22 17:09

Li-yao Xia

This change was made March 2, 2009 in commit a98a3ccb by Antoine Latter. The commit comment was just:

commit a98a3ccbca9835fe749b8cd2d475a0494a84a460
Author: Antoine Latter <[email protected]>
Date:   Mon Mar 2 00:20:00 2009 +0000

    move core data type over to CPS

and it replaced the original ADT for ParsecT:

data ParsecT s u m a
    = ParsecT { runParsecT :: State s u -> m (Consumed (m (Reply s u a))) }

with the new version in your question, adding a runParsecT adapter to convert everything back:

runParsecT :: Monad m => ParsecT s u m a -> State s u -> m (Consumed (m (Reply s u a)))
runParsecT p s = unParser p s cok cerr eok eerr
    where cok a s' err = return . Consumed . return $ Ok a s' err
          cerr err = return . Consumed . return $ Error err
          eok a s' err = return . Empty . return $ Ok a s' err
          eerr err = return . Empty . return $ Error err

I see that he made a blog post in February 2009 where he wrote about finally understanding CPS style and wrote about a CPS version of MaybeT. He didn't talk about any performance advantages, but merely noted that the CPS style had the advantage that Monad and MonadPlus instances for MaybeT could be written without calling >>= or return on the underlying monad or doing explicit case analysis of Maybe values.

In a later December 2009 blog post, he writes about his "obsession with functionalizing monad transformers", gives an example of ErrorT and explicitly notes:

I haven't benchmarked whether or not this is faster for anything, but I find the whole thing a lot of fun.

However, he then goes on in the same blog post to talk about how adding functionality to Parsec 3 to make it a monad transformer instead of a plain old monad and parametrizing it over the input type led to poor performance (about 1.8x slower on some benchmarks). He discovered that converting over to CPS style made Parsec 3 as fast as Parsec 2, at least when those new abstractions (transformers) weren't being used.

Speculating about the timing, I think that Antoine thought CPS was "cool" and had stylistic advantages that appealed to him, so he had it top of mind when working on Parsec 3. When new abstractions in Parsec 3 led to a performance problem, he fortuitously discovered that converting it to CPS appeared to fix them, though he didn't do a detailed study of why (just some speculation in the blog post). However, it's a little unclear to me if he actually converted to CPS first, before discovering the performance advantage, or tried CPS with the expectation that it might help performance. The blog post reads like the conversion was done intentionally to address performance, but that might have just been for sake of simpler exposition in the blog post.

answered Sep 19 '22 17:09

K. A. Buhr

Related questions
                            
                                Display function types in Haskell
                            
                                Why does my Haskell do-notation break when I try to desugar it?
                            
                                How to detect if a program has been compiled using -threaded?
                            
                                Haskell `forever` type signature
                            
                                Stack (Haskell) throws TlsException in Windows
                            
                                Is there any guarantee about the evaluation order within a pattern match?
                            
                                How Functional language are different from the language implementation point of view
                            
                                The difference between +1 and -1
                            
                                Haskell "not a visible field of constructor" error
                            
                                Web service frameworks in haskell
                            
                                Using STM and Database transactions together
                            
                                Do guarded pipes behave the same as pipes using await?
                            
                                Haskell numbers and type system?
                            
                                Are functions of arity-n really just an n-category due to currying? Can they be made into a 1-category?
                            
                                STUArray s i e - space leak only when i == Int?
                            
                                `cabal update` failed due to `out of memory`
                            
                                Haskell: Template Haskell and the scope
                            
                                What are the key differences between Java 8's Optional, Scala's Option and Haskell's Maybe?
                            
                                How to implement the equivalent of Go's select statement for Haskell STM channels?
                            
                                Understanding the Reader monad

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With