Using <code>proc</code> notation for <code>Arrow</code> seems to kill performance in my project. Here is a toy example of the problem: We define Coroutine newtype (mostly copying from Generalizing Streams into Coroutines) to represent Mealy machines (i.e. functions that carry some state) with instances of <code>Category</code> and <code>Arrow</code>, write <code>scan</code> wrapper function and <code>evalList</code> runner function for lists. Then we have <code>sumArr</code> and <code>sumArr'</code> functions where the latter is the former called within <code>proc</code> block. Compiling with <code>stack ghc -- --make test.hs -O2</code> using ghc-8.0.2 on OS X I get runtime of 0.087 secs for <code>sumArr</code> and 3.263 secs for <code>sumArr'</code> (with a heavy memory footprint). I would like to know if this in fact caused by using <code>proc</code> and if I can do something to have normal runtime behaviour when using <code>proc</code> notation (writing arrow code without it is painful). Thank you. <pre class="prettyprint"><code>{-# LANGUAGE Arrows #-} {-# LANGUAGE BangPatterns #-} import Prelude hiding (id, (.)) import Control.Arrow import Control.Category import qualified Data.List as L newtype Coroutine i o = Coroutine { runC :: i -> (o, Coroutine i o) } instance Category Coroutine where id = Coroutine $ \i -> (i, id) cof . cog = Coroutine $ \i -> let (x, cog') = runC cog i (y, cof') = runC cof x in (y, cof' . cog') instance Arrow Coroutine where arr f = Coroutine $ \i -> (f i, arr f) first co = Coroutine $ \(a,b) -> let (c, co') = runC co a in ((c,b), first co') scan :: (o -> t -> o) -> o -> Coroutine t o scan f = go where go i = Coroutine $ step i where step a b = let !a' = f a b in (a', go a') evalList :: Coroutine a b -> [a] -> [b] evalList a = L.map fst . L.drop 1 . L.scanl' (\(_, acc) v -> let !x = runC acc v in x) (undefined, a) sumArr, sumArr' :: Coroutine Int Int sumArr = scan (\acc x -> let !newAcc = acc + x in newAcc) 0 sumArr' = proc v -> do sumArr -< v testData :: [Int] testData = [1..1000000] main = print $ L.last $ evalList sumArr' testData </code></pre>

Yeah, this is probably caused by <code>proc</code> notation. The desugaring is very low-level, introducing a lot of (needless) <code>arr</code>s and not taking advantage of <code>&&&</code> or <code>***</code> at all. For example, last I checked, this: <pre class="prettyprint"><code>mulA f g = proc x -> do a <- f -< x b <- g -< x returnA -< a * b </code></pre> Is desugared to something like this: <pre class="prettyprint"><code>mulA f g = arr dup >>> first f >>> arr swap >>> first g >>> arr mul where dup x = (x, x) swap (x, y) = (y, x) mul = uncurry (*) </code></pre> When it could be just this: <pre class="prettyprint"><code>mulA f g = f &&& g >>> arr mul </code></pre> And this: <pre class="prettyprint"><code>proc x -> do a <- f -< x b <- g -< a returnA -</pre> Becomes something like this: <pre class="prettyprint"><code>arr id >>> f >>> arr id >>> g >>> arr id >>> returnA </code></pre> Instead of this: <pre class="prettyprint"><code>f >>> g </code></pre> Moreover I don’t think there are any GHC rewrite rules that take advantage of the arrow laws to help account for this.

Proc syntax in Haskell Arrows leads to severe performance penalty

Tags:

haskell

arrows

Using proc notation for Arrow seems to kill performance in my project. Here is a toy example of the problem:

We define Coroutine newtype (mostly copying from Generalizing Streams into Coroutines) to represent Mealy machines (i.e. functions that carry some state) with instances of Category and Arrow, write scan wrapper function and evalList runner function for lists.

Then we have sumArr and sumArr' functions where the latter is the former called within proc block.

Compiling with stack ghc -- --make test.hs -O2 using ghc-8.0.2 on OS X I get runtime of 0.087 secs for sumArr and 3.263 secs for sumArr' (with a heavy memory footprint).

I would like to know if this in fact caused by using proc and if I can do something to have normal runtime behaviour when using proc notation (writing arrow code without it is painful). Thank you.

{-# LANGUAGE Arrows #-}
{-# LANGUAGE BangPatterns #-}

import Prelude hiding (id, (.))
import Control.Arrow
import Control.Category
import qualified Data.List as L

newtype Coroutine i o = Coroutine { runC :: i -> (o, Coroutine i o) }

instance Category Coroutine where
    id = Coroutine $ \i -> (i, id)

    cof . cog = Coroutine $ \i ->
        let (x, cog') = runC cog i
            (y, cof') = runC cof x
        in (y, cof' . cog')

instance Arrow Coroutine where
    arr f = Coroutine $ \i -> (f i, arr f)

    first co = Coroutine $ \(a,b) ->
        let (c, co') = runC co a in ((c,b), first co')

scan :: (o -> t -> o) -> o -> Coroutine t o
scan f = go where
    go i = Coroutine $ step i where
            step a b = let !a' = f a b in (a', go a')

evalList :: Coroutine a b -> [a] -> [b]
evalList a = L.map fst . L.drop 1 . L.scanl' (\(_, acc) v -> let !x = runC acc v in x) (undefined, a)

sumArr, sumArr' :: Coroutine Int Int
sumArr = scan (\acc x -> let !newAcc = acc + x in newAcc) 0
sumArr' = proc v -> do sumArr -< v

testData :: [Int]
testData = [1..1000000]

main = print $ L.last $ evalList sumArr' testData

765

asked Jul 22 '17 23:07

Artem Solod

1 Answers

Yeah, this is probably caused by proc notation. The desugaring is very low-level, introducing a lot of (needless) arrs and not taking advantage of &&& or *** at all.

For example, last I checked, this:

mulA f g = proc x -> do
  a <- f -< x
  b <- g -< x
  returnA -< a * b

Is desugared to something like this:

mulA f g = arr dup
  >>> first f
  >>> arr swap
  >>> first g
  >>> arr mul
  where
    dup x = (x, x)
    swap (x, y) = (y, x)
    mul = uncurry (*)

When it could be just this:

mulA f g = f &&& g >>> arr mul

And this:

proc x -> do
  a <- f -< x
  b <- g -< a
  returnA -< b

Becomes something like this:

arr id
  >>> f
  >>> arr id
  >>> g
  >>> arr id
  >>> returnA

Instead of this:

f >>> g

Moreover I don’t think there are any GHC rewrite rules that take advantage of the arrow laws to help account for this.

159

answered Nov 14 '22 01:11

Jon Purdy

Related questions
                            
                                :sprint for polymorphic values?
                            
                                If MonadPlus is the "generator" class, then what is the "consumer" class?
                            
                                Why is my little STRef Int require allocating gigabytes?
                            
                                Is my experience with setting up Haskell dev environment for the first time common or a one-off?
                            
                                Find the value that failed for quickcheck
                            
                                How to put constraints on the associated data?
                            
                                How unpacking strict fields goes together with polymorphism?
                            
                                When (and when not) to define a Monad
                            
                                How to build an AngularJS app with Yesod
                            
                                Is there a Codensity MonadPlus that asymptotically optimizes a sequence of MonadPlus operations?
                            
                                Why is this Haskell code so much slower than the C equivalent? Unboxed vectors and bangs already used
                            
                                How to upgrade GHC with Stack
                            
                                Why is the Haddock documentation not showing up on Hackage?
                            
                                How do you override Haskell type class instances provided by package code?
                            
                                How to write a simple HTTP server in Haskell using Network.HTTP.receiveHTTP
                            
                                Why isn't the Prelude's words function written more simply?
                            
                                Haskell: Lazy vs. Strict Text values, which one is recommended when?
                            
                                GHC rewrite rules with class constraints
                            
                                How can I deal with comments in my AST?
                            
                                Word foldl' isn't optimized as well as Int foldl'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With