Given the program: <pre class="prettyprint"><code>import Debug.Trace main = print $ trace "hit" 1 + trace "hit" 1 </code></pre> If I compile with <code>ghc -O</code> (7.0.1 or higher) I get the output: <pre class="prettyprint"><code>hit 2 </code></pre> i.e. GHC has used common sub-expression elimination (CSE) to rewrite my program as: <pre class="prettyprint"><code>main = print $ let x = trace "hit" 1 in x + x </code></pre> If I compile with <code>-fno-cse</code> then I see <code>hit</code> appearing twice. Is it possible to avoid CSE by modifying the program? Is there any sub-expression <code>e</code> for which I can guarantee <code>e + e</code> will not be CSE'd? I know about <code>lazy</code>, but can't find anything designed to inhibit CSE. The background of this question is the cmdargs library, where CSE breaks the library (due to impurity in the library). One solution is to ask users of the library to specify <code>-fno-cse</code>, but I'd prefer to modify the library.

How about removing the source of the trouble -- the implicit effect -- by using a sequencing monad that introduces that effect? E.g. the strict identity monad with tracing: <pre class="prettyprint"><code>data Eval a = Done a | Trace String a instance Monad Eval where return x = Done x Done x >>= k = k x Trace s a >>= k = trace s (k a) runEval :: Eval a -> a runEval (Done x) = x track = Trace </code></pre> now we can write stuff with a guaranteed ordering of the <code>trace</code> calls: <pre class="prettyprint"><code>main = print $ runEval $ do t1 <- track "hit" 1 t2 <- track "hit" 1 return (t1 + t2) </code></pre> while still being pure code, and GHC won't try to get to clever, even with <code>-O2</code>: <pre class="prettyprint"><code> $ ./A hit hit 2 </code></pre> So we introduce just the computation effect (tracing) sufficient to teach GHC the semantics we want. This is extremely robust to compile optimizations. So much so that GHC optimizes the math to <code>2</code> at compile time, yet still retains the ordering of the <code>trace</code> statements. <hr> As evidence of how robust this approach is, here's the core with <code>-O2</code> and aggressive inlining: <pre class="prettyprint"><code>main2 = case Debug.Trace.trace string trace2 of Done x -> case x of I# i# -> $wshowSignedInt 0 i# [] Trace _ _ -> err trace2 = Debug.Trace.trace string d d :: Eval Int d = Done n n :: Int n = I# 2 string :: [Char] string = unpackCString# "hit" </code></pre> So GHC has done everything it could to optimize the code -- including computing the math statically -- while still retaining the correct tracing. <hr> References: the useful <code>Eval</code> monad for sequencing was introduced by Simon Marlow.

Reading the source code to GHC, the only expressions that aren't eligible for CSE are those which fail the <code>exprIsBig</code> test. Currently that means the <code>Expr</code> values <code>Note</code>, <code>Let</code> and <code>Case</code>, and expressions which contain those. Therefore, an answer to the above question would be: <pre class="prettyprint"><code>unit = reverse "" `seq` () main = print $ trace "hit" (case unit of () -> 1) + trace "hit" (case unit of () -> 1) </code></pre> Here we create a value <code>unit</code> which resolves to <code>()</code>, but which GHC can't determine the value for (by using a recursive function GHC can't optimise away - <code>reverse</code> is just a simple one to hand). This means GHC can't CSE the <code>trace</code> function and it's 2 arguments, and we get <code>hit</code> printed twice. This works with both GHC 6.12.4 and 7.0.3 at <code>-O2</code>.

How to prevent common sub-expression elimination (CSE) with GHC

Q: Which of the following is an example of common subexpression elimination?

(D) x = 4 &lowast; 5 => x = 20 is an example of common subexpression elimination.

Tags:

optimization

haskell

compiler-construction

ghc

Given the program:

import Debug.Trace main = print $ trace "hit" 1 + trace "hit" 1

If I compile with ghc -O (7.0.1 or higher) I get the output:

hit 2

i.e. GHC has used common sub-expression elimination (CSE) to rewrite my program as:

main = print $ let x = trace "hit" 1 in x + x

If I compile with -fno-cse then I see hit appearing twice.

Is it possible to avoid CSE by modifying the program? Is there any sub-expression e for which I can guarantee e + e will not be CSE'd? I know about lazy, but can't find anything designed to inhibit CSE.

The background of this question is the cmdargs library, where CSE breaks the library (due to impurity in the library). One solution is to ask users of the library to specify -fno-cse, but I'd prefer to modify the library.

868

asked May 07 '11 09:05

Neil Mitchell

2 Answers

How about removing the source of the trouble -- the implicit effect -- by using a sequencing monad that introduces that effect? E.g. the strict identity monad with tracing:

data Eval a = Done a             | Trace String a  instance Monad Eval where   return x = Done x    Done x    >>= k = k x   Trace s a >>= k = trace s (k a)  runEval :: Eval a -> a runEval (Done x) = x  track = Trace

now we can write stuff with a guaranteed ordering of the trace calls:

main = print $ runEval $ do             t1 <- track "hit" 1             t2 <- track "hit" 1             return (t1 + t2)

while still being pure code, and GHC won't try to get to clever, even with -O2:

    $ ./A     hit     hit     2

So we introduce just the computation effect (tracing) sufficient to teach GHC the semantics we want.

This is extremely robust to compile optimizations. So much so that GHC optimizes the math to 2 at compile time, yet still retains the ordering of the trace statements.

As evidence of how robust this approach is, here's the core with -O2 and aggressive inlining:

main2 =   case Debug.Trace.trace string trace2 of     Done x -> case x of          I# i# -> $wshowSignedInt 0 i# []     Trace _ _ -> err  trace2 = Debug.Trace.trace string d  d :: Eval Int d = Done n  n :: Int n = I# 2  string :: [Char] string = unpackCString# "hit"

So GHC has done everything it could to optimize the code -- including computing the math statically -- while still retaining the correct tracing.

References: the useful Eval monad for sequencing was introduced by Simon Marlow.

173

answered Sep 28 '22 09:09

Don Stewart

Reading the source code to GHC, the only expressions that aren't eligible for CSE are those which fail the exprIsBig test. Currently that means the Expr values Note, Let and Case, and expressions which contain those.

Therefore, an answer to the above question would be:

unit = reverse "" `seq` ()  main = print $ trace "hit" (case unit of () -> 1) +                trace "hit" (case unit of () -> 1)

Here we create a value unit which resolves to (), but which GHC can't determine the value for (by using a recursive function GHC can't optimise away - reverse is just a simple one to hand). This means GHC can't CSE the trace function and it's 2 arguments, and we get hit printed twice. This works with both GHC 6.12.4 and 7.0.3 at -O2.

answered Sep 28 '22 09:09

Neil Mitchell

Related questions
                            
                                PHP - Function inside a Function. Good or bad?
                            
                                Effect of Screen Updating
                            
                                Saving time and memory using parfor?
                            
                                three.js: how to control rendering order
                            
                                Can -ffast-math be safely used on a typical project?
                            
                                How to remove common lines between two files without sorting? [duplicate]
                            
                                How to concatenate two integers in C
                            
                                What do you have in your log4net config? Hacks, optimizations, observations?
                            
                                optimized memcpy
                            
                                Does java support and optimize away tail-recursive calls?
                            
                                How does the GCC implementation of modulo (%) work, and why does it not use the div instruction?
                            
                                Are C++11 move semantics doing something new, or just making semantics clearer?
                            
                                Optimizing alternatives to DateTime.Now
                            
                                Rewriting as a practical optimization technique in GHC: Is it really needed?
                            
                                Laravel artisan optimize Best Practices
                            
                                Have you ever obtained a significant speedup by using boost::pool?
                            
                                OrderedDict performance (compared to deque)
                            
                                PHP website Optimization
                            
                                How fast can you make linear search?
                            
                                GCC multiple optimization flags

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to prevent common sub-expression elimination (CSE) with GHC

Tags:

optimization

haskell

compiler-construction

ghc

Neil Mitchell

People also ask

2 Answers

Don Stewart

Neil Mitchell

Recent Activity

Donate For Us