How to properly force evaluation of a pure value in the IO monad?

Tags:

haskell

lazy-evaluation

I need to force evaluation of a pure value in the IO monad. I'm writing a higher-level interface to C bindings. On the lower level I have, say, a newFile function and a freeFile function. newFile returns some id, an opaque object I've defined on the lower level. You basically cannot do anything with it except use it to free the file and purely calculate something associated with that file.

So, I have (simplified):

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path -- ‘fid’ stands for “file id”
  let x = runGetter g fid
  freeFile fid
  return x

This is the initial version of the function. We need to calculate x before freeFile is called. (The code works; if I remove freeFile it's all fine, but I want to free the resource, you know.)

First attempt (we will use seq to “force” evaluation):

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  let x = runGetter g fid
  x `seq` freeFile fid
  return x

Segmentation fault. Going straight to the documentation of seq:

The value of seq a b is bottom if a is bottom, and otherwise equal to b. seq is usually introduced to improve performance by avoiding unneeded laziness.

A note on evaluation order: the expression seq a b does not guarantee that a will be evaluated before b. The only guarantee given by seq is that the both a and b will be evaluated before seq returns a value. In particular, this means that b may be evaluated before a. If you need to guarantee a specific order of evaluation, you must use the function pseq from the "parallel" package.

A good note indeed; I've seen people claiming different things about the order of evaluation in this case. What about pseq? Do I need to depend on parallel just because of pseq? Hmm… maybe there is another way.

{-# LANGUAGE BangPatterns #-}

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  let !x = runGetter g fid
  freeFile fid
  return x

Segmentation fault. Well, that answer doesn't work in my case. But it suggests evaluate, so let's try it too:

import Control.Exception (evaluate)
import Control.Monad (void)

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  let x = runGetter g fid
  void $ evaluate x
  freeFile fid
  return x

Segmentation fault. Maybe we should use the value returned by evaluate?

import Control.Exception (evaluate)

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  let x = runGetter g fid
  x' <- evaluate x
  freeFile fid
  return x'

No, bad idea. Maybe we could chain seq:

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  let x = runGetter g fid
  x `seq` freeFile fid `seq` return x

This works. But is this the right way to do it? Maybe it only works due to some volatile optimization logic? I don't know. If seq associates to the left in this case, then according to that description both x and freeFile are evaluated when return x returns its value. But again, which of them, x or freeFile, is evaluated first? Since I don't get a segfault, it must be x, but is this result reliable? Do you know how to properly force evaluation of x before freeFile?
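
One thing worth checking about the chained version: seq is declared infixr 0, so that last line parses as x `seq` (freeFile fid `seq` return x). That only evaluates the freeFile fid action to weak head normal form; it does not execute it, which would explain the missing segfault (the file is simply never freed). A minimal sketch of this, where fakeFree, chained, sequenced and freeCount are hypothetical names and a counter stands in for the real freeFile:

```haskell
import Data.IORef (IORef, modifyIORef', newIORef, readIORef)

-- Hypothetical stand-in for freeFile: bumps a counter each time it really runs.
fakeFree :: IORef Int -> IO ()
fakeFree counter = modifyIORef' counter (+ 1)

-- Mirrors the chained-seq version: the action value is evaluated, never executed.
chained :: IORef Int -> Int -> IO Int
chained counter x = x `seq` fakeFree counter `seq` return x

-- Sequencing with (>>) makes the action part of the IO computation, so it runs.
sequenced :: IORef Int -> Int -> IO Int
sequenced counter x = x `seq` (fakeFree counter >> return x)

-- Runs one of the variants above and reports how many times fakeFree executed.
freeCount :: (IORef Int -> Int -> IO Int) -> IO Int
freeCount f = do
  counter <- newIORef 0
  _ <- f counter 7
  readIORef counter
```

In GHCi, freeCount chained gives 0 and freeCount sequenced gives 1, so the chained form should not be read as evidence that x was forced before the free.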

258

asked Oct 29 '15 20:10

Mark Karpov

1 Answer

One possible problem is that newFile is doing some lazy IO, and that runGetter is a sufficiently lazy consumer that running seq on its output does not force all of newFile's IO to actually happen. This can be fixed by using deepseq instead of seq:

import Control.DeepSeq (NFData, deepseq)

execGetter :: NFData a => FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  let x = runGetter g fid
  x `deepseq` freeFile fid
  return x
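
To make the difference concrete, here is a small sketch of how far seq-style forcing reaches compared with deepseq-style forcing. It assumes the deepseq package (which ships with GHC); partial, survives, whnfOk and nfOk are names invented for this illustration, and the undefined element plays the role of work that would only be valid while the file is still open.

```haskell
import Control.DeepSeq (force)
import Control.Exception (SomeException, catch, evaluate)

-- A list whose spine and first element are fine but whose second element is bottom.
partial :: [Int]
partial = [1, undefined]

-- Runs an action and reports whether it completed without throwing.
survives :: IO a -> IO Bool
survives act = (act >> pure True) `catch` handler
  where
    handler :: SomeException -> IO Bool
    handler _ = pure False

-- evaluate stops at weak head normal form: only the outermost (:) is inspected.
whnfOk :: IO Bool
whnfOk = survives (evaluate partial)

-- force walks the whole structure, so the hidden bottom is hit right here.
nfOk :: IO Bool
nfOk = survives (evaluate (force partial))
```

Here whnfOk yields True and nfOk yields False: shallow forcing leaves the inner thunk for later, which in execGetter would mean after freeFile has already run.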

Another possibility that this will address is that runGetter is claiming to be pure, but actually isn't (and is a lazy producer). However, if that's the case, the correct fix is not to use deepseq here, but to eliminate the uses of unsafePerformIO from runGetter, then use:

execGetter :: FilePath -> TagGetter a -> IO a
execGetter path g = do
  fid <- newFile path
  x <- runGetter g fid
  freeFile fid
  return x

which should then work without further fiddling with forcing.
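
If runGetter has to stay pure, one way to combine the ideas above is to force the result fully while the handle is still open and let bracket take care of the free. This is only a sketch under stated assumptions: FileId, TagGetter, newFile, freeFile and titleLength below are hypothetical stand-ins (an IORef flag simulates the C handle, and unsafePerformIO simulates a "pure" FFI read), not the asker's real bindings.

```haskell
import Control.DeepSeq (NFData, force)
import Control.Exception (bracket, evaluate)
import Data.IORef (IORef, newIORef, readIORef, writeIORef)
import System.IO.Unsafe (unsafePerformIO)

-- Hypothetical stand-in for the opaque C handle: True while the file is open.
newtype FileId = FileId (IORef Bool)

-- Hypothetical stand-in for the getter type from the question.
newtype TagGetter a = TagGetter (FileId -> a)

runGetter :: TagGetter a -> FileId -> a
runGetter (TagGetter f) = f

newFile :: FilePath -> IO FileId
newFile _ = FileId <$> newIORef True

freeFile :: FileId -> IO ()
freeFile (FileId ref) = writeIORef ref False

-- Simulates a "pure" read that is only valid while the handle is open.
titleLength :: TagGetter Int
titleLength = TagGetter $ \(FileId ref) ->
  unsafePerformIO $ do
    open <- readIORef ref
    if open then pure 5 else error "read after freeFile"

-- Fully evaluates the result inside bracket, so freeFile runs only afterwards
-- (and also runs if the getter throws).
execGetter :: NFData a => FilePath -> TagGetter a -> IO a
execGetter path g =
  bracket (newFile path) freeFile $ \fid ->
    evaluate (force (runGetter g fid))
```

With these stubs, execGetter "song.mp3" titleLength returns 5 instead of hitting the simulated use-after-free. The NFData constraint is the cost of this approach: every result type needs an instance.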

170

answered Oct 07 '22 15:10

Daniel Wagner
