Haskell IO is often explained in terms of the entire program being a pure function (<code>main</code>) that returns an IO value (often described as an imperative IO program), which is then executed by the runtime. This mental model works fine for simple examples, but fell over for me as soon as I saw a recursive <code>main</code> in Learn You A Haskell. For example: <pre class="prettyprint"><code>main = do line <- getLine putStrLn line main </code></pre> Or, if you prefer: <pre class="prettyprint"><code>main = getLine >>= putStrLn >> main </code></pre> Since <code>main</code> never terminates, it never actually returns an IO value, yet the program endlessly reads and echoes back lines just fine - so the simple explanation above doesn't quite work. Am I missing something simple or is there a more complete explanation (or is it 'simply' compiler magic) ?

The value <code>main</code> denotes is an infinite program: <pre class="prettyprint"><code>main = do line <- getLine putStrLn line line <- getLine putStrLn line line <- getLine putStrLn line line <- getLine putStrLn line line <- getLine putStrLn line line <- getLine putStrLn line ... </code></pre> But it's represented in memory as a recursive structure that references itself. That representation is finite, unless someone tries to unfold the entire thing to get a non-recursive representation of the entire program - that would never finish. But just as you can probably figure out how to start executing the infinite program I wrote above without waiting for me to tell you "all" of it, so can Haskell's runtime system figure out how to execute <code>main</code> without unfolding the recursion up-front. Haskell's lazy evaluation is actually interleaved with the runtime system's execution of the <code>main</code> IO program, so this works even for a function that returns an <code>IO</code> action which recursively invokes the function, like: <pre class="prettyprint"><code>main = foo 1 foo :: Integer -> IO () foo x = do print x foo (x + 1) </code></pre> Here <code>foo 1</code> is not a recursive value (it contains <code>foo 2</code>, not <code>foo 1</code>), but it's still an infinite program. However this works just fine, because the program denoted by <code>foo 1</code> is only generated lazily on-demand; it can be produced as the runtime system's execution of <code>main</code> goes along. By default Haskell's laziness means that nothing is evaluated until it's needed, and then only "just enough" to get past the current block. Ultimately the source of all the "need" in "until it's needed" comes from the runtime system needing to know what the next step in the <code>main</code> program is so it can execute it. But it's only ever the next step; the rest of the program after that can remain unevaluated until after the next step has been fully executed. So infininte programs can be executed and do useful work so long as it's always only a finite amount of work to generate "one more step".

Why/how does recursive IO work?

Tags:

haskell

Haskell IO is often explained in terms of the entire program being a pure function (main) that returns an IO value (often described as an imperative IO program), which is then executed by the runtime.

This mental model works fine for simple examples, but fell over for me as soon as I saw a recursive main in Learn You A Haskell. For example:

main = do
  line <- getLine
  putStrLn line
  main

Or, if you prefer:

main = getLine >>= putStrLn >> main

Since main never terminates, it never actually returns an IO value, yet the program endlessly reads and echoes back lines just fine - so the simple explanation above doesn't quite work. Am I missing something simple or is there a more complete explanation (or is it 'simply' compiler magic) ?

727

asked Jan 28 '15 21:01

DNA

2 Answers

In this case, main is a value of type IO () rather than a function. You can think of it as a sequence of IO a values:

main = getLine >>= putStrLn >> main

This makes it a recursive value, not unlike infinite lists:

foo = 1 : 2 : foo

We can return a value like this without needing to evaluate the whole thing. In fact, it's a reasonably common idiom.

foo will loop forever if you try to use the whole thing. But that's true of main too: unless you use some external method to break out of it, it will never stop looping! But you can start getting elements out of foo, or executing parts of main, without evaluating all of it.

answered Oct 17 '22 15:10

Tikhon Jelvis

The value main denotes is an infinite program:

main = do
  line <- getLine
  putStrLn line
  line <- getLine
  putStrLn line
  line <- getLine
  putStrLn line
  line <- getLine
  putStrLn line
  line <- getLine
  putStrLn line
  line <- getLine
  putStrLn line
  ...

But it's represented in memory as a recursive structure that references itself. That representation is finite, unless someone tries to unfold the entire thing to get a non-recursive representation of the entire program - that would never finish.

But just as you can probably figure out how to start executing the infinite program I wrote above without waiting for me to tell you "all" of it, so can Haskell's runtime system figure out how to execute main without unfolding the recursion up-front.

Haskell's lazy evaluation is actually interleaved with the runtime system's execution of the main IO program, so this works even for a function that returns an IO action which recursively invokes the function, like:

main = foo 1

foo :: Integer -> IO ()
foo x = do
  print x
  foo (x + 1)

Here foo 1 is not a recursive value (it contains foo 2, not foo 1), but it's still an infinite program. However this works just fine, because the program denoted by foo 1 is only generated lazily on-demand; it can be produced as the runtime system's execution of main goes along.

By default Haskell's laziness means that nothing is evaluated until it's needed, and then only "just enough" to get past the current block. Ultimately the source of all the "need" in "until it's needed" comes from the runtime system needing to know what the next step in the main program is so it can execute it. But it's only ever the next step; the rest of the program after that can remain unevaluated until after the next step has been fully executed. So infininte programs can be executed and do useful work so long as it's always only a finite amount of work to generate "one more step".

answered Oct 17 '22 14:10

Ben

Related questions
                            
                                Are Ord and Enum sometimes incompatible in Haskell?
                            
                                Haskell: Monitor a file without polling (à la inotify in linux)
                            
                                Compiler switch to turn debugging messages on/off?
                            
                                How to install Haskell Platform on Linux Debian Wheezy?
                            
                                composing functions with higher arity
                            
                                How do I model inheritance in Haskell?
                            
                                How do I give a Functor instance to a datatype built for general recursion schemes?
                            
                                How do I use the Church encoding for Free Monads?
                            
                                Is there a "chain" monad function in Haskell?
                            
                                Haskell - How does this average function work?
                            
                                Haskell: Can I use a where clause after a block with bind operators (>>=)?
                            
                                Dealing with large files in Haskell
                            
                                dealing with IO vs pure code in haskell
                            
                                How can I specify that two operations commute in a typeclass?
                            
                                Are Haskell List Comprehensions Inefficient?
                            
                                The type signature of Parsec function 'parse' and the class 'Stream'
                            
                                How to parse a decimal fraction into Rational in Haskell?
                            
                                Find max element and index of a list in Haskell
                            
                                Is it a reasonable practice to serialize Haskell data structures to disk just using Show/Read
                            
                                Useful instantiations of “fix” on non-function types?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With