How do laziness and parallelism coexist in Haskell?

People argue that Haskell has an advantage in parallelism because of its immutable data structures. But Haskell is also lazy, which means data actually is mutated under the hood: a thunk is overwritten with its evaluated result.

So it seems laziness could undermine the advantage of immutability. Am I wrong, or does Haskell have countermeasures for this problem? Or is this simply how Haskell works by design?

asked Aug 12 '19 by damhiya

People also ask

Why is Haskell a lazy language?

Haskell is a lazy language. This means that the evaluation of expressions is delayed until their values are actually needed. The opposite is eager evaluation, which is what most programming languages implement, like C, Java, and Python.
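As a minimal illustration (not part of the original answer), an infinite list is only computed as far as it is demanded:

```haskell
-- Laziness lets us define an infinite list; only the demanded
-- prefix is ever computed.
naturals :: [Integer]
naturals = [0 ..]

main :: IO ()
main = print (take 5 naturals)  -- prints [0,1,2,3,4]
```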

Does Haskell support lazy processing?

Haskell uses a special form of evaluation called lazy evaluation. In lazy evaluation, no code is evaluated until it's needed. In the case of longList, none of the values in the list were needed for the computation.

How does lazy evaluation work in Haskell?

Lazy evaluation is a method to evaluate a Haskell program. It means that expressions are not evaluated when they are bound to variables, but their evaluation is deferred until their results are needed by other computations.

Does Haskell support lazy evaluation?

Haskell is a lazy language, meaning that it employs lazy evaluation. Before explaining lazy evaluation, let's first explain strict evaluation, which most readers will likely be more familiar with. Strict evaluation means that arguments to functions are evaluated prior to being passed to the functions.
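Putting these snippets together, a small sketch (using Debug.Trace purely to make evaluation visible) shows both the deferral and the one-time update the question asks about: the thunk is evaluated on first demand, and the result is shared afterwards.

```haskell
import Debug.Trace (trace)

main :: IO ()
main = do
  -- Binding x builds a thunk; nothing is evaluated yet.
  let x = trace "evaluating x" (2 + 2 :: Int)
  putStrLn "x is bound but not yet evaluated"
  print x  -- first demand: the trace fires (on stderr), then 4 is printed
  print x  -- the thunk has been overwritten with 4; no trace this time
```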


1 Answer

Yes, GHC’s RTS uses thunks to implement non-strict evaluation, and they use mutation under the hood, so they require some synchronisation. However, this is simplified due to the fact that most heap objects are immutable and functions are referentially transparent.

In a multithreaded program, evaluation of a thunk proceeds as follows:

  • The thunk is atomically replaced with a BLACKHOLE object

  • If the same thread attempts to force the thunk after it’s been updated to a BLACKHOLE, this represents an infinite loop, and the RTS throws an exception (<<loop>>)

  • If a different thread attempts to force the thunk while it’s a BLACKHOLE, it blocks until the original thread has finished evaluating the thunk and updated it with a value

  • When evaluation is complete, the original thread atomically replaces the thunk with its result, e.g. using a compare-and-swap (CAS) instruction

So there is a potential race here: if two threads attempt to force the same thunk at the same time, they may both begin evaluating it. In that case, they will do some redundant work—however, one thread will succeed in overwriting the BLACKHOLE with the result, and the other thread will simply discard the result that it calculated, because its CAS will fail.
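A hedged sketch of this behaviour (compile with -threaded to get real parallelism; the names here are illustrative, not from the answer): two threads force the same shared thunk, and regardless of which one "wins" the update, both observe the same value.

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.MVar (newEmptyMVar, putMVar, takeMVar)

main :: IO ()
main = do
  let x = sum [1 .. 100000 :: Integer]  -- one shared, unevaluated thunk
  m1 <- newEmptyMVar
  m2 <- newEmptyMVar
  _ <- forkIO (putMVar m1 $! x)  -- both threads demand the same thunk
  _ <- forkIO (putMVar m2 $! x)
  r1 <- takeMVar m1
  r2 <- takeMVar m2
  -- Even if both threads began evaluating, the observed result is identical.
  print (r1 == r2 && r1 == 5000050000)
```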

Safe code cannot detect this, because it can’t obtain the address of an object or determine the state of a thunk. And in practice, this type of collision is rare for a couple of reasons:

  • Concurrent code generally partitions workloads across threads in a manner suited to the particular problem, so there is low risk of overlap

  • Evaluation of thunks is generally fairly “shallow” before you reach weak head normal form, so the probability of a “collision” is low
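For instance (a sketch, again with trace used only to make evaluation observable), forcing a list to weak head normal form evaluates just the outermost constructor, not its elements:

```haskell
import Debug.Trace (trace)

main :: IO ()
main = do
  let xs = map (\n -> trace ("forcing " ++ show n) (n * 2)) [1, 2, 3 :: Int]
  -- seq forces xs only to weak head normal form: the outermost (:) cell.
  xs `seq` putStrLn "forced to WHNF; no element evaluated yet"
  print (head xs)  -- only now is the first element actually computed
```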

So thunks ultimately provide a good performance tradeoff when implementing non-strict evaluation, even in a concurrent context.

answered Sep 23 '22 by Jon Purdy