 

Concurrent generic data structure without deadlocks or resource starvation

I've recently asked a number of questions regarding TVar, and I still have concerns about livelock.

So I thought of this structure:

  1. Each transaction gets a unique priority (perhaps allocated in order of creation).
  2. Transactions attempt to get read/write locks on data they access. Naturally, simultaneous reads are okay, but one write lock excludes all others (both read and write).
  3. Say transaction A has higher priority than transaction B. If A holds the lock, B waits, but if B holds the lock and A wants it, B is booted from the lock, A obtains it, and transaction B restarts (like with TVar). B however keeps its current priority for the retry.
  4. When a lock is freed and there are transactions waiting, it goes to the highest priority transaction, and the rest continue to wait.
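The boot-or-wait rule in steps 3 and 4 is essentially the classic wound-wait scheme for deadlock prevention. A minimal sketch of the decision (names and the "lower number = higher priority" convention are my assumptions):

```haskell
-- Sketch of the conflict rule: what happens when a transaction with
-- priority `requester` wants a lock held by a transaction with priority
-- `holder`. Convention (an assumption of this sketch): lower number =
-- higher priority.
data Action = Wait | BootHolder deriving (Eq, Show)

resolve :: Int -> Int -> Action
resolve requester holder
  | requester < holder = BootHolder  -- requester outranks holder: holder is
                                     -- booted, restarts, keeps its priority
  | otherwise          = Wait        -- requester queues behind the holder
```

Because priorities are unique and a booted transaction keeps its priority, every transaction eventually becomes the highest-priority one still running, which is what rules out starvation.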

I believe this system prevents deadlocks and also prevents starvation (unlike TVar). I was wondering if anyone has implemented such a system, as it seems fairly obvious and I don't want to reinvent the wheel.

Of course, such an approach could easily be extended to allow the user to specify priorities.

Priority could be the pair (user_supplied_prio, auto_increment), with user_supplied_prio taking precedence, but equal priority tasks resolving in FIFO order.
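That pair already has the right ordering in Haskell, because tuples compare lexicographically. A small sketch (the names are hypothetical):

```haskell
import Data.List (sortBy)
import Data.Ord (comparing)

-- (user_supplied_prio, auto_increment); lower sorts first (an assumption
-- of this sketch). The tuple's built-in lexicographic Ord means the
-- user-supplied priority takes precedence, and equal priorities fall back
-- to the auto-increment, i.e. FIFO order.
type Priority = (Int, Int)

-- Order waiting transactions by their priority.
serveOrder :: [(Priority, name)] -> [name]
serveOrder = map snd . sortBy (comparing fst)
```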

Comment/Solution:

Actually, when I think about it, what I describe already exists in Haskell: simply wrap all the data in one IORef and only use atomicModifyIORef. atomicModifyIORef ensures transactions are applied in sequence. One might think this makes the data structure sequential (i.e. effectively limited to one thread), but it is actually parallel due to laziness.

To explain this, consider an expensive function f. We are going to apply it to the value stored under the key "foo" in a Data.Map.
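A sketch of that update (applyAt is a hypothetical helper): because atomicModifyIORef is lazy, the atomic swap only installs a thunk at "foo"; the expensive (f x) is evaluated later, outside the critical section.

```haskell
import Data.IORef
import qualified Data.Map as Map

-- Atomically replace the value at key k by a thunk (f applied to it).
-- atomicModifyIORef (unlike the strict atomicModifyIORef') does not force
-- the new map, so (f x) is not computed while the swap happens.
applyAt :: Ord k => (v -> v) -> k -> IORef (Map.Map k v) -> IO ()
applyAt f k ref = atomicModifyIORef ref (\m -> (Map.adjust f k m, ()))
```

Another thread can then call applyAt g "bar" ref immediately; since "bar" does not depend on the result at "foo", the two computations can proceed in parallel.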

This replaces (foo -> x) with (foo -> future (f x)). The current thread continues to work out what (f x) actually is, but in the meantime we can apply g to "bar". Since applying g to "bar" does not need the result at "foo", we can work both out at the same time.

No deadlocks, no starvation, eventually all transactions will be processed (roughly in the order they are received).

Clinton asked Apr 11 '12 07:04


People also ask

Is Livelock the same as deadlock?

A deadlock is a state in which each member of a group of processes is waiting for some other member to release a lock. A livelock is similar to a deadlock, except that the states of the processes involved constantly change with respect to one another, with none of them making progress.

What is Starvation lock?

Starvation (sometimes called livelock) is the situation in which a transaction has to wait for an indefinite period of time to acquire a lock.

How can we avoid deadlock and Livelock?

Livelock is a risk with some algorithms that detect and recover from deadlock: if more than one process takes action, the deadlock-detection algorithm can be triggered repeatedly. This can be avoided by ensuring that only one process (chosen randomly or by priority) takes action.

What is Livelock example?

The simplest example of livelock is two people who meet face-to-face in a corridor, each moving aside to let the other pass. They end up moving from side to side without making any progress, because they both move the same way at the same time and so never get past each other.


1 Answer

You can set up a worker thread to process all requests in a deterministic way, so nobody gets starved. This strategy would be reasonably efficient and immune to livelock.

-- yes, this is a horrible name
createManagerFactory :: a -> IO (IO a, IO ((a -> a) -> IO a))

The IO a is an action that safely and quickly queries the value with a read-only STM action. (a -> a) is a pure function that modifies the value, so ((a -> a) -> IO a) is an action that takes a modifier function, safely applies it, and returns the new value.

Run this once to initialize the factory.

(query, modifierFactory) <- createManagerFactory initValue 

Then for each thread run this to generate a new modifier.

myModify <- modifierFactory 

createManagerFactory would do the following:

  • Create a TVar containing initValue (call it valueTVar).
  • Create a TVar containing an empty collection of TVar (Either a (a -> a)) (call it the modifyTVarCollection)
  • return (atomically $ readTVar valueTVar) as the 'query' result
  • return a modifierFactory that knows about the modifyTVarCollection

modifierFactory would do this:

  • Create a new TVar (Either a (a -> a)) (call it modifyTVar), initialize it to a (Left a) with the current value of the valueTVar, and add it to modifyTVarCollection
  • return a modifier action that loads (Right (a -> a)) into the modifyTVar in one STM action, then in another STM action retries until the modifyTVar contains a (Left a) result value, then return that value.

The worker thread would run this loop:

  • In one STM action, grab all the modifyTVars from the modifyTVarCollection and check their values. If they all contain (Left a) values, retry (this blocks until some other thread loads a modifyTVar with a modifier function, or the modifierFactory creates a new modifyTVar and adds it to the collection). Return the list of all modifyTVars containing a (Right (a -> a)).
  • Iterate over the returned modifyTVars. For each one, perform an STM action that reads the modifier function, safely performs the modification, and puts the result back into the modifyTVar as a (Left a).
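Putting the bullet points together, a sketch of the whole design (an illustration under assumptions: the worker thread is forked inside the factory, slots are kept in a plain list, and a slot the worker finds already in the Left state is simply skipped):

```haskell
import Control.Concurrent (forkIO)
import Control.Concurrent.STM
import Control.Monad (forever, forM_)

createManagerFactory :: a -> IO (IO a, IO ((a -> a) -> IO a))
createManagerFactory initValue = do
  valueTVar  <- newTVarIO initValue
  collection <- newTVarIO []   -- the modifyTVarCollection

  -- Worker thread: block until some slot holds a modifier (Right f),
  -- then apply each pending modifier to the master value in turn.
  _ <- forkIO $ forever $ do
    pending <- atomically $ do
      slots <- readTVar collection
      vals  <- mapM readTVar slots
      let ready = [s | (s, Right _) <- zip slots vals]
      if null ready then retry else return ready
    forM_ pending $ \slot -> atomically $ do
      st <- readTVar slot
      case st of
        Right f -> do
          v <- readTVar valueTVar
          let v' = f v
          writeTVar valueTVar v'
          writeTVar slot (Left v')   -- hand the result back
        Left _  -> return ()         -- already serviced; skip

  let query = readTVarIO valueTVar
      modifierFactory = do
        -- One slot per client thread, registered in the collection.
        slot <- atomically $ do
          v <- readTVar valueTVar
          s <- newTVar (Left v)
          modifyTVar collection (s :)
          return s
        return $ \f -> do
          atomically $ writeTVar slot (Right f)   -- submit the modifier
          atomically $ do                         -- wait for the worker
            st <- readTVar slot
            case st of
              Left v  -> return v
              Right _ -> retry
  return (query, modifierFactory)
```

This matches the usage shown earlier: run (query, modifierFactory) <- createManagerFactory initValue once, then myModify <- modifierFactory in each thread.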
NovaDenizen answered Sep 18 '22 02:09