Parallel Dynamic Programming

2 Answers

We recently published a paper showing how to parallelize any d.p. on a shared memory multicore computer by means of a shared lock-free hash table:

Stivala, A. and Stuckey, P. J. and Garcia de la Banda, M. and Hermenegildo, M. and Wirth, A. 2010 "Lock-free parallel dynamic programming" J. Parallel Distrib. Comput. 70:839-848 doi:10.1016/j.jpdc.2010.01.004

http://dx.doi.org/10.1016/j.jpdc.2010.01.004

Essentially, you start multiple threads, all running the same code starting at the value of the d.p. you want to compute, computing it top-down (recursively), and memoizing in a shared lock-free hash table, but randomizing the order in which subproblems are computed so that the threads diverge in which subproblems they compute.

In terms of implementation, we just used C and pthreads on UNIX type systems, all you need is to be able to have shared memory, and CompareAndSwap (CAS) for lock-free synchronization between threads.

Because this paper was published in an Elsevier journal, you'll need to access the above through a University library or similar with a subscription to it. You might be able to get a pre-print copy via Prof. Stuckey's webpage though.

answered Sep 27 '22 17:09

Alex Stivala

IIRC, what you typically do with dynamic programming is to recursively divide a problem into subproblems, and assemble optimal solutions from optimal subsolutions. What makes it effective is that all optimal subsolutions are built into a cache so they need not be recomputed.

If the problem can be divided several ways, you can fork the solver for each subsolution. If each(sub) problem averages 1+epsilon (for epsilon interestingly more than zero) possible subsolutions, then you'll get a lot of parallelism this way. You'll probably need locks on the cache entries to protect the individual solutions from being constructed more than once.

You need a language in which you can fork subtasks more cheaply than the work to solve them, and which is happy to have lots of forked tasks at once. The typical parallel offerings in most languages do this badly; you can't have lots of forked tasks in systems that use "the big stack model" (see How does a stackless language work?).

We implemented our parallel programming langauge, PARLANSE, to get a language that had the right properties.

answered Sep 27 '22 18:09

Ira Baxter

Related questions
                            
                                TPL Parallel.For with long running tasks
                            
                                Parallel implementation for multiple SVDs using CUDA
                            
                                Performance optimization of foreach loop in C#
                            
                                Traverse a graph in parallel
                            
                                If data fits on a single machine does it make sense to use Spark?
                            
                                How to use parallel 'for' loop in Octave or Scilab?
                            
                                Basic Java threading (4 threads) slower than non-threading
                            
                                How to do large file parallel encryption using GnuPG and GNU parallel?
                            
                                Assignment of a value from a foreach loop
                            
                                Scikit-learn: Parallelize stochastic gradient descent
                            
                                how to optimize matrix multiplication (matmul) code to run fast on a single processor core
                            
                                Missing par method from Scala collections
                            
                                How to realize parallel loop in Delphi?
                            
                                Start and finish lock in different methods
                            
                                Why is it bad to pause/abort threads?
                            
                                Parallel I/O SSD vs HDD surprising results
                            
                                R: How can I export methods provided by a package to a PSOCK cluster?
                            
                                Is F# better than C# in scenarios where you need complete parallelism in parts of an application?
                            
                                Is restrict(amp) more restrictive than CUDA kernel code?
                            
                                How to parallel 4 works with PARFOR with a Core i3 in Matlab

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Parallel Dynamic Programming

Tags:

parallel-processing

dynamic-programming

program-transformation

adk

People also ask

2 Answers

Alex Stivala

Ira Baxter

Recent Activity

Donate For Us