Advice on starting a large multi-threaded programming project

My company currently runs a third-party simulation program (natural catastrophe risk modeling) that sucks up gigabytes of data off a disk and then crunches for several days to produce results. I will soon be asked to rewrite this as a multi-threaded app so that it runs in hours instead of days. I expect to have about 6 months to complete the conversion and will be working solo.

We have a 24-proc box to run this. I will have access to the source of the original program (written in C++ I think), but at this point I know very little about how it's designed.

I need advice on how to tackle this. I'm an experienced programmer (~30 years, currently working in C# 3.5) but have no multi-processor/multi-threaded experience. I'm willing and eager to learn a new language if appropriate. I'm looking for recommendations on languages, learning resources, books, architectural guidelines, etc.

Requirements: Windows OS. A commercial grade compiler with lots of support and good learning resources available. There is no need for a fancy GUI - it will probably run from a config file and put results into a SQL Server database.

Edit: The current app is C++ but I will almost certainly not be using that language for the re-write. I removed the C++ tag that someone added.

asked Dec 14 '09 17:12

Sisiutl

2 Answers

Numerical process simulations are typically run over a single discretised problem grid (for example, the surface of the Earth or clouds of gas and dust), which usually rules out simple task farming or concurrency approaches. This is because a grid divided over a set of processors representing an area of physical space is not a set of independent tasks. The grid cells at the edge of each subgrid need to be updated based on the values of grid cells stored on other processors, which are adjacent in logical space.
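To make the edge-cell dependency concrete, here is a minimal sketch in Python. It assumes a toy 1-D grid and a simple three-point averaging update, neither of which comes from the program in the question; the function names are made up for illustration. Each subgrid copies in one "ghost" cell from each neighbour before updating (often called a halo exchange), and that is what keeps the decomposed result identical to the single-grid result.

```python
def step(cells):
    """One update: each interior cell becomes the mean of itself and its
    two neighbours. The two end cells are held fixed."""
    out = cells[:]
    for i in range(1, len(cells) - 1):
        out[i] = (cells[i - 1] + cells[i] + cells[i + 1]) / 3.0
    return out


def step_decomposed(cells, parts):
    """The same update, computed subgrid by subgrid. Each subgrid sees only
    its own cells plus one ghost cell copied from each neighbouring subgrid.
    Assumes parts <= len(cells) so that no subgrid is empty."""
    n = len(cells)
    result = []
    for p in range(parts):
        lo, hi = p * n // parts, (p + 1) * n // parts
        has_left, has_right = lo > 0, hi < n
        # Halo exchange: widen the local view by one cell on each interior side.
        local = cells[lo - (1 if has_left else 0):hi + (1 if has_right else 0)]
        updated = step(local)
        # Discard the ghost cells; they belong to the neighbouring subgrids.
        start = 1 if has_left else 0
        end = len(updated) - (1 if has_right else 0)
        result.extend(updated[start:end])
    return result


if __name__ == "__main__":
    grid = [0.0, 0.0, 9.0, 0.0, 0.0, 0.0, 3.0, 0.0]
    print(step(grid) == step_decomposed(grid, 4))
```

If you instead update the two halves of the grid as fully independent tasks, the cells next to the cut come out wrong, which is why simple task farming does not work here.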

In high-performance computing, simulations are typically parallelised using either MPI or OpenMP. MPI is a message-passing library with bindings for many languages, including C, C++, Fortran, Python, and C#. OpenMP is an API for shared-memory multiprocessing. In general, MPI is more difficult to code than OpenMP and is much more invasive, but it is also much more flexible. OpenMP requires a memory area shared between processors, so it is not suited to distributed-memory architectures such as clusters. Hybrid schemes are also possible.

This type of programming has its own special challenges. As well as race conditions, deadlocks, livelocks, and all the other joys of concurrent programming, you need to consider the topology of your processor grid: how you choose to split your logical grid across your physical processors. This is important because your parallel speedup is a function of the amount of communication between your processors, which itself is a function of the total edge length of your decomposed grid. As you add more processors, this total edge length increases, increasing the communication overhead. Eventually, splitting the grid more finely costs more in communication than it gains in computation.
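A rough way to see the effect of topology is to count cells. The sketch below assumes a square n x n grid, rectangular blocks, and a one-cell-wide halo on each side of every cut; these are illustrative assumptions, not properties of the program in the question, and `decomposition_cost` is a made-up helper name.

```python
def decomposition_cost(n, px, py):
    """For an n x n grid split into px x py rectangular blocks, return
    (cells_per_block, halo_cells_total).

    cells_per_block is a proxy for computation per processor;
    halo_cells_total is a proxy for communication per step."""
    cells_per_block = (n * n) / (px * py)
    cuts = (px - 1) + (py - 1)        # internal cut lines, each n cells long
    halo_cells_total = 2 * n * cuts   # one row of cells exchanged on each side of a cut
    return cells_per_block, halo_cells_total


if __name__ == "__main__":
    # Three ways to use up to 24 processors on a hypothetical 1200 x 1200 grid.
    for px, py in [(1, 1), (4, 6), (24, 1)]:
        work, halo = decomposition_cost(1200, px, py)
        print(f"{px:>2} x {py:<2}  work per block: {work:>9.0f}  halo cells: {halo}")
```

Under these assumptions, 24 thin strips and a 4 x 6 block layout give each processor the same amount of work, but the strips exchange almost three times as many halo cells (55200 against 19200).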

The other important consideration is the proportion of the code which can be parallelised. Amdahl's law then dictates the maximum theoretically attainable speedup. You should be able to estimate this before you start writing any code.
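Amdahl's law is simple enough to put in a few lines: with a parallelisable fraction p and N processors, the speedup is 1 / ((1 - p) + p / N). The sketch below is just that formula in Python; the 95% figure in the example is a made-up input, not a measurement of the program in the question.

```python
def amdahl_speedup(parallel_fraction, processors):
    """Theoretical speedup under Amdahl's law.

    parallel_fraction: share of the serial run time that can be parallelised (0..1).
    processors: number of processors used (>= 1)."""
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if processors < 1:
        raise ValueError("processors must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / processors)


if __name__ == "__main__":
    # Hypothetical: 95% of the run time parallelisable.
    for n in (2, 4, 8, 16, 24):
        print(n, round(amdahl_speedup(0.95, n), 2))
```

With those hypothetical inputs, 24 processors give a speedup of roughly 11x, and no number of processors can exceed 20x, because the serial 5% never shrinks.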

Both of these facts will conspire to limit the maximum number of processors you can run on. The sweet spot may be considerably lower than you think.

I recommend the book High Performance Computing, if you can get hold of it. In particular, the chapter on performance benchmarking and tuning is priceless.

An excellent online overview of parallel computing, which covers the major issues, is this introduction from Lawrence Livermore National Laboratory.

answered Sep 20 '22 19:09

ire_and_curses

Your biggest problem in a multithreaded project is that too much state is visible across threads: it is too easy to write code that reads or mutates data in an unsafe manner, especially in a multiprocessor environment where issues such as cache coherency and weakly consistent memory might come into play.

Debugging race conditions is distinctly unpleasant.

Approach your design as you would if, say, you were considering distributing your work across multiple machines on a network: that is, identify what tasks can happen in parallel, what the inputs to each task are, what the outputs of each task are, and what tasks must complete before a given task can begin. The point of the exercise is to ensure that each place where data becomes visible to another thread, and each place where a new thread is spawned, are carefully considered.
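One way to write that exercise down is as an explicit task graph. The sketch below is in Python, using the standard `graphlib` and `concurrent.futures` modules; the task names (`load`, `east`, `west`, `total`) and the `run_graph` helper are invented for illustration. Each task declares which tasks must finish first, receives their outputs as arguments, and returns a new value rather than touching shared state.

```python
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def run_graph(tasks, workers=4):
    """tasks maps name -> (function, [dependency names]).

    Each function receives the results of its dependencies as positional
    arguments, in the order listed, and returns its own result."""
    sorter = TopologicalSorter({name: deps for name, (_, deps) in tasks.items()})
    sorter.prepare()  # raises CycleError if the dependencies are circular
    results, pending = {}, {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while sorter.is_active():
            # Submit every task whose dependencies have all completed.
            for name in sorter.get_ready():
                func, deps = tasks[name]
                args = [results[d] for d in deps]
                pending[pool.submit(func, *args)] = name
            # Block until at least one running task finishes.
            done, _ = wait(pending, return_when=FIRST_COMPLETED)
            for future in done:
                name = pending.pop(future)
                results[name] = future.result()
                sorter.done(name)
    return results


# Hypothetical example: load once, process two regions in parallel, combine.
example_tasks = {
    "load": (lambda: [1, 2, 3, 4], []),
    "east": (lambda data: sum(data[:2]), ["load"]),
    "west": (lambda data: sum(data[2:]), ["load"]),
    "total": (lambda east, west: east + west, ["east", "west"]),
}

if __name__ == "__main__":
    print(run_graph(example_tasks)["total"])
```

In CPython, threads will not speed up CPU-bound pure-Python work, so treat this as a sketch of the structure only. The useful part is that every point where data crosses between tasks is visible in the `tasks` table.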

Once such an initial design is complete, there will be a clear division of ownership of data, and clear points at which ownership is taken / transferred; and so you will be in a very good position to take advantage of the possibilities that multithreading offers you - cheaply shared data, cheap synchronisation, lockless shared data structures - safely.
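As a small illustration of an explicit ownership hand-off, here is a sketch in Python using a thread-safe queue. The function names and the producer/consumer split are assumptions made for the example. The producer builds a block, puts it on the queue, and never touches it again; from that point the consumer is the only thread holding a reference, so it can mutate the block without a lock.

```python
import queue
import threading


def producer(out_q, chunks):
    """Build each block, then hand it off. After put() the producer keeps no
    reference to the block, so ownership has moved to the consumer."""
    for chunk in chunks:
        block = list(chunk)  # a private copy, owned by this thread for now
        out_q.put(block)     # ownership transferred here
        del block
    out_q.put(None)          # sentinel: no more work


def consumer(in_q, totals):
    """Take ownership of each block and mutate it freely; no other thread
    can see it. totals is written only by this thread until it exits."""
    while True:
        block = in_q.get()
        if block is None:
            break
        block.sort()
        totals.append(sum(block))


def run_pipeline(chunks):
    q = queue.Queue(maxsize=2)  # bounded, so the producer cannot run far ahead
    totals = []
    threads = [
        threading.Thread(target=producer, args=(q, chunks)),
        threading.Thread(target=consumer, args=(q, totals)),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()                # after join, the caller owns totals
    return totals


if __name__ == "__main__":
    print(run_pipeline([[3, 1, 2], [10, 20]]))
```

The queue is the only synchronisation point, and it is also the only place where data becomes visible to another thread.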

answered Sep 21 '22 19:09

moonshadow

Tags:

architecture

multithreading

parallel-processing

simulation
