OpenMP program on different hosts

Tags:

I want to know if it would be possible to run an OpenMP program on multiple hosts. So far I only heard of programs that can be executed on multiple thread but all within the same physical computer. Is it possible to execute a program on two (or more) clients? I don't want to use MPI.

496

asked Nov 20 '12 14:11

gkris

2 Answers

Yes, it is possible to run OpenMP programs on a distributed system, but I doubt it is within the reach of every user around. ScaleMP offers vSMP - an expensive commercial hypervisor software that allows one to create a virtual NUMA machine on top of many networked hosts, then run a regular OS (Linux or Windows) inside this VM. It requires a fast network interconnect (e.g. InfiniBand) and dedicated hosts (since it runs as a hypervisor beneath the normal OS). We have an operational vSMP cluster here and it runs unmodified OpenMP applications, but performance is strongly dependent on data hierarchy and access patterns.

NICTA used to develop similar SSI hypervisor named vNUMA, but development also stopped. Besides their solution was IA64-specific (IA64 is Intel Itanium, not to be mistaken with Intel64, which is their current generation of x86 CPUs).

Intel used to develop Cluster OpenMP (ClOMP; not to be mistaken with the similarly named project to bring OpenMP support to Clang), but it was abandoned due to "general lack of interest among customers and fewer cases than expected where it showed a benefit" (from here). ClOMP was an Intel extension to OpenMP and it was built into the Intel compiler suite, e.g. you couldn't use it with GCC (this request to start ClOMP development for GCC went in the limbo). If you have access to old versions of Intel compilers (versions from 9.1 to 11.1), you would have to obtain a (trial) ClOMP license, which might be next to impossible given that the product is dead and old (trial) licenses have already expired. Then again, starting with version 12.0, Intel compilers no longer support ClOMP.

Other research projects exist (just search for "distributed shared memory"), but only vSMP (the ScaleMP solution) seems to be mature enough for production HPC environments (and it's priced accordingly). Seems like most efforts now go into development of co-array languages (Co-Array Fortran, Unified Parallel C, etc.) instead. I would suggest that you have a look at Berkeley UPC or invest some time in learning MPI as it is definitely not going away in the years to come.

186

answered Oct 04 '22 20:10

Hristo Iliev

Before, there was the Cluster OpenMP.

Cluster OpenMP, was an implementation of OpenMP that could make use of multiple SMP machines without resorting to MPI. This advance had the advantage of eliminating the need to write explicit messaging code, as well as not mixing programming paradigms. The shared memory in Cluster OpenMP was maintained across all machines through a distributed shared-memory subsystem. Cluster OpenMP is based on the relaxed memory consistency of OpenMP, allowing shared variables to be made consistent only when absolutely necessary. source

Performance Considerations for Cluster OpenMP

Some memory operations are much more expensive than others. To achieve good performance with Cluster OpenMP, the number of accesses to unprotected pages must be as high as possible, relative to the number of accesses to protected pages. This means that once a page is brought up-to-date on a given node, a large number of accesses should be made to it before the next synchronization. In order to accomplish this, a program should have as little synchronization as possible, and re-use the data on a given page as much as possible. This translates to avoiding fine-grained synchronization, such as atomic constructs or locks, and having high data locality source.

answered Oct 04 '22 19:10

dreamcrash

Related questions
                            
                                Handle a blocking function call in Python
                            
                                Why does python thread consume so much memory?
                            
                                How to wait for the start of a thread in java
                            
                                Difference between mutex lock and pthread_join
                            
                                c# backgroundWorker not raising ProgressChanged or RunWorkerCompleted events
                            
                                Using sleep() and interrupt() to reuse thread
                            
                                Failed to set the specified COM apartment state
                            
                                Analysing a multi-threaded Java application
                            
                                rationale behind lock inside lock?
                            
                                How do you loop a thread?
                            
                                QtConcurrent with member function
                            
                                python, how to incrementally create Threads
                            
                                Dynamically create thread
                            
                                Java notify() gets called before wait()
                            
                                ThreadPoolExecutor with ArrayBlockingQueue
                            
                                Python threading.Timer only repeats once
                            
                                Python condition variable timeout
                            
                                How to calculate the total time it takes for multiple threads to finish executing?
                            
                                Sending messages from background thread to main thread on iOS
                            
                                How to limit the number of cores used by the Erlang VM (BEAM)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

OpenMP program on different hosts

Tags:

performance

multithreading

parallel-processing

distributed-computing

openmp

gkris

People also ask

2 Answers

Hristo Iliev

dreamcrash

Recent Activity

Donate For Us