 

Can two processes share the same GPU memory? (CUDA)

In the CPU world one can do this via memory mapping. Can something similar be done for the GPU?

If two processes could share the same CUDA context, I think it would be trivial - just pass GPU memory pointers around. Is it possible to share the same CUDA context between two processes?

Another possibility I could think of is to map device memory to memory-mapped host memory. Since it's memory mapped, it can be shared between two processes. Does this make sense / is it possible, and is there any overhead?

Helin Wang asked Feb 03 '17


People also ask

Can CUDA use shared GPU memory?

Shared memory is a powerful feature for writing well-optimized CUDA code. Access to shared memory is much faster than global memory access because it is located on-chip. Because shared memory is shared by the threads in a thread block, it provides a mechanism for those threads to cooperate.
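
As a quick illustration (a minimal sketch, not part of the original page; the kernel name reverse_in_block and the fixed tile size of 256 threads per block are assumptions), a kernel in which the threads of a block cooperate through __shared__ memory might look like this:

    // Minimal sketch: each block stages its slice of `data` in fast on-chip
    // shared memory, synchronizes, then writes it back reversed. Assumes the
    // kernel is launched with exactly 256 threads per block.
    __global__ void reverse_in_block(float *data)
    {
        __shared__ float tile[256];              // on-chip, visible to the whole block
        int t = threadIdx.x;
        int i = blockIdx.x * blockDim.x + t;     // this thread's global index

        tile[t] = data[i];                       // stage from global memory
        __syncthreads();                         // wait until every thread has written
        data[i] = tile[blockDim.x - 1 - t];      // cooperatively read another thread's element
    }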

Is GPU memory shared?

Your system will dedicate up to 50% of your physical RAM to shared GPU memory, regardless of whether you have an integrated or a dedicated GPU.

How is memory allocated in CUDA?

Memory management on a CUDA device is similar to how it is done in CPU programming. You need to allocate memory space on the device, transfer the data to it using the built-in API, retrieve the results (transfer the data back to the host), and finally free the allocated memory.
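
A minimal sketch of that allocate / transfer / retrieve / free cycle (error checking omitted; the buffer size is arbitrary):

    // Minimal sketch of the CUDA allocate / copy / free cycle described above.
    #include <cuda_runtime.h>

    int main()
    {
        const size_t n = 1024;
        float host[1024] = {0};                  // host-side buffer
        float *dev = nullptr;

        cudaMalloc(&dev, n * sizeof(float));     // allocate device memory
        cudaMemcpy(dev, host, n * sizeof(float),
                   cudaMemcpyHostToDevice);      // transfer data to the device
        // ... launch kernels that read/write dev ...
        cudaMemcpy(host, dev, n * sizeof(float),
                   cudaMemcpyDeviceToHost);      // retrieve the results
        cudaFree(dev);                           // free the allocation
        return 0;
    }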

Do games use shared GPU memory?

The OS will use it freely, so you shouldn't worry about it technically being half of your RAM. If you have an integrated GPU, shared GPU memory is the maximum amount of memory your iGPU can use. Since integrated GPUs don't have dedicated GPU memory, they have to use system RAM to run games and apps.


1 Answer

CUDA MPS effectively allows CUDA activity emanating from 2 or more processes to share the same context on the GPU. However, this won't provide what you are asking for:

can two processes share the same GPU memory?

One method to achieve this is via the CUDA IPC (interprocess communication) API.

This will allow you to share an allocated device memory region (i.e. a memory region allocated via cudaMalloc) between multiple processes. This answer contains additional resources to learn about CUDA IPC.
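
As a rough sketch of how the CUDA IPC API fits together (cudaIpcGetMemHandle, cudaIpcOpenMemHandle and cudaIpcCloseMemHandle are the actual runtime calls; the exporter/importer split and the transport of the handle between processes are assumptions, and error checking is omitted):

    // Rough sketch of CUDA IPC between two processes. Error checking and the
    // actual transport of the handle (pipe, socket, file, ...) are omitted.
    #include <cuda_runtime.h>

    // Process A: allocate device memory and export an IPC handle for it.
    void export_allocation(cudaIpcMemHandle_t *handle_out)
    {
        float *d_buf = nullptr;
        cudaMalloc(&d_buf, 1024 * sizeof(float)); // the allocation to be shared
        cudaIpcGetMemHandle(handle_out, d_buf);   // fill a portable handle for it
        // ... send *handle_out to process B via any ordinary IPC channel ...
    }

    // Process B: map process A's allocation from the received handle.
    void import_allocation(cudaIpcMemHandle_t handle_in)
    {
        float *d_buf = nullptr;
        cudaIpcOpenMemHandle((void **)&d_buf, handle_in,
                             cudaIpcMemLazyEnablePeerAccess);
        // d_buf now refers to the same device memory as in process A;
        // use it in kernels or cudaMemcpy, then unmap it:
        cudaIpcCloseMemHandle(d_buf);
    }

The handle itself is a small opaque structure, so it can be sent over any byte-oriented channel such as a pipe, a socket, or a file.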

However, according to my testing, this does not enable sharing of host pinned memory regions (e.g. a region allocated via cudaHostAlloc) between multiple processes. The memory region itself can be shared using the ordinary IPC mechanisms available for your particular OS, but it cannot be made to appear as "pinned" memory in 2 or more processes.
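
For completeness, here is a sketch of sharing a plain host region with one such ordinary OS mechanism (POSIX shared memory on Linux; the region name "/cuda_shared_buf" is illustrative). Per the testing described above, a region shared this way behaves as ordinary pageable memory from CUDA's point of view, not as pinned memory in both processes:

    // Sketch: share a host memory region between processes via POSIX shared
    // memory. Both processes call this with the same name; the resulting
    // pointer can be used with cudaMemcpy as ordinary pageable host memory.
    #include <fcntl.h>
    #include <sys/mman.h>
    #include <unistd.h>
    #include <cstddef>

    void *map_shared_region(size_t bytes)
    {
        int fd = shm_open("/cuda_shared_buf", O_CREAT | O_RDWR, 0600);
        ftruncate(fd, bytes);                    // size the region (first process)
        void *p = mmap(nullptr, bytes, PROT_READ | PROT_WRITE,
                       MAP_SHARED, fd, 0);       // same physical pages in both processes
        close(fd);                               // the mapping remains valid after close
        return p;
    }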

2 revs answered Sep 19 '22