Force GPU memory limit in PyTorch

Tags:

pytorch

Is there a way to force a maximum value for the amount of GPU memory that I want to be available for a particular Pytorch instance? For example, my GPU may have 12Gb available, but I'd like to assign 4Gb max to a particular process.

956

asked Mar 28 '18 08:03

Giorgos Sfikas

2 Answers

Update (04-MAR-2021): it is now available in the stable 1.8.0 version of PyTorch. Also, in the docs

Original answer follows.

This feature request has been merged into PyTorch master branch. Yet, not introduced in the stable release.

Introduced as set_per_process_memory_fraction

Set memory fraction for a process. The fraction is used to limit an caching allocator to allocated memory on a CUDA device. The allowed value equals the total visible memory multiplied fraction. If trying to allocate more than the allowed value in a process, will raise an out of memory error in allocator.

You can check the tests as usage examples.

189

answered Sep 18 '22 05:09

ndrwnaguib

Update pytorch to 1.8.0 （pip install --upgrade torch==1.8.0）

function: torch.cuda.set_per_process_memory_fraction(fraction, device=None)

params:

fraction (float) – Range: 0~1. Allowed memory equals total_memory * fraction.

device (torch.device or int, optional) – selected device. If it is None the default CUDA device is used.

eg:

import torch
torch.cuda.set_per_process_memory_fraction(0.5, 0)
torch.cuda.empty_cache()
total_memory = torch.cuda.get_device_properties(0).total_memory
# less than 0.5 will be ok:
tmp_tensor = torch.empty(int(total_memory * 0.499), dtype=torch.int8, device='cuda')
del tmp_tensor
torch.cuda.empty_cache()
# this allocation will raise a OOM:
torch.empty(total_memory // 2, dtype=torch.int8, device='cuda')

"""
It raises an error as follows: 
RuntimeError: CUDA out of memory. Tried to allocate 5.59 GiB (GPU 0; 11.17 GiB total capacity; 0 bytes already allocated; 10.91 GiB free; 5.59 GiB allowed; 0 bytes reserved in total by PyTorch)
"""

answered Sep 19 '22 05:09

kaiyuanxie

Related questions
                            
                                Using TPUs with PyTorch
                            
                                How to convert a pytorch tensor of ints to a tensor of booleans?
                            
                                PyTorch torch.max over multiple dimensions
                            
                                PyTorch: Add validation error in training
                            
                                Pytorch Change the learning rate based on number of epochs
                            
                                Pytorch: Create an boolean tensor (type: torch.ByteTensor)?
                            
                                Multivariate input LSTM in pytorch
                            
                                How to convert torch tensor to pandas dataframe?
                            
                                Pytorch: Why is the memory occupied by the `tensor` variable so small?
                            
                                AttributeError: 'collections.OrderedDict' object has no attribute 'eval'
                            
                                What is the difference between model.to(device) and model=model.to(device)?
                            
                                Understanding Memory Usage by PyTorch DataLoader Workers
                            
                                How to parallelize a training loop ever samples of a batch when CPU is only available in pytorch?
                            
                                PyTorch's dataloader "too many open files" error when no files should be open
                            
                                How can I invert a MelSpectrogram with torchaudio and get an audio waveform?
                            
                                Implementation of the Dense Synthesizer
                            
                                PyTorch equivalence for softmax_cross_entropy_with_logits
                            
                                Accuracy score in pyTorch LSTM
                            
                                "AssertionError: Torch not compiled with CUDA enabled" in spite upgrading to CUDA version
                            
                                Is there a function to extract image patches in PyTorch?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With