
Pytorch: Why is the memory occupied by the `tensor` variable so small?

In Pytorch 1.0.0, I found that a tensor variable occupies very small memory. I wonder how it stores so much data. Here's the code.

import sys
import numpy as np
import torch

a = np.random.randn(1, 1, 128, 256)
b = torch.tensor(a, device=torch.device('cpu'))

a_size = sys.getsizeof(a)
b_size = sys.getsizeof(b)

a_size is 262288. b_size is 72.

asked Jan 25 '19 by laridzhang

People also ask

Why does PyTorch use so much memory?

If you are seeing an increase in allocated memory usage, you are most likely storing tensors in, e.g., a list while they are still attached to the computation graph. You need to detach() them, assuming you no longer want to backpropagate through them.
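A minimal sketch of this pattern (the loop and loss here are hypothetical, just to illustrate detach()):

```python
import torch

x = torch.randn(4, requires_grad=True)
losses = []
for _ in range(3):
    loss = (x ** 2).sum()
    # Storing `loss` directly would keep its whole computation graph alive;
    # detach() stores only the value.
    losses.append(loss.detach())

assert all(not l.requires_grad for l in losses)
```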

How do you increase the size of the PyTorch tensor?

We can resize tensors in PyTorch with the view() method. view() changes the dimensions of a tensor, but the total number of elements must match before and after the resize.
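For example, a tensor of 12 elements can be viewed as any shape whose element count is also 12:

```python
import torch

t = torch.arange(12)   # 12 elements
r = t.view(3, 4)       # OK: 3 * 4 == 12; t.view(5, 3) would raise an error
assert r.shape == (3, 4)
```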

How do you find the memory size of tensor PyTorch?

Each tensor has a method element_size() that gives the size of one element in bytes, and a method nelement() that returns the number of elements. So the memory size of a tensor a (CPU memory for a CPU tensor, GPU memory for a GPU tensor) is a.element_size() * a.nelement().
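Applied to a tensor with the same shape and dtype as in the question, this gives the expected byte count:

```python
import torch

a = torch.randn(1, 1, 128, 256, dtype=torch.float64)
nbytes = a.element_size() * a.nelement()  # 8 bytes per float64 * 32768 elements
assert nbytes == 262144
```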

How tensors are stored in memory?

The commonly used way to store such data is in a single array that is laid out as a single, contiguous block within memory. More concretely, a 3x3x3 tensor would be stored simply as a single array of 27 values, one after the other.
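You can see this flat layout through a tensor's strides, which tell how many elements to skip in the flat array when stepping along each dimension:

```python
import torch

t = torch.zeros(3, 3, 3)
# A contiguous 3x3x3 tensor is one flat block of 27 values.
assert t.is_contiguous()
assert t.stride() == (9, 3, 1)  # step 9 elements per row of rows, 3 per row, 1 per value
```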


1 Answer

The answer comes in two parts. From the documentation of sys.getsizeof, firstly:

All built-in objects will return correct results, but this does not have to hold true for third-party extensions as it is implementation specific.

so it could be that for tensors __sizeof__ is undefined, or defined differently than you would expect; this function is not something you can rely on. Secondly:

Only the memory consumption directly attributed to the object is accounted for, not the memory consumption of objects it refers to.

which means that if the torch.Tensor object merely holds a reference to the actual memory, this won't show up in sys.getsizeof. This is indeed the case: if you check the size of the underlying storage instead, you will see the expected number.

import torch, sys
b = torch.randn(1, 1, 128, 256, dtype=torch.float64)
sys.getsizeof(b)            # 72
sys.getsizeof(b.storage())  # 262208

Note: I am setting dtype to float64 explicitly, because that is the default dtype in numpy, whereas torch uses float32 by default.
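To see why the dtype matters here, you can check that torch.tensor(a) built from a numpy array inherits numpy's float64, so both buffers hold the same 262144 bytes of data:

```python
import numpy as np
import torch

a = np.random.randn(1, 1, 128, 256)  # numpy defaults to float64
b = torch.tensor(a)                   # dtype inherited from the numpy array
assert b.dtype == torch.float64
assert a.nbytes == b.element_size() * b.nelement() == 262144
```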

answered Sep 29 '22 by Jatentaki