Why is the value of a `tf.constant()` stored multiple times in memory in TensorFlow?

I read that (in TensorFlow):

the value of a tf.constant() is stored multiple times in memory.

Why is the value of a tf.constant() stored multiple times in memory?

asked Feb 24 '17 by Franck Dernoncourt

People also ask

What is TF constant in TensorFlow?

tf.constant is useful for asserting that the value can be embedded into the graph that way. If the argument dtype is not specified, then the type is inferred from the type of value.
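For example, a minimal sketch (TensorFlow 1.x API, matching the rest of this page) creating a constant 1-D tensor from a Python list:

    import tensorflow as tf

    # dtype is inferred from the Python values: int32 here.
    a = tf.constant([1, 2, 3])

    # dtype given explicitly.
    b = tf.constant([1.0, 2.0, 3.0], dtype=tf.float32)

    with tf.Session() as sess:
        print(sess.run(a))  # [1 2 3]
        print(sess.run(b))  # [1. 2. 3.]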

What is the difference between TF constant and TF variable?

In TensorFlow, the difference between constants and variables is that when you declare a constant, its value can't be changed later (and it must be initialized with a value, not with an operation). When you declare a Variable, however, you can change its value later with tf.assign().
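A minimal sketch of the difference (TensorFlow 1.x API):

    import tensorflow as tf

    c = tf.constant(1)   # immutable: there is no assign op for a constant
    v = tf.Variable(1)   # mutable: can be updated with tf.assign()

    update = tf.assign(v, 5)

    with tf.Session() as sess:
        sess.run(tf.global_variables_initializer())
        print(sess.run(v))  # 1
        sess.run(update)
        print(sess.run(v))  # 5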

Is TF constant immutable?

On the other hand, tf.constant is immutable, meaning that once you define it you can't change its value.

Which one of the following would you use to initialize a constant in TensorFlow?

In TensorFlow, constants are created using the function constant, which has the signature constant(value, dtype=None, shape=None, name='Const', verify_shape=False), where value is the actual constant value to be used in further computation, dtype is the data type parameter (e.g., float32/64, int8/16, etc.), shape optionally fixes the dimensions of the resulting tensor, name is an optional name for the tensor, and verify_shape enables verification of the shape of the value.
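A short sketch exercising those parameters (TensorFlow 1.x API); note that a scalar value is broadcast to fill the requested shape:

    import tensorflow as tf

    # Scalar value filled into a 2x3 tensor, with explicit dtype and name.
    m = tf.constant(7, dtype=tf.float32, shape=[2, 3], name='sevens')

    with tf.Session() as sess:
        print(sess.run(m))
        # [[7. 7. 7.]
        #  [7. 7. 7.]]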


2 Answers

Because the data for a constant tensor is embedded into the graph definition. This means the data is stored both in the client, which maintains the graph definition, and in the runtime, which allocates its own memory for all tensors.

That is, try:

import tensorflow as tf

a = tf.constant([1, 2])
print(tf.get_default_graph().as_graph_def())

You'll see

    tensor {
      dtype: DT_INT32
      tensor_shape {
        dim {
          size: 2
        }
      }
      tensor_content: "\001\000\000\000\002\000\000\000"
    }

The tensor_content field is the raw content, same as np.array([1,2], dtype=np.int32).tobytes().
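A quick sketch to verify that claim (TensorFlow 1.x API; grabbing the first node with op 'Const' assumes a is the only constant in the default graph):

    import numpy as np
    import tensorflow as tf

    a = tf.constant([1, 2])
    graph_def = tf.get_default_graph().as_graph_def()

    # Pull the embedded bytes out of the Const node's value attribute.
    const_node = [n for n in graph_def.node if n.op == 'Const'][0]
    embedded = const_node.attr['value'].tensor.tensor_content

    print(embedded == np.array([1, 2], dtype=np.int32).tobytes())  # True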

Now, to see the runtime allocation, you can run with export TF_CPP_MIN_LOG_LEVEL=1.

If you evaluate anything using a, you'll see something like this:

2017-02-24 16:13:58: I tensorflow/core/framework/log_memory.cc:35] __LOG_MEMORY__ MemoryLogTensorOutput { step_id: 1 kernel_name: "Const_1/_1" tensor { dtype: DT_INT32 shape { dim { size: 2 } } allocation_description { requested_bytes: 8 allocated_bytes: 256 allocator_name: "cuda_host_bfc" allocation_id: 1 ptr: 8605532160 } } }

This means the runtime asked to allocate 8 bytes, and TF actually allocated 256 bytes (how much memory gets allocated beyond the request is somewhat arbitrary at the moment; see bfc_allocator.cc).
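A minimal way to trigger such an evaluation (TensorFlow 1.x API); run the script with the environment variable set as above to see the __LOG_MEMORY__ lines:

    import tensorflow as tf

    a = tf.constant([1, 2])

    with tf.Session() as sess:
        # Evaluating the constant makes the runtime allocate memory for it,
        # which is what shows up in the MemoryLogTensorOutput record.
        print(sess.run(a))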

Having constants embedded in the graph makes it easier to do some graph-based optimizations like constant folding. But it also means that large constants are inefficient; using large constants is a common cause of exceeding the 2 GB limit on the size of the graph definition.
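A common workaround for large values, sketched here under the assumption that the data can be fed at run time: pass it through a tf.placeholder so it is never embedded in the graph definition:

    import numpy as np
    import tensorflow as tf

    big = np.random.rand(10000, 1000).astype(np.float32)  # ~40 MB of data

    # A placeholder keeps the data out of the GraphDef; it is fed per session.run.
    x = tf.placeholder(tf.float32, shape=big.shape)
    y = tf.reduce_sum(x)

    with tf.Session() as sess:
        print(sess.run(y, feed_dict={x: big}))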

answered Nov 14 '22 by Yaroslav Bulatov

They are referring to the fact that, while the constant is being initialized, one copy of its value is stored as a NumPy array and another copy is stored in TensorFlow. The two copies exist during initialization of the constant.

answered Nov 14 '22 by Aaron