From what I've gathered so far, there are several different ways of dumping a TensorFlow graph into a file and then loading it into another program, but I haven't been able to find clear examples/information on how they work. What I already know is this:
- Save the model's variables with a tf.train.Saver() and restore them later (source)
- Save and load a graph with tf.train.write_graph() and tf.import_graph_def() (source)
- Use as_graph_def() to save the model, and for weights/variables, map them into constants (source)

However, I haven't been able to clear up several questions regarding these different methods:
- With tf.train.write_graph(), are the weights/variables saved as well?
- Can a frozen graph be loaded back in using tf.import_graph_def()?
- What exactly is the difference between as_graph_def()/.ckpt/.pb?

In short, what I'm looking for is a method to save both a graph (as in, the various operations and such) and its weights/variables into a file, which can then be used to load the graph and weights into another program, for use (not necessarily continuing/retraining).
Documentation about this topic isn't very straightforward, so any answers/information would be greatly appreciated.
The .pb format is the protocol buffer (protobuf) format, and in TensorFlow this format is used to hold models. Protocol buffers are a general-purpose serialization format from Google that is convenient to transport, because it encodes data compactly and enforces a structure on the data.
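Since a GraphDef is just a protobuf message, it can be written and read with the standard protobuf serialization calls. A minimal sketch (the file name and graph contents here are arbitrary):

import tensorflow as tf

# Build a trivial graph so there is something to serialize.
a = tf.constant(1.0, name="a")
b = tf.constant(2.0, name="b")
c = tf.add(a, b, name="c")

# SerializeToString()/ParseFromString() are ordinary protobuf methods.
graph_def = tf.get_default_graph().as_graph_def()
with tf.gfile.GFile("graph.pb", "wb") as f:
    f.write(graph_def.SerializeToString())

restored = tf.GraphDef()
with tf.gfile.GFile("graph.pb", "rb") as f:
    restored.ParseFromString(f.read())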
There are many ways to approach the problem of saving a model in TensorFlow, which can make it a bit confusing. Taking each of your sub-questions in turn:
The checkpoint files (produced e.g. by calling saver.save() on a tf.train.Saver object) contain only the weights, and any other variables defined in the same program. To use them in another program, you must re-create the associated graph structure (e.g. by running code to build it again, or calling tf.import_graph_def()), which tells TensorFlow what to do with those weights. Note that calling saver.save() also produces a file containing a MetaGraphDef, which contains a graph and details of how to associate the weights from a checkpoint with that graph. See the tutorial for more details.
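As a rough illustration of that workflow (the paths, variable names and shapes below are made up for the example):

import tensorflow as tf

# Training program: define variables, then snapshot their values.
w = tf.Variable(tf.random_normal([10, 10]), name="w")
saver = tf.train.Saver()
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    saver.save(sess, "/tmp/model.ckpt")   # also writes /tmp/model.ckpt.meta

# Consuming program: the graph must be rebuilt (by running the same
# graph-building code, or via tf.train.import_meta_graph) before the
# checkpoint can be restored into it.
with tf.Session() as sess:
    saver.restore(sess, "/tmp/model.ckpt")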
tf.train.write_graph() only writes the graph structure, not the weights.
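For example, something like the following writes only the GraphDef (directory and file names are arbitrary):

import tensorflow as tf

x = tf.placeholder(tf.float32, shape=[None, 784], name="input")
w = tf.Variable(tf.zeros([784, 10]), name="w")
y = tf.matmul(x, w, name="output")

# Only the structure (nodes and their connections) is written; the values
# stored in w still need a checkpoint.
tf.train.write_graph(tf.get_default_graph().as_graph_def(),
                     "/tmp/model", "graph.pbtxt", as_text=True)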
Bazel is unrelated to reading or writing TensorFlow graphs. (Perhaps I misunderstand your question: feel free to clarify it in a comment.)
A frozen graph can be loaded using tf.import_graph_def(). In this case, the weights are (typically) embedded in the graph, so you don't need to load a separate checkpoint.
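Freezing itself is usually done with the freeze_graph tool, or equivalently by converting variables to constants in Python. A minimal sketch, assuming TF 1.x's tf.graph_util.convert_variables_to_constants and a made-up output node name:

import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 2], name="input")
w = tf.Variable(tf.ones([2, 1]), name="w")
y = tf.matmul(x, w, name="output")

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    # Replace each variable with a constant holding its current value,
    # keeping only the ops needed to compute the listed output nodes.
    frozen_def = tf.graph_util.convert_variables_to_constants(
        sess, sess.graph.as_graph_def(), output_node_names=["output"])

with tf.gfile.GFile("/tmp/frozen_graph.pb", "wb") as f:
    f.write(frozen_def.SerializeToString())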
The main change would be to update the names of the tensor(s) that are fed into the model, and the names of the tensor(s) that are fetched from the model. In the TensorFlow Android demo, this would correspond to the inputName and outputName strings that are passed to TensorFlowClassifier.initializeTensorFlow().
The GraphDef is the program structure, which typically does not change through the training process. The checkpoint is a snapshot of the state of a training process, which typically changes at every step of the training process. As a result, TensorFlow uses different storage formats for these types of data, and the low-level API provides different ways to save and load them. Higher-level libraries, such as the MetaGraphDef libraries, Keras, and skflow, build on these mechanisms to provide more convenient ways to save and restore an entire model.
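For completeness, the MetaGraphDef route mentioned above looks roughly like this (the checkpoint paths are assumptions carried over from the earlier sketch):

import tensorflow as tf

# saver.save(sess, "/tmp/model.ckpt") also wrote /tmp/model.ckpt.meta,
# which holds the MetaGraphDef (graph structure plus saver bookkeeping).
saver = tf.train.import_meta_graph("/tmp/model.ckpt.meta")  # rebuilds the graph
with tf.Session() as sess:
    saver.restore(sess, "/tmp/model.ckpt")                  # loads the weights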
You can try the following code:
import tensorflow as tf

# Read the frozen GraphDef and import its nodes into the default graph.
with tf.gfile.FastGFile('model/frozen_inference_graph.pb', "rb") as f:
    graph_def = tf.GraphDef()
    graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name="")

# The imported nodes live in the default graph, which the session uses.
sess = tf.Session()
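Once the graph is imported, inference is just a matter of looking up the input and output tensors by name and calling sess.run(); the tensor names below ("image_tensor:0", "detection_boxes:0") are only examples and depend on the particular model:

import numpy as np

input_t = sess.graph.get_tensor_by_name("image_tensor:0")
output_t = sess.graph.get_tensor_by_name("detection_boxes:0")
boxes = sess.run(output_t,
                 feed_dict={input_t: np.zeros((1, 300, 300, 3), dtype=np.uint8)})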