I'm trying to build a CNN object detector using TensorFlow in Python. I would like to first train the model to do plain object recognition (classification), and then reuse several convolutional layers of the pretrained model to train it to predict bounding boxes. I will need to replace the fully connected layers and probably some of the last convolutional layers. So, for this reason, I would like to know if it is possible to import only the weights from the TensorFlow graph that was used to train the object classifier into a newly defined graph that I will train to do object detection. So basically I would like to do something like this:
# here I initialize the new graph
conv_1 = tf.nn.conv2d(inputs, weights_from_old_graph, strides=[1, 1, 1, 1], padding='SAME')
conv_2 = tf.nn.conv2d(conv_1, weights_from_old_graph, strides=[1, 1, 1, 1], padding='SAME')
...
conv_n = tf.nn.conv2d(conv_n_minus_1, randomly_initialized_weights, strides=[1, 1, 1, 1], padding='SAME')
fc_1 = tf.matmul(conv_n, randomly_initialized_weights)
To use the pretrained weights, set the weights argument to 'imagenet', which is also the default value. If you want to train the model from scratch instead, set weights=None; this initializes the network's weights randomly.
The best way to use the model is to retain its architecture and its initial (pretrained) weights, and then retrain it starting from those weights.
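For example, with the Keras applications bundled in TensorFlow (VGG16 is just an illustrative choice here), a minimal sketch of both options:

import tensorflow as tf

# pretrained ImageNet weights, with the fully connected head dropped
base = tf.keras.applications.VGG16(weights='imagenet', include_top=False,
                                   input_shape=(224, 224, 3))

# weights=None gives the same architecture with randomly initialized weights
scratch = tf.keras.applications.VGG16(weights=None, include_top=False,
                                      input_shape=(224, 224, 3))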
Use a tf.train.Saver with no arguments to save all the variables in the model.
import tensorflow as tf

tf.reset_default_graph()
v1 = tf.get_variable("v1", [3], initializer=tf.initializers.random_normal)
v2 = tf.get_variable("v2", [5], initializer=tf.initializers.random_normal)
saver = tf.train.Saver()

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    saver.save(sess, save_path='./test-case.ckpt')
    print(v1.eval())
    print(v2.eval())
saver = None  # drop the reference; a new Saver is built for the restore below
v1 = [ 2.1882825 1.159807 -0.26564872]
v2 = [0.11437789 0.5742971 ]
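If you are not sure which names ended up in the checkpoint, tf.train.list_variables reads the stored (name, shape) pairs directly from the checkpoint files. A small sketch, reusing the checkpoint saved above:

# inspect what the checkpoint actually contains
for name, shape in tf.train.list_variables('./test-case.ckpt'):
    print(name, shape)
# v1 [3]
# v2 [5]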
Then, in the model where you want to restore certain values, pass a list of the variable names you want to restore, or a dictionary mapping {"name in checkpoint": variable}, to the Saver.
tf.reset_default_graph()
b1 = tf.get_variable("b1", [3], initializer=tf.initializers.random_normal)
b2 = tf.get_variable("b2", [3], initializer=tf.initializers.random_normal)
saver = tf.train.Saver(var_list={'v1': b1})

with tf.Session() as sess:
    saver.restore(sess, "./test-case.ckpt")
    print(b1.eval())
    print(b2.eval())
INFO:tensorflow:Restoring parameters from ./test-case.ckpt
b1 = [ 2.1882825 1.159807 -0.26564872]
b2 = FailedPreconditionError: Attempting to use uninitialized value b2
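b2 fails because it was neither restored nor initialized. Applied to the original question, the same name mapping lets you load only the convolutional kernels of the old classifier into a newly defined detection graph, initializing the new layers yourself. A hedged sketch, where the checkpoint names ('conv1/weights', 'conv2/weights'), the kernel shapes, and './classifier.ckpt' are all assumptions about how the old classifier was defined and saved:

tf.reset_default_graph()

# new detection graph: reuse the old conv kernels, train the rest from scratch
conv1_w = tf.get_variable("det/conv1_weights", [3, 3, 3, 64])   # assumed shape
conv2_w = tf.get_variable("det/conv2_weights", [3, 3, 64, 64])  # assumed shape
fc_w = tf.get_variable("det/fc_weights", [1024, 4])             # new layer, random init

# map names in the old checkpoint to variables in the new graph
restorer = tf.train.Saver(var_list={'conv1/weights': conv1_w,
                                    'conv2/weights': conv2_w})

with tf.Session() as sess:
    sess.run(tf.variables_initializer([fc_w]))   # initialize only the new layers
    restorer.restore(sess, './classifier.ckpt')  # hypothetical checkpoint path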
Although I agree with Aechlys on how to restore variables, the problem is harder when we want to freeze them. For example, we trained these variables and we want to use them in another model, but this time without training them (training only new variables, as in transfer learning). You can see the answer I posted here.
Quick example:
with tf.Session() as sess:
    # restore the old graph and read the trained weight values out as an array
    new_saver = tf.train.import_meta_graph(pathToMeta)
    new_saver.restore(sess, pathToNonMeta)
    weight1 = sess.run(sess.graph.get_tensor_by_name("w1:0"))

tf.reset_default_graph()  # this will eliminate the variables we restored

with tf.Session() as sess:
    # re-create the weight as a new, non-trainable variable in the fresh graph
    weights = {
        '1': tf.Variable(weight1, name='w1-bis', trainable=False)
    }
    ...
We are now sure the restored variables are not a part of the graph.
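As a sanity check (a small sketch continuing inside the second session, with a hypothetical new layer fc-new), variables created with trainable=False are excluded from tf.trainable_variables(), so an optimizer's default minimize() will leave them untouched:

    new_fc = tf.Variable(tf.random_normal([1024, 4]), name='fc-new')  # hypothetical new layer
    print([v.name for v in tf.trainable_variables()])
    # ['fc-new:0'] -- 'w1-bis:0' is absent because it was created with trainable=False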