load multiple models in Tensorflow

Tags:

tensorflow

I have written the following convolutional neural network (CNN) class in Tensorflow [I have tried to omit some lines of code for clarity.]

class CNN:
def __init__(self,
                num_filters=16,        # initial number of convolution filters
             num_layers=5,           # number of convolution layers
             num_input=2,           # number of channels in input
             num_output=5,          # number of channels in output
             learning_rate=1e-4,    # learning rate for the optimizer
             display_step = 5000,   # displays training results every display_step epochs
             num_epoch = 10000,     # number of epochs for training
             batch_size= 64,        # batch size for mini-batch processing
             restore_file=None,      # restore file (default: None)

            ):

                # define placeholders
                self.image = tf.placeholder(tf.float32, shape = (None, None, None,self.num_input))  
                self.groundtruth = tf.placeholder(tf.float32, shape = (None, None, None,self.num_output)) 

                # builds CNN and compute prediction
                self.pred = self._build()

                # I have already created a tensorflow session and saver objects
                self.sess = tf.Session()
                self.saver = tf.train.Saver()

                # also, I have defined the loss function and optimizer as
                self.loss = self._loss_function()
                self.optimizer = tf.train.AdamOptimizer(learning_rate).minimize(self.loss)

                if restore_file is not None:
                    print("model exists...loading from the model")
                    self.saver.restore(self.sess,restore_file)
                else:
                    print("model does not exist...initializing")
                    self.sess.run(tf.initialize_all_variables())

def _build(self):
    #builds CNN

def _loss_function(self):
    # computes loss


# 
def train(self, train_x, train_y, val_x, val_y):
    # uses mini batch to minimize the loss
    self.sess.run(self.optimizer, feed_dict = {self.image:sample, self.groundtruth:gt})


    # I save the session after n=10 epochs as:
    if epoch%n==0:
        self.saver.save(sess,'snapshot',global_step = epoch)

# finally my predict function is
def predict(self, X):
    return self.sess.run(self.pred, feed_dict={self.image:X})

I have trained two CNNs for two separate tasks independently. Each took around 1 day. Say, model1 and model2 are saved as 'snapshot-model1-10000' and 'snapshot-model2-10000' (with their corresponding meta files) respectively. I can test each model and compute its performance separately.

Now, I want to load these two models in a single script. I would naturally try to do as below:

cnn1 = CNN(..., restore_file='snapshot-model1-10000',..........) 
cnn2 = CNN(..., restore_file='snapshot-model2-10000',..........)

I encounter the error [The error message is long. I just copied/pasted a snippet of it.]

NotFoundError: Tensor name "Variable_26/Adam_1" not found in checkpoint files /home/amitkrkc/codes/A549_models/snapshot-hela-95000
     [[Node: save_1/restore_slice_85 = RestoreSlice[dt=DT_FLOAT, preferred_shard=-1, _device="/job:localhost/replica:0/task:0/cpu:0"](_recv_save_1/Const_0, save_1/restore_slice_85/tensor_name, save_1/restore_slice_85/shape_and_slice)]]

Is there a way to load from these two files two separate CNNs? Any suggestion/comment/feedback is welcome.

Thank you,

691

asked Feb 01 '17 21:02

Amit

1 Answers

Yes there is. Use separate graphs.

g1 = tf.Graph()
g2 = tf.Graph()

with g1.as_default():
    cnn1 = CNN(..., restore_file='snapshot-model1-10000',..........) 
with g2.as_default():
    cnn2 = CNN(..., restore_file='snapshot-model2-10000',..........)

EDIT:

If you want them into same graph. You'll have to rename some variables. One idea is have each CNN in separate scope and let saver handle variables in that scope e.g.:

saver = tf.train.Saver(tf.get_collection(tf.GraphKeys.GLOBAL_VARIABLES), scope='model1')

and in cnn wrap all your construction in scope:

with tf.variable_scope('model1'):
    ...

EDIT2:

Other idea is renaming variables which saver manages (since I assume you want to use your saved checkpoints without retraining everything. Saving allows different variable names in graph and in checkpoint, have a look at documentation for initialization.

122

answered Oct 15 '22 04:10

nmiculinic

Related questions
                            
                                Keras Tensorflow - Exception while predicting from multiple threads
                            
                                Average weights in keras models
                            
                                How to print the result of `tf.data.Dataset.from_tensor_slices`?
                            
                                How to make an if statement using a boolean Tensor
                            
                                Regularization for LSTM in tensorflow
                            
                                Where does next_batch in the TensorFlow tutorial batch_xs, batch_ys = mnist.train.next_batch(100) come from?
                            
                                Items of feature_columns must be a _FeatureColumn Given: _VocabularyListCategoricalColumn
                            
                                How can I implement a weighted cross entropy loss in tensorflow using sparse_softmax_cross_entropy_with_logits
                            
                                tensorflow dataset shuffle then batch or batch then shuffle
                            
                                How to use several summary collections in Tensorflow?
                            
                                How to add Tensorboard to a Tensorflow estimator process
                            
                                How to understand sess.as_default() and sess.graph.as_default()?
                            
                                ValueError: Shape mismatch: The shape of labels (received (15,)) should equal the shape of logits except for the last dimension (received (5, 3))
                            
                                How do I select certain columns of a 2D tensor in TensorFlow?
                            
                                Slicing a tensor by using indices in Tensorflow
                            
                                Not able to import tensorflow_datasets module in jupyter notebook
                            
                                How to iterate a dataset several times using TensorFlow's Dataset API?
                            
                                How to correct unstable loss and accuracy during training? (binary classification)
                            
                                Avoid tensorflow print on standard error
                            
                                How to locally view tensorboard of remote server

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With