Loading a checkpoint and fine-tuning with tf.estimator.Estimator

Tags: tensorflow

We're trying to port old training code to a more tf.estimator.Estimator-compliant design. In the original code we fine-tune the original model on a target dataset. Only some layers are loaded from the checkpoint before training starts, using a combination of variables_to_restore and init_fn with MonitoredTrainingSession. How can this kind of partial weight loading be achieved with the tf.estimator.Estimator approach?


asked Sep 26 '17 10:09

jrabary

1 Answer

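You have two options; the first one is simpler:

1- Call tf.train.init_from_checkpoint in your model_fn, with an assignment map that names only the variables or scopes you want to load.

2- model_fn returns an EstimatorSpec. You can set a scaffold via EstimatorSpec, for example a tf.train.Scaffold whose init_fn restores the selected variables.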
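As a rough illustration of the first option, here is a minimal sketch, assuming a TensorFlow release that still ships tf.estimator and the 1.x-style API (reached here through tensorflow.compat.v1). The scope names "backbone" and "head", the params keys "pretrained_ckpt" and "num_classes", the feature key "x", and the helper build_assignment_map are all hypothetical and would need to match your own graph and checkpoint. Only the training path is shown.

```python
def build_assignment_map(scopes):
    """Map each checkpoint scope onto the same scope in the new graph.

    Hypothetical helper. The trailing slash asks init_from_checkpoint to
    match every variable under the scope rather than one variable.
    """
    return {s.rstrip("/") + "/": s.rstrip("/") + "/" for s in scopes}


def model_fn(features, labels, mode, params):
    # 1.x-style API, imported inside the function so that the pure-Python
    # helper above can be used without TensorFlow installed.
    import tensorflow.compat.v1 as tf

    # Hypothetical network: a pretrained "backbone" and a new "head".
    with tf.variable_scope("backbone"):
        hidden = tf.layers.dense(features["x"], 64, activation=tf.nn.relu)
    with tf.variable_scope("head"):
        logits = tf.layers.dense(hidden, params["num_classes"])

    # Option 1: override the initializers of the "backbone" variables with
    # values from the checkpoint. "head" is not in the map, so it keeps
    # its default (random) initialization.
    tf.train.init_from_checkpoint(
        params["pretrained_ckpt"], build_assignment_map(["backbone"])
    )

    loss = tf.losses.sparse_softmax_cross_entropy(labels=labels, logits=logits)
    train_op = tf.train.AdamOptimizer(1e-4).minimize(
        loss, global_step=tf.train.get_global_step()
    )
    return tf.estimator.EstimatorSpec(mode=mode, loss=loss, train_op=train_op)
```

This model_fn would then be passed to tf.estimator.Estimator together with a params dict holding the checkpoint path. For the second option, the same EstimatorSpec call would instead receive scaffold=tf.train.Scaffold(init_fn=...), where init_fn(scaffold, session) restores the chosen variables with a tf.train.Saver built over them. In both cases the pretrained values apply only when variables are freshly initialized; once the Estimator has written its own checkpoint to model_dir, training resumes from that checkpoint.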

answered Jan 03 '23 01:01

user1454804

