 

Object detection classification / A checkpoint was restored (e.g. tf.train.Checkpoint.restore or tf.keras.Model.load_weights)

I am trying to do classification with object detection on Colab. I am using "ssd_resnet101_v1_fpn_640x640_coco17_tpu-8.config". When I start training I get this error. Training command:

!python model_main_tf2.py \
    --pipeline_config_path=training/ssd_resnet101_v1_fpn_640x640_coco17_tpu-8.config \
    --model_dir=training \
    --alsologtostderr
WARNING:tensorflow:A checkpoint was restored (e.g. tf.train.Checkpoint.restore or tf.keras.Model.load_weights) but not all checkpointed values were used. See above for specific issues. Use expect_partial() on the load status object, e.g. tf.train.Checkpoint.restore(...).expect_partial(), to silence these warnings, or use assert_consumed() to make the check explicit. See https://www.tensorflow.org/guide/checkpoint#loading_mechanics for details.
W1130 13:39:27.991891 140559633127296 util.py:158] A checkpoint was restored (e.g. tf.train.Checkpoint.restore or tf.keras.Model.load_weights) but not all checkpointed values were used. See above for specific issues. Use expect_partial() on the load status object, e.g. tf.train.Checkpoint.restore(...).expect_partial(), to silence these warnings, or use assert_consumed() to make the check explicit. See https://www.tensorflow.org/guide/checkpoint#loading_mechanics for details.
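(For reference, the fix the warning itself suggests, expect_partial(), applies when you restore a checkpoint in your own code; a minimal sketch with a placeholder model and path, not taken from the question:)

import tensorflow as tf

# Placeholder model and checkpoint path, only to illustrate expect_partial().
net = tf.keras.Sequential([tf.keras.layers.Dense(4)])
net.build(input_shape=(None, 8))

ckpt = tf.train.Checkpoint(model=net)
ckpt.write('demo-ckpt')  # write something we can restore

# expect_partial() declares that not every saved value has to be matched/used,
# which is exactly what silences the warning above.
ckpt.restore('demo-ckpt').expect_partial()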
asked Nov 30 '20 by shine1189


People also ask

What is TF train checkpoint?

The checkpoint includes variables created by this object and any trackable objects it depends on at the time Checkpoint.write() is called. write does not number checkpoints, increment save_counter, or update the metadata used by tf.train.
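A minimal sketch of the write/save distinction (placeholder model and file prefixes, not from the question):

import tensorflow as tf

# Track a model in a checkpoint object; an optimizer could be tracked too.
model = tf.keras.Sequential([tf.keras.layers.Dense(2)])
model.build(input_shape=(None, 3))
ckpt = tf.train.Checkpoint(model=model)

ckpt.write('manual_ckpt')    # bare write: no numbering, save_counter untouched
ckpt.save('numbered_ckpt')   # save: numbers files (numbered_ckpt-1, -2, ...) and bumps save_counter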

How do you continue training in Tensorflow?

To continue training a loaded model with checkpoints, we simply rerun the model.fit function with the callback still passed. This, however, overwrites the currently saved best model, so make sure to change the checkpoint file path if this is undesired.
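A minimal sketch of that pattern (placeholder toy data and file name; the point is re-calling model.fit with the same ModelCheckpoint callback):

import tensorflow as tf

# Toy model and data, just to illustrate re-running fit with the callback.
model = tf.keras.Sequential([tf.keras.layers.Dense(1, input_shape=(4,))])
model.compile(optimizer='adam', loss='mse')

# Change the filepath for a new run if you do not want to overwrite the
# previously saved "best" weights.
ckpt_cb = tf.keras.callbacks.ModelCheckpoint(
    filepath='run2_best.ckpt', save_weights_only=True,
    save_best_only=True, monitor='loss')

x = tf.random.normal((32, 4))
y = tf.random.normal((32, 1))
model.fit(x, y, epochs=2, callbacks=[ckpt_cb])  # initial training
model.fit(x, y, epochs=2, callbacks=[ckpt_cb])  # continue: just call fit again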


2 Answers

I was dealing with the same error. I assume that the training stopped when you got the error you cited above. If so, you might want to check your folder paths.

I was able to get rid of the error myself when I figured out that I was trying to create a new model, but TF was looking at a 'model_dir' folder that contained checkpoints from my previous model. Because my num_steps was not greater than the num_steps used in the previous model, TF effectively stopped running the training because the num_steps had already been completed.

By changing the model_dir to a brand new folder, I was able to overcome this error and begin training a new model. Hopefully this works for you as well.
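Concretely, that just means re-running the question's command with model_dir pointing at a fresh, empty folder ('training_v2' below is only a placeholder name):

!python model_main_tf2.py \
    --pipeline_config_path=training/ssd_resnet101_v1_fpn_640x640_coco17_tpu-8.config \
    --model_dir=training_v2 \
    --alsologtostderr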

answered Oct 23 '22 by Brad G Grounds


If anyone is trying to continue their training, the solution as @GbG mentioned is to update your num_steps value in the pipeline.config:

Original:

  num_steps: 25000
  optimizer {
    momentum_optimizer: {
      learning_rate: {
        cosine_decay_learning_rate {
          learning_rate_base: .04
          total_steps: 25000

Updated:

  num_steps: 50000
  optimizer {
    momentum_optimizer: {
      learning_rate: {
        cosine_decay_learning_rate {
          learning_rate_base: .04
          total_steps: 50000
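
The same edit can also be made programmatically; a minimal sketch assuming the Object Detection API's config_util helpers and the field layout shown in the excerpt above (not part of the original answer):

from object_detection.utils import config_util

PIPELINE_CONFIG = 'training/ssd_resnet101_v1_fpn_640x640_coco17_tpu-8.config'
NEW_STEPS = 50000  # must be larger than the steps already completed in model_dir

configs = config_util.get_configs_from_pipeline_file(PIPELINE_CONFIG)
train_config = configs['train_config']
train_config.num_steps = NEW_STEPS

# Keep the cosine-decay schedule in sync with the new step budget
# (field path assumed from the config excerpt above).
lr = train_config.optimizer.momentum_optimizer.learning_rate.cosine_decay_learning_rate
lr.total_steps = NEW_STEPS

pipeline_proto = config_util.create_pipeline_proto_from_configs(configs)
config_util.save_pipeline_config(pipeline_proto, 'training/')  # writes training/pipeline.config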
answered Oct 23 '22 by TomSelleck