I am using TensorFlow checkpointing, saving a checkpoint every 10 epochs with the following code:
checkpoint_dir = os.path.abspath(os.path.join(out_dir, "checkpoints"))
checkpoint_prefix = os.path.join(checkpoint_dir, "model")
...
if current_step % checkpoint_every == 0:
    path = saver.save(sess, checkpoint_prefix, global_step=current_step)
    print("Saved model checkpoint to {}\n".format(path))
The problem is that, as new checkpoint files are generated, the older model files are deleted automatically, so only the five most recent checkpoints remain on disk.
b) Checkpoint file: This is a binary file that contains the values of the weights, biases, and all the other saved variables. These files use the .ckpt extension.
Checkpoints capture the exact value of all parameters (tf.Variable objects) used by a model. Checkpoints do not contain any description of the computation defined by the model and thus are typically only useful when the source code that will use the saved parameter values is available.
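For illustration, here is a minimal sketch (assuming the same TF 1.x graph-building code used for training is available) of what that means in practice: the graph must be rebuilt from source first, and only then are the saved variable values loaded into it.

# Rebuild the same graph from the model's source code, then load the saved values.
saver = tf.train.Saver()
with tf.Session() as sess:
    saver.restore(sess, tf.train.latest_checkpoint(checkpoint_dir))
    # sess now holds the restored weights and biases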
This is the expected behavior: the documentation for tf.train.Saver says that by default only the 5 most recent checkpoint files are kept. To change that, set max_to_keep to the desired value when constructing the saver.
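For example, a minimal sketch using the same TF 1.x Saver API as in the question (50 is an arbitrary value chosen for illustration):

saver = tf.train.Saver(max_to_keep=50)    # keep the 50 most recent checkpoints
# or, to keep every checkpoint that is written:
saver = tf.train.Saver(max_to_keep=None)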