When to use the .ckpt vs .hdf5 vs. .pb file extensions in Tensorflow model saving?

Tags:

Tensorflow explains that models can be saved in three file formats: .ckpt or .hdf5 or .pb. There's a lot of documentation so it would be nice to get a simpler comparison of when to use which file format.

Here's my current understanding:

ckpt

From https://www.tensorflow.org/guide/checkpoint:

Checkpoints capture the exact value of all parameters (tf.Variable objects) used by a model. Checkpoints do not contain any description of the computation defined by the model and thus are typically only useful when source code that will use the saved parameter values is available.

So it seems like you should use cpkt for checkpointing during training when you know that your source code will be the same. Why is it recommended though over .pb and .hdf5? Does it save space? Does it include data that the other file formats do not?

Also from https://www.tensorflow.org/guide/checkpoint:

The SavedModel format on the other hand includes a serialized description of the computation defined by the model in addition to the parameter values (checkpoint). Models in this format are independent of the source code that created the model. They are thus suitable for deployment via TensorFlow Serving, TensorFlow Lite, TensorFlow.js, or programs in other programming languages (the C, C++, Java, Go, Rust, C# etc. TensorFlow APIs).

The SavedModel format is .pb plus some metadata. So you should save in .pb when you are deploying a model?

hdf5

Use when saving the model weights (matrix of numbers) only?

782

asked Jan 23 '20 21:01

skeller88

1 Answers

It seems you already know some of the differences, but just to add.

.ckpt
This is mainly used for resuming the training and also to allow users to customize savepoints and load to (ie. Highest Accuracy, Latest Trained Model, etc).
And also to create different models from different training checkpoints.
This only saves the weights of the variables or the graph therefore as you indicated you need to have full architectures and functions used.

.pb (Protobuffer)
This is the TensorFlow file format which saves everything about the Model including custom objects, this is the recommended file format to ensure maximum portability when using and exporting to different platforms (ie. Tensorflow Lite, Tensorflow Serving, etc.).

.h5 (HD5F)
This is the suggested saving format of Native Keras, which also saves everything about the model but when used in TensorFlow 2.1.0 (import tensorflow.keras) it will not save the custom objects automatically and will require additional steps to be performed.

You could read more about it in this link.

187

answered Sep 27 '22 19:09

TF_Support

Related questions
                            
                                Python Version Numbering Scheme
                            
                                Do something at the beginning & end of methods
                            
                                How do I use the "group_by_window" function in TensorFlow
                            
                                What do the lines in Seaborn.Regplot represent
                            
                                Comparing a large number of graphs for isomorphism
                            
                                Selenium "selenium.common.exceptions.NoSuchElementException" when using Chrome
                            
                                How to get `python` to run Python 3 in WSL bash?
                            
                                OpenCV - Calibrate fisheye lens error (Ill-conditioned matrix)
                            
                                Projection of a point to a line segment Python Shapely
                            
                                No module named '_bz2' in python3
                            
                                What's the difference between using tf.expand_dims and tf.newaxis in Tensorflow?
                            
                                Using result_type with pandas apply function
                            
                                pandas - how to get last n groups of a groupby object and combine them as a dataframe
                            
                                Lots of edges on a graph plot in python
                            
                                Cannot compare types 'ndarray(dtype=int64)' and 'str'
                            
                                Python multiprocessing crashes docker container
                            
                                How do I check if current code is part of a try-except-block?
                            
                                from_logits=True and from_logits=False get different training result for tf.losses.CategoricalCrossentropy for UNet
                            
                                Exclude type in Python typing annotation
                            
                                What is the difference between tf.keras and tf.python.keras?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

When to use the .ckpt vs .hdf5 vs. .pb file extensions in Tensorflow model saving?

Tags:

python

hdf5

tensorflow

tensorflow2.0

ckpt

skeller88

People also ask

1 Answers

TF_Support

Recent Activity

Donate For Us