I have trained a model by fine-tuning the pre-trained ssd_mobilenet_v2_coco_2018 model. I used the exact same pipeline.config file for training that ships inside the ssd_mobilenet_v2_coco_2018 pre-trained folder.
I only removed the batch_norm_trainable: true flag and changed the number of classes to 4.
After training the model on my custom dataset with 4 classes, I found that the concat and concat_1 nodes are exchanged with each other.
The pre-trained model has

| concat | 1x1917x1x4 |

while after training it becomes

| concat | 1x1917x5 |
I have attached both TensorBoard graph visualisation images. The first image is the pre-trained ssd_mobilenet_v2_coco_2018 graph.
The node exchange can be seen in the rightmost corner of the image. In the pre-trained graph, the Postprocess layer connects to concat_1 and Squeeze connects to concat. After training, the graph shows the complete reverse: the Postprocess layer connects to concat and Squeeze connects to concat_1.
Further, I also found that in the pre-trained model graph the Preprocessor takes ToFloat as input, while after training the graph shows Cast as the input to the Preprocessor.
I have fed the input to the model as tfrecords.
Each node takes zero or more tensors as inputs and produces a tensor as an output. One type of node is a constant. Like all TensorFlow constants, it takes no inputs, and it outputs a value it stores internally.
Graphs are data structures that contain a set of tf.Operation objects, which represent units of computation, and tf.Tensor objects, which represent the units of data that flow between operations. They are defined in a tf.Graph context.
The TensorFlow Python library has a default graph to which ops constructors add nodes. The default graph is sufficient for many applications. See the Graph class documentation for how to explicitly manage multiple graphs.
What Are Computational Graphs? In TensorFlow, machine learning algorithms are represented as computational graphs. A computational graph is a type of directed graph where nodes describe operations, while edges represent the data (tensor) flowing between those operations.
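As a small illustration of those concepts, the sketch below (assuming TensorFlow 2.x; the function name `add` is just an example) builds a graph through tf.function and lists the tf.Operation names it contains:

```python
import tensorflow as tf

@tf.function
def add(a, b):
    # Each call to tf.add becomes one node (a tf.Operation) in the traced graph.
    return tf.add(a, b)

# Trace the function once to obtain a concrete tf.Graph.
concrete = add.get_concrete_function(tf.constant(1.0), tf.constant(2.0))
op_names = [op.name for op in concrete.graph.get_operations()]
print(op_names)  # includes an op named 'Add', plus the input placeholders
```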
You create and run a graph in TensorFlow by using tf.function, either as a direct call or as a decorator. tf.function takes a regular function as input and returns a Function. A Function is a Python callable that builds TensorFlow graphs from the Python function. You use a Function in the same way as its Python equivalent.
Grappler is the default graph optimization system in the TensorFlow runtime. Grappler applies optimizations in graph mode (within tf.function) to improve the performance of your TensorFlow computations through graph simplifications and other high-level optimizations such as inlining function bodies to enable inter-procedural optimizations.
To explain, the print statement is executed when Function runs the original code in order to create the graph in a process known as "tracing". Tracing captures the TensorFlow operations into a graph, and print is not captured in the graph. That graph is then executed for all three calls without ever running the Python code again.
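A minimal sketch of that tracing behaviour (assuming TensorFlow 2.x; `trace_log` is just an illustrative Python list used to observe the side effect):

```python
import tensorflow as tf

trace_log = []

@tf.function
def f(x):
    # This Python side effect runs only while the function is being traced.
    trace_log.append("traced")
    return x + 1

f(tf.constant(1))
f(tf.constant(2))
f(tf.constant(3))
# All three calls share the same input signature, so one graph is traced
# once and then reused; the Python body ran a single time:
print(len(trace_log))  # 1
```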
Most probably, the difference is not in the graph, but simply in the names of the nodes; i.e. the nodes concat and concat_1 on the left are the same nodes as, respectively, concat_1 and concat on the right.
The thing is, when you don't provide an explicit name to a node, TensorFlow needs to come up with one, and its naming convention is rather uninventive. The first time it needs to name a node, it does so with the node's type. When it encounters the same situation again, it simply appends _ plus an increasing number to the name.
Take this example (written with the TF1-style graph API; under TF2 you would use tf.compat.v1 and disable eager execution):
import tensorflow as tf
x = tf.placeholder(tf.float32, (1,), name='x')
y = tf.placeholder(tf.float32, (1,), name='y')
z = tf.placeholder(tf.float32, (1,), name='z')
xy = tf.concat([x, y], axis=0)  # named 'concat'
xz = tf.concat([x, z], axis=0)  # named 'concat_1'
The graph looks like this:
Now if we construct the same graph, but this time creating xz
before xy
, we get the following graph:
So the graph did not really change -- only the names did. This is probably what happened in your case: the same operations were created but not in the same order.
The fact that names change for stateless nodes like concat is unimportant, because no weights will be misrouted when loading a saved model, for example. Nonetheless, if naming stability is important to you, you can either give explicit names to your operations or place them in distinct scopes:
xy = tf.concat([x, y], axis=0, name='xy')
xz = tf.concat([x, z], axis=0, name='xz')
It is much more problematic if variables switch names. This is one of the reasons why tf.get_variable -- which forces variables to have a name and raises an error when a name conflict occurs -- was the preferred way of dealing with variables in the pre-TF2 era.