Logging training and validation loss in tensorboard

Tags:

I'm trying to learn how to use tensorflow and tensorboard. I have a test project based on the MNIST neural net tutorial.

In my code, I construct a node that calculates the fraction of digits in a data set that are correctly classified, like this:

correct = tf.nn.in_top_k(self._logits, labels, 1) correct = tf.to_float(correct) accuracy = tf.reduce_mean(correct)

Here, self._logitsis the inference part of the graph, and labels is a placeholder that contains the correct labels.

Now, what I would like to do is evaluate the accuracy for both the training set and the validation set as training proceeds. I can do this by running the accuracy node twice, with different feed_dicts:

train_acc = tf.run(accuracy, feed_dict={images : training_set.images, labels : training_set.labels}) valid_acc = tf.run(accuracy, feed_dict={images : validation_set.images, labels : validation_set.labels})

This works as intended. I can print the values, and I can see that initially, the two accuracies will both increase, and eventually the validation accuracy will flatten out while the training accuracy keeps increasing.

However, I would also like to get graphs of these values in tensorboard, and I can not figure out how to do this. If I simply add a scalar_summary to accuracy, the logged values will not distinguish between training set and validation set.

I also tried creating two identical accuracy nodes with different names and running one on the training set and one on the validation set. I then add a scalar_summary to each of these nodes. This does give me two graphs in tensorboard, but instead of one graph showing the training set accuracy and one showing the validation set accuracy, they are both showing identical values that do not match either of the ones printed to the terminal.

I am probably misunderstanding how to solve this problem. What is the recommended way of separately logging the output from a single node for different inputs?

242

asked Dec 26 '15 13:12

user3468216

2 Answers

There are several different ways you could achieve this, but you're on the right track with creating different tf.summary.scalar() nodes. Since you must explicitly call SummaryWriter.add_summary() each time you want to log a quantity to the event file, the simplest approach is probably to fetch the appropriate summary node each time you want to get the training or validation accuracy:

accuracy = tf.reduce_mean(correct)  training_summary = tf.summary.scalar("training_accuracy", accuracy) validation_summary = tf.summary.scalar("validation_accuracy", accuracy)   summary_writer = tf.summary.FileWriter(...)  for step in xrange(NUM_STEPS):    # Perform a training step....    if step % LOG_PERIOD == 0:      # To log training accuracy.     train_acc, train_summ = sess.run(         [accuracy, training_summary],          feed_dict={images : training_set.images, labels : training_set.labels})     writer.add_summary(train_summ, step)       # To log validation accuracy.     valid_acc, valid_summ = sess.run(         [accuracy, validation_summary],         feed_dict={images : validation_set.images, labels : validation_set.labels})     writer.add_summary(valid_summ, step)

Alternatively, you could create a single summary op whose tag is a tf.placeholder(tf.string, []) and feed the string "training_accuracy" or "validation_accuracy" as appropriate.

answered Oct 01 '22 09:10

mrry

Another way to do it, is to use a second file writer. So you are able to use the merge_summaries command.

train_writer = tf.summary.FileWriter(FLAGS.summaries_dir + '/train',                                       sess.graph) test_writer = tf.summary.FileWriter(FLAGS.summaries_dir + '/test') tf.global_variables_initializer().run()

Here is the complete documentation. This works for me fine : TensorBoard: Visualizing Learning

answered Oct 01 '22 08:10

stillPatrick

Related questions
                            
                                Getting started with secure AWS CloudFront streaming with Python
                            
                                Configuring Python to use additional locations for site-packages
                            
                                Pythonic Style for Multiline List Comprehension [duplicate]
                            
                                How to remove outline of circle marker when using pyplot.plot in matplotlib
                            
                                Use of PunktSentenceTokenizer in NLTK
                            
                                Find and draw the largest contour in opencv on a specific color (Python)
                            
                                aws lambda: Error: Runtime exited with error: signal: killed
                            
                                How to create a draggable legend in matplotlib?
                            
                                How to get the common name for a pytz timezone eg. EST/EDT for America/New_York
                            
                                theano - print value of TensorVariable
                            
                                Nice IDE with GUI designer for wxPython or Tkinter [closed]
                            
                                Parse annotations from a pdf
                            
                                how to generate a graph/diagram like Google Analytics's Visitor Flow?
                            
                                Does Python have a function to reduce fractions?
                            
                                Docstrings vs Comments
                            
                                How to properly add hours to a pandas.tseries.index.DatetimeIndex?
                            
                                How to use bisect.insort_left with a key?
                            
                                How to return a subset of a list that matches a condition [duplicate]
                            
                                Why does “np.inf // 2” result in NaN and not infinity?
                            
                                Global dictionaries don't need keyword global to modify them? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Logging training and validation loss in tensorboard

Tags:

python

tensorflow

tensorboard

user3468216

People also ask

2 Answers

mrry

stillPatrick

Recent Activity

Donate For Us