Printing the loss during TensorFlow training

Tags:

tensorflow

I am looking at the TensorFlow "MNIST For ML Beginners" tutorial, and I want to print out the training loss after every training step.

My training loop currently looks like this:

for i in range(100):     batch_xs, batch_ys = mnist.train.next_batch(100)     sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys})

Now, train_step is defined as:

train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)

Where cross_entropy is the loss which I want to print out:

cross_entropy = -tf.reduce_sum(y_ * tf.log(y))

One way to print this would be to explicitly compute cross_entropy in the training loop:

for i in range(100):     batch_xs, batch_ys = mnist.train.next_batch(100)     cross_entropy = -tf.reduce_sum(y_ * tf.log(y))     print 'loss = ' + str(cross_entropy)     sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys})

I now have two questions regarding this:

Given that cross_entropy is already computed during sess.run(train_step, ...), it seems inefficient to compute it twice, requiring twice the number of forward passes of all the training data. Is there a way to access the value of cross_entropy when it was computed during sess.run(train_step, ...)?
How do I even print a tf.Variable? Using str(cross_entropy) gives me an error...

Thank you!

663

asked Nov 20 '15 18:11

1 Answers

You can fetch the value of cross_entropy by adding it to the list of arguments to sess.run(...). For example, your for-loop could be rewritten as follows:

for i in range(100):     batch_xs, batch_ys = mnist.train.next_batch(100)     cross_entropy = -tf.reduce_sum(y_ * tf.log(y))     _, loss_val = sess.run([train_step, cross_entropy],                            feed_dict={x: batch_xs, y_: batch_ys})     print 'loss = ' + loss_val

The same approach can be used to print the current value of a variable. Let's say, in addition to the value of cross_entropy, you wanted to print the value of a tf.Variable called W, you could do the following:

for i in range(100):     batch_xs, batch_ys = mnist.train.next_batch(100)     cross_entropy = -tf.reduce_sum(y_ * tf.log(y))     _, loss_val, W_val = sess.run([train_step, cross_entropy, W],                                   feed_dict={x: batch_xs, y_: batch_ys})     print 'loss = %s' % loss_val     print 'W = %s' % W_val

152

answered Oct 02 '22 12:10

mrry

Related questions
                            
                                Python Headless MatplotLib / Pyplot [duplicate]
                            
                                List as a member of a python class, why is its contents being shared across all instances of the class?
                            
                                How to determine if Python script was run via command line?
                            
                                How to convert `ctime` to `datetime` in Python?
                            
                                Pandas: create named columns in DataFrame from dict
                            
                                Django test coverage vs code coverage
                            
                                Are functions objects in Python?
                            
                                tkinter: how to use after method
                            
                                Persisting data in Google Colaboratory
                            
                                Creating graph with date and time in axis labels with matplotlib
                            
                                Django admin hangs (until timeout error) for a specific model when trying to edit/create
                            
                                Setting LD_LIBRARY_PATH from inside Python
                            
                                Django ORM, group by day
                            
                                Rendering a dictionary in Jinja2
                            
                                Cassandra: File "cqlsh", line 95 except ImportError, e:
                            
                                How to get single value from dict with single entry?
                            
                                Specific reasons to favor pip vs. conda when installing Python packages
                            
                                How do I re-map python dict keys
                            
                                Can you make a python subprocess output stdout and stderr as usual, but also capture the output as a string? [duplicate]
                            
                                Using ^ to match beginning of line in Python regex

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Printing the loss during TensorFlow training

Tags:

python

tensorflow

Karnivaurus

People also ask

1 Answers

mrry

Recent Activity

Donate For Us