I'm using <code>tf.estimator</code> in TensorFlow 1.4 and <code>tf.estimator.train_and_evaluate</code> is great but I need early stopping. What's the prefered way of adding that? I assume there is some <code>tf.train.SessionRunHook</code> somewhere for this. I saw that there was an old contrib package with a <code>ValidationMonitor</code> that seemed to have early stopping, but it doesn't seem to be around anymore in 1.4. Or will the preferred way in the future be to rely on <code>tf.keras</code> (with which early stopping is really easy) instead of <code>tf.estimator/tf.layers/tf.data</code>, perhaps?

First, you must name the loss to make it available to the early stopping call. If your loss variable is named "loss" in the estimator, the line <pre class="prettyprint"><code>copyloss = tf.identity(loss, name="loss") </code></pre> right beneath it will work. Then, create a hook with this code. <pre class="prettyprint"><code>class EarlyStopping(tf.train.SessionRunHook): def __init__(self,smoothing=.997,tolerance=.03): self.lowestloss=float("inf") self.currentsmoothedloss=-1 self.tolerance=tolerance self.smoothing=smoothing def before_run(self, run_context): graph = ops.get_default_graph() #print(graph) self.lossop=graph.get_operation_by_name("loss") #print(self.lossop) #print(self.lossop.outputs) self.element = self.lossop.outputs[0] #print(self.element) return tf.train.SessionRunArgs([self.element]) def after_run(self, run_context, run_values): loss=run_values.results[0] #print("loss "+str(loss)) #print("running average "+str(self.currentsmoothedloss)) #print("") if(self.currentsmoothedloss<0): self.currentsmoothedloss=loss*1.5 self.currentsmoothedloss=self.currentsmoothedloss*self.smoothing+loss*(1-self.smoothing) if(self.currentsmoothedloss<self.lowestloss): self.lowestloss=self.currentsmoothedloss if(self.currentsmoothedloss>self.lowestloss+self.tolerance): run_context.request_stop() print("REQUESTED_STOP") raise ValueError('Model Stopping because loss is increasing from EarlyStopping hook') </code></pre> this compares an exponentially smoothed loss validation with its lowest value, and if it is higher by tolerance, it stops training. If it stops too early, raising tolerance and smoothing will make it stop later. Keep smoothing below one, or it will never stop. You can replace the logic in after_run with something else if you want to stop based on a different condition. Now, add this hook to the evaluation spec. Your code should look something like this: <pre class="prettyprint"><code>eval_spec=tf.estimator.EvalSpec(input_fn=lambda:eval_input_fn(batchsize),steps=100,hooks=[EarlyStopping()])# </code></pre> Important note: The function, run_context.request_stop() is broken in the train_and_evaluate call, and doesn't stop training. So, I raised a value error to stop training. So you have to wrap the train_and_evaluate call in a try catch block like this: <pre class="prettyprint"><code>try: tf.estimator.train_and_evaluate(classifier,train_spec,eval_spec) except ValueError as e: print("training stopped") </code></pre> if you don't do this, the code will crash with an error when training stops.

Yes, there is <code>tf.train.StopAtStepHook</code>: <blockquote> This hook requests stop after either a number of steps have been executed or a last step has been reached. Only one of the two options can be specified. </blockquote> You can also extend it and implement your own stopping strategy based on the step results. <pre class="prettyprint"><code>class MyHook(session_run_hook.SessionRunHook): ... def after_run(self, run_context, run_values): if condition: run_context.request_stop() </code></pre>

Early stopping with tf.estimator, how?

Tags:

python

neural-network

tensorflow

keras

tensorflow-estimator

I'm using tf.estimator in TensorFlow 1.4 and tf.estimator.train_and_evaluate is great but I need early stopping. What's the prefered way of adding that?

I assume there is some tf.train.SessionRunHook somewhere for this. I saw that there was an old contrib package with a ValidationMonitor that seemed to have early stopping, but it doesn't seem to be around anymore in 1.4. Or will the preferred way in the future be to rely on tf.keras (with which early stopping is really easy) instead of tf.estimator/tf.layers/tf.data, perhaps?

631

asked Nov 06 '17 12:11

Carl Thomé

3 Answers

Good news! tf.estimator now has early stopping support on master and it looks like it will be in 1.10.

estimator = tf.estimator.Estimator(model_fn, model_dir)  os.makedirs(estimator.eval_dir())  # TODO This should not be expected IMO.  early_stopping = tf.contrib.estimator.stop_if_no_decrease_hook(     estimator,     metric_name='loss',     max_steps_without_decrease=1000,     min_steps=100)  tf.estimator.train_and_evaluate(     estimator,     train_spec=tf.estimator.TrainSpec(train_input_fn, hooks=[early_stopping]),     eval_spec=tf.estimator.EvalSpec(eval_input_fn))

147

answered Oct 13 '22 03:10

Carl Thomé

First, you must name the loss to make it available to the early stopping call. If your loss variable is named "loss" in the estimator, the line

copyloss = tf.identity(loss, name="loss")

right beneath it will work.

Then, create a hook with this code.

class EarlyStopping(tf.train.SessionRunHook):     def __init__(self,smoothing=.997,tolerance=.03):         self.lowestloss=float("inf")         self.currentsmoothedloss=-1         self.tolerance=tolerance         self.smoothing=smoothing     def before_run(self, run_context):         graph = ops.get_default_graph()         #print(graph)         self.lossop=graph.get_operation_by_name("loss")         #print(self.lossop)         #print(self.lossop.outputs)         self.element = self.lossop.outputs[0]         #print(self.element)         return tf.train.SessionRunArgs([self.element])     def after_run(self, run_context, run_values):         loss=run_values.results[0]         #print("loss "+str(loss))         #print("running average "+str(self.currentsmoothedloss))         #print("")         if(self.currentsmoothedloss<0):             self.currentsmoothedloss=loss*1.5         self.currentsmoothedloss=self.currentsmoothedloss*self.smoothing+loss*(1-self.smoothing)         if(self.currentsmoothedloss<self.lowestloss):             self.lowestloss=self.currentsmoothedloss         if(self.currentsmoothedloss>self.lowestloss+self.tolerance):             run_context.request_stop()             print("REQUESTED_STOP")             raise ValueError('Model Stopping because loss is increasing from EarlyStopping hook')

this compares an exponentially smoothed loss validation with its lowest value, and if it is higher by tolerance, it stops training. If it stops too early, raising tolerance and smoothing will make it stop later. Keep smoothing below one, or it will never stop.

You can replace the logic in after_run with something else if you want to stop based on a different condition.

Now, add this hook to the evaluation spec. Your code should look something like this:

eval_spec=tf.estimator.EvalSpec(input_fn=lambda:eval_input_fn(batchsize),steps=100,hooks=[EarlyStopping()])#

Important note: The function, run_context.request_stop() is broken in the train_and_evaluate call, and doesn't stop training. So, I raised a value error to stop training. So you have to wrap the train_and_evaluate call in a try catch block like this:

try:     tf.estimator.train_and_evaluate(classifier,train_spec,eval_spec) except ValueError as e:     print("training stopped")

if you don't do this, the code will crash with an error when training stops.

answered Oct 13 '22 02:10

user3806120

Yes, there is tf.train.StopAtStepHook:

This hook requests stop after either a number of steps have been executed or a last step has been reached. Only one of the two options can be specified.

You can also extend it and implement your own stopping strategy based on the step results.

class MyHook(session_run_hook.SessionRunHook):
  ...
  def after_run(self, run_context, run_values):
    if condition:
      run_context.request_stop()

answered Oct 13 '22 04:10

Maxim

Related questions
                            
                                superimpose matplotlib quiver on image
                            
                                SyntaxError: keyword argument repeated
                            
                                imshow colormap figure and the suptitle don't align in the center
                            
                                gitlab-ci.yml python -c 'multiple line cmd' failed
                            
                                Pandas series mean and standard deviation
                            
                                How to loop over nextPageToken using GoogleDrive's Python Quickstart
                            
                                OpenCV canny edge detection is not working properly on ideal square
                            
                                How can I click a pushButton on my PyQt5 code and allow it to execute/run another .py file?
                            
                                How can i skip files that does not exist file in the repository using python?
                            
                                Python selenium print frame source
                            
                                How to flip a byte in python?
                            
                                Set order of columns in DynamoDB table of AWS
                            
                                Center x-axis labels in line plot
                            
                                Enabling SSL on Flask + Google App Engine
                            
                                In matplotlib 2.0, how do I revert colorbar behaviour to that of matplotlib 1.5?
                            
                                Understanding lstm input shape in keras with different sequence
                            
                                Fitting a Lognormal Distribution in Python using CURVE_FIT
                            
                                Python , variable store in memory
                            
                                Trying load a pandas dataframe into Flask session and use that throughout the session
                            
                                MkDocs and MathJax

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Early stopping with tf.estimator, how?

Tags:

python

neural-network

tensorflow

keras

tensorflow-estimator

Carl Thomé

People also ask

3 Answers

Carl Thomé

user3806120

Maxim

Recent Activity

Donate For Us