 

Learning rate of custom training loop for tensorflow 2.0

Are there any functions or methods that can show the learning rate when I use a TensorFlow 2.0 custom training loop?

Here is an example from the TensorFlow guide:

def train_step(images, labels):
  with tf.GradientTape() as tape:
    predictions = model(images)
    loss = loss_object(labels, predictions)
  gradients = tape.gradient(loss, model.trainable_variables)
  optimizer.apply_gradients(zip(gradients, model.trainable_variables))

  train_loss(loss)
  train_accuracy(labels, predictions)

How can I retrieve the current learning rate from the optimizer when the model is training?

I will be grateful for any help you can provide. :)

asked Sep 28 '19 by yun

People also ask

How long does it take to train a TensorFlow model?

Training usually takes between 2 and 8 hours, depending on the number of files and on how many models are queued for training.

How do I find the best learning rate in TensorFlow?

The technique can be described as follows: start with a very low learning rate, e.g. 1e-7. After each batch, increase the learning rate and record the loss and the learning rate. Stop when a very high learning rate (10 or more) is reached, or when the loss explodes. A value just before the loss starts to diverge is usually a good choice.
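Below is a minimal sketch of that sweep in a custom training loop, assuming model, loss_object, and a dataset of (images, labels) batches are already defined; the start value and growth factor are only illustrative choices.

import numpy as np
import tensorflow as tf

# Learning-rate range test sketch: sweep the LR upward and record the loss.
optimizer = tf.keras.optimizers.SGD(learning_rate=1e-7)
lrs, losses = [], []
lr, growth = 1e-7, 1.1  # start very low, multiply after each batch

for images, labels in dataset:
  optimizer.learning_rate.assign(lr)
  with tf.GradientTape() as tape:
    loss = loss_object(labels, model(images, training=True))
  gradients = tape.gradient(loss, model.trainable_variables)
  optimizer.apply_gradients(zip(gradients, model.trainable_variables))

  lrs.append(lr)
  losses.append(float(loss))
  lr *= growth
  if lr > 10 or np.isnan(losses[-1]):  # stop at a very high LR or when the loss explodes
    break

# Plot losses against lrs on a log scale and pick a value just before the loss diverges.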

What is learning rate in TensorFlow?

The learning rate is a hyperparameter that controls how much to change the model in response to the estimated error each time the model weights are updated.
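For instance, in Keras the learning rate is typically set when the optimizer is constructed; the value below is only an illustration.

import tensorflow as tf

# 1e-3 is an example value, not a recommendation.
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3)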




2 Answers

In TensorFlow 2.1, the Optimizer class has an undocumented method _decayed_lr (see definition here), which you can call in the training loop by supplying the dtype to cast the result to:

current_learning_rate = optimizer._decayed_lr(tf.float32)

Here's a more complete example with TensorBoard too.

train_step_count = 0
summary_writer = tf.summary.create_file_writer('logs/')

def train_step(images, labels):
  global train_step_count  # needed because we assign to it inside this function
  train_step_count += 1
  with tf.GradientTape() as tape:
    predictions = model(images)
    loss = loss_object(labels, predictions)
  gradients = tape.gradient(loss, model.trainable_variables)
  optimizer.apply_gradients(zip(gradients, model.trainable_variables))

  # optimizer._decayed_lr(tf.float32) is the current Learning Rate.
  # You can save it to TensorBoard like so:
  with summary_writer.as_default():
    tf.summary.scalar('learning_rate',
                      optimizer._decayed_lr(tf.float32),
                      step=train_step_count)
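You can then watch the learning_rate curve during training by pointing TensorBoard at the log directory, e.g. tensorboard --logdir logs/.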
answered by P Shved

In a custom training loop, you can call print(optimizer.lr.numpy()) to get the current learning rate (this works when the learning rate is a plain value; if you passed a LearningRateSchedule, use the _decayed_lr approach above instead).

If you are using keras api, you can define your own callback that records the current learning rate.

from tensorflow.keras.callbacks import Callback

class LRRecorder(Callback):
    """Record current learning rate. """
    def on_epoch_begin(self, epoch, logs=None):
        lr = self.model.optimizer.lr
        print("The current learning rate is {}".format(lr.numpy()))

# your other callbacks 
callbacks.append(LRRecorder())
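You can then pass it to fit along with your other callbacks; a minimal usage sketch, where x_train and y_train are placeholders:

model.fit(x_train, y_train, epochs=5, callbacks=callbacks)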

Update

Adam's update is roughly

w := w - base_lr * m / (sqrt(v) + eps) = w - act_lr * m,   where act_lr = base_lr / (sqrt(v) + eps)

The learning rate retrieved above is base_lr, but the effective step size act_lr changes adaptively during training. Taking the Adam optimizer as an example, act_lr depends on base_lr and v, where m and v are the first and second moments of the gradients, and every parameter has its own m and v. So if you want to know act_lr, you need to look at the optimizer slots of a specific variable. For example, to inspect the m and v slots of the variable Adam/dense/kernel, you can access them like this:

for var in optimizer.variables():
  if 'Adam/dense/kernel/m' in var.name:
    print(var.name, var.numpy())

  if 'Adam/dense/kernel/v' in var.name:
    print(var.name, var.numpy())

Then you can calculate act_lr with the formula above.
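Here is a rough sketch of that calculation, following the slot names from the loop above; it uses Adam's default epsilon of 1e-7 and ignores bias correction, so treat it as an approximation.

import tensorflow as tf

m = v = None
for var in optimizer.variables():
  if 'Adam/dense/kernel/m' in var.name:
    m = var
  if 'Adam/dense/kernel/v' in var.name:
    v = var

base_lr = optimizer._decayed_lr(tf.float32)
# Per-element effective learning rate for this variable (act_lr in the formula above).
act_lr = base_lr / (tf.sqrt(v) + 1e-7)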

answered by zihaozhihao