I am working through Assignment 6 of the Udacity Deep Learning course. I am unsure why the zip() function is used in these steps to apply the gradients.
Here is the relevant code:
# Define the loss function.
# Note: tf.concat(dim, values) is the pre-1.0 TensorFlow argument order;
# newer versions expect tf.concat(values, axis).
logits = tf.nn.xw_plus_b(tf.concat(0, outputs), w, b)
loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits(logits, tf.concat(0, train_labels)))

# Optimizer.
global_step = tf.Variable(0)
# staircase=True means the learning rate decays at discrete intervals
# rather than continuously.
learning_rate = tf.train.exponential_decay(10.0, global_step, 5000, 0.1, staircase=True)
optimizer = tf.train.GradientDescentOptimizer(learning_rate)
gradients, v = zip(*optimizer.compute_gradients(loss))
gradients, _ = tf.clip_by_global_norm(gradients, 1.25)
optimizer = optimizer.apply_gradients(zip(gradients, v), global_step=global_step)
What is the purpose of applying the zip() function? Why are gradients and v stored that way? I thought zip(*iterable) returned just one zip object.
Gradient tapes: TensorFlow "records" relevant operations executed inside the context of a tf.GradientTape onto a "tape". TensorFlow then uses that tape to compute the gradients of a "recorded" computation using reverse-mode differentiation.

TensorFlow calculates derivatives using automatic differentiation. This is different from symbolic differentiation and from numeric differentiation (a.k.a. finite differences). More than a clever mathematical approach, it is a clever programming approach.

Calling minimize() takes care of both computing the gradients and applying them to the variables. If you want to process the gradients before applying them, you can instead use the optimizer in three steps: compute the gradients with tf.GradientTape, process them as you wish, and apply the processed gradients with apply_gradients().
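For comparison, a minimal sketch of that three-step pattern in current TensorFlow 2 might look like this (the model, optimizer, and dummy data below are assumptions for illustration, not part of the assignment code):

import tensorflow as tf

# Hypothetical model and data, just to have something to differentiate.
model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
optimizer = tf.keras.optimizers.SGD(learning_rate=0.1)
x = tf.random.normal([8, 4])
y = tf.random.normal([8, 1])

# 1. Compute the gradients with tf.GradientTape.
with tf.GradientTape() as tape:
    loss = tf.reduce_mean(tf.square(model(x) - y))
grads = tape.gradient(loss, model.trainable_variables)

# 2. Process the gradients (here: clip them by global norm).
grads, _ = tf.clip_by_global_norm(grads, 1.25)

# 3. Apply the processed gradients, re-paired with their variables.
optimizer.apply_gradients(zip(grads, model.trainable_variables))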
I don't know TensorFlow, but presumably optimizer.compute_gradients(loss) yields a list of (gradient, variable) tuples.
gradients, v = zip(*optimizer.compute_gradients(loss))
performs a transposition, creating a tuple of gradients and a tuple of variables. zip(*iterable) does return a single zip object, but here that object yields exactly two elements (all the gradients, then all the variables), and the tuple assignment gradients, v = ... unpacks them into the two names.
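You can see the transposition in plain Python, with strings standing in for the real gradient and variable tensors (these placeholder values are made up for illustration):

# Hypothetical stand-ins for the pairs compute_gradients would return.
pairs = [('g0', 'v0'), ('g1', 'v1'), ('g2', 'v2')]

gradients, v = zip(*pairs)  # unpack the two "columns" of the pair list
print(gradients)  # ('g0', 'g1', 'g2')
print(v)          # ('v0', 'v1', 'v2')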
gradients, _ = tf.clip_by_global_norm(gradients, 1.25)
then clips the gradients, and
optimizer = optimizer.apply_gradients(zip(gradients, v), global_step=global_step)
re-zips the clipped gradients with the variables, recreating an iterable of (gradient, variable) tuples, which is then passed to the optimizer.apply_gradients method.
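Continuing the toy example above, re-zipping restores the pairing once the gradients have been processed:

clipped = [g.upper() for g in gradients]  # stand-in for tf.clip_by_global_norm
print(list(zip(clipped, v)))  # [('G0', 'v0'), ('G1', 'v1'), ('G2', 'v2')]

So the two zip() calls are just a round trip: split the (gradient, variable) pairs apart so all the gradients can be clipped as one group, then pair them back up for apply_gradients.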