 

How does TensorFlow handle the differentials for L1 regularization?

Tags:

tensorflow

It seems that you can just build a cost function with tf.abs() and then hand it to TensorFlow's automatic gradient generation (see https://github.com/nfmcclure/tensorflow_cookbook/blob/master/03_Linear_Regression/04_Loss_Functions_in_Linear_Regressions/04_lin_reg_l1_vs_l2.py). But we know abs() is not differentiable at x = 0.

How is this done in TensorFlow? Does it just throw in a random number from [-1, 1]?

If someone could point me to the implementation, that would be great. Thanks!

(I looked for a tensorflow.py in the Git repository, but no such file even exists.)

asked Jan 07 '17 by teddy teddy


People also ask

How does TensorFlow do automatic differentiation?

To differentiate automatically, TensorFlow needs to remember what operations happen in what order during the forward pass. Then, during the backward pass, TensorFlow traverses this list of operations in reverse order to compute gradients.
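
A minimal TF 2.x sketch of this record-then-replay mechanism (the variable value and the toy function y = x*x are my own illustration, not part of the original excerpt):

import tensorflow as tf  # TF 2.x

x = tf.Variable(3.0)

# Forward pass: the tape records every operation applied to x.
with tf.GradientTape() as tape:
    y = x * x

# Backward pass: the tape is traversed in reverse to build dy/dx = 2x.
dy_dx = tape.gradient(y, x)
print(dy_dx.numpy())  # 6.0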

Is tf.abs differentiable?

tf.abs is non-differentiable in principle (its derivative is discontinuous at 0), but the same applies to tf.nn.relu.

How does TensorFlow compute derivatives?

TensorFlow calculates derivatives using automatic differentiation. This is different from both symbolic differentiation and numeric differentiation (a.k.a. finite differences). More than a smart math approach, it is a smart programming approach.
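
As a rough sketch of the difference (the test point x = 2.0 and the step size are arbitrary choices of mine): automatic differentiation applies the chain rule to the recorded ops and is exact, while finite differences only approximate the slope from two function evaluations.

import tensorflow as tf

def f(x):
    return tf.abs(x)

x = tf.constant(2.0)

# Automatic differentiation: exact chain-rule gradient.
with tf.GradientTape() as tape:
    tape.watch(x)  # constants must be watched explicitly
    y = f(x)
autodiff = tape.gradient(y, x)

# Numeric differentiation: central finite difference.
eps = 1e-4
numeric = (f(x + eps) - f(x - eps)) / (2 * eps)

print(autodiff.numpy(), numeric.numpy())  # 1.0 and ~1.0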

What are gradients in TensorFlow?

The gradients are the partial derivatives of the loss with respect to each of the trainable variables. TensorFlow presents each gradient together with the variable it belongs to, as members of a tuple inside a list. Displaying the shapes of the gradients and variables confirms that this is actually the case.
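
A small illustration of that (gradient, variable) pairing, assuming the TF 2.x API and two toy variables of my own choosing; optimizers consume exactly this structure via apply_gradients:

import tensorflow as tf

w = tf.Variable([[1.0, 2.0]])
b = tf.Variable([0.5])

with tf.GradientTape() as tape:
    loss = tf.reduce_sum(w) + b[0]

# One gradient tensor per variable, in the same order.
grads = tape.gradient(loss, [w, b])

# Pair each gradient with its variable, as described above.
grads_and_vars = list(zip(grads, [w, b]))
for g, v in grads_and_vars:
    print(g.shape, v.shape)  # each gradient's shape matches its variable's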


1 Answer

f(x) = abs(x) is differentiable everywhere except at x = 0. Its derivative equals:

f'(x) = 1 for x > 0, and f'(x) = -1 for x < 0 (i.e. f'(x) = sign(x) away from zero)

So the only question is how TensorFlow implements the derivative at x = 0. You can check this manually:

import tensorflow as tf  # TF 1.x API

# A variable sitting exactly at the non-differentiable point.
x = tf.Variable(0.0)
y = tf.abs(x)

# Ask TensorFlow for d|x|/dx at x = 0.
grad = tf.gradients(y, [x])[0]

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print(sess.run(grad))

It prints 0.0.
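
For the implementation the question asks about: the gradient of Abs is registered in tensorflow/python/ops/math_grad.py, and at the time of writing it is simply grad * sign(x). Since sign(0) == 0, the subgradient at zero comes out as 0 rather than a random value in [-1, 1]. An excerpt (not standalone code; ops and math_ops are TensorFlow-internal modules):

# From tensorflow/python/ops/math_grad.py
@ops.RegisterGradient("Abs")
def _AbsGrad(op, grad):
  x = op.inputs[0]
  # d|x|/dx is implemented as sign(x); sign(0) == 0,
  # which is why the gradient at x = 0 evaluates to 0.
  return grad * math_ops.sign(x)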

answered Sep 30 '22 by standy