How do I get the gradient of a keras model with respect to its inputs?

Tags:

I just asked a question on the same topic but for custom models (How do I find the derivative of a custom model in Keras?) but realised quickly that this was trying to run before I could walk so that question has been marked as a duplicate of this one.

I've tried to simplify my scenario and now have a (not custom) keras model consisting of 2 Dense layers:

inputs = tf.keras.Input((cols,), name='input')

layer_1 = tf.keras.layers.Dense(
        10,
        name='layer_1',
        input_dim=cols,
        use_bias=True,
        kernel_initializer=tf.constant_initializer(0.5),
        bias_initializer=tf.constant_initializer(0.1))(inputs)

outputs = tf.keras.layers.Dense(
        1,
        name='alpha',
        use_bias=True,
        kernel_initializer=tf.constant_initializer(0.1),
        bias_initializer=tf.constant_initializer(0))(layer_1)

model = tf.keras.Model(inputs=inputs, outputs=outputs)

prediction = model.predict(input_data)
# gradients = ...

Now I would like to know the derivative of outputs with respect to inputs for inputs = input_data.

What I've tried so far:

This answer to a different question suggests running grads = K.gradients(model.output, model.input). However, if I run that I get this error;

tf.gradients is not supported when eager execution is enabled. Use tf.GradientTape instead.

I can only assume this is something to do with eager execution now being the default.

Another approach was in the answer to my question on custom keras models, which involved adding this:

with tf.GradientTape() as tape:
    x = tf.Variable(np.random.normal(size=(10, rows, cols)), dtype=tf.float32)
    out = model(x)

What I don't understand about this approach is how I'm supposed to load the data. It requires x to be a variable, but my x is a tf.keras.Input object. I also don't understand what that with statement is doing, some kind of magic but I don't understand it.

There's a very similar-sounding question to this one here: Get Gradients with Keras Tensorflow 2.0 although the application and scenario are sufficiently different for me to have difficulty applying the answer to this scenario. It did lead me to add the following to my code:

with tf.GradientTape() as t:
    t.watch(outputs)

That does work, but now what? I run model.predict(...), but then how do I get my gradients? The answer says I should run t.gradient(outputs, x_tensor).numpy(), but what do I put in for x_tensor? I don't have an input variable. I tried running t.gradient(outputs, model.inputs) after running predict, but that resulted in this:

enter image description here

938

asked Jan 04 '20 12:01

quant

1 Answers

I ended up getting this to work with a variant of the answer to this question: Get Gradients with Keras Tensorflow 2.0

x_tensor = tf.convert_to_tensor(input_data, dtype=tf.float32)
with tf.GradientTape() as t:
    t.watch(x_tensor)
    output = model(x_tensor)

result = output
gradients = t.gradient(output, x_tensor)

This allows me to obtain both the output and the gradient without redundant computation.

107

answered Sep 20 '22 15:09

quant

Related questions
                            
                                list_local_device tensorflow does not detect gpu
                            
                                Add hand-crafted features to Keras sequential model
                            
                                How to use Keras Embedding layer when there are more than 1 text features
                            
                                Keras plot_model not showing the input layer appropriately
                            
                                Error in load a model saved by callbakcs.ModelCheckpoint() in Keras
                            
                                ImportError: cannot import name 'transpose_shape'
                            
                                Why use same padding with max pooling?
                            
                                Keras model.predict() slower on first iteration then gets faster
                            
                                Why does model.losses return regularization losses?
                            
                                TF2 / Keras slice tensor using [:, :, 0]
                            
                                Keras Convolution2D Input: Error when checking model input: expected convolution2d_input_1 to have shape
                            
                                How can I use categorical one-hot labels for training with Keras?
                            
                                Frozen model from Keras doesn't predict after restoration
                            
                                extracting Bottleneck features using pretrained Inceptionv3 - differences between Keras' implementation and Native Tensorflow implementation
                            
                                How to feed into LSTM with 4 dimensional input?
                            
                                How does Keras back propagate custom loss function?
                            
                                How is the training accuracy in Keras determined for every epoch?
                            
                                Keras Denoising Autoencoder (tabular data)
                            
                                Error when checking target: expected dense_3 to have shape (2,) but got array with shape (1,)
                            
                                How can I reduce the number of CPUs used by Tensorlfow/Keras?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I get the gradient of a keras model with respect to its inputs?

Tags:

keras

tensorflow2.0

quant

People also ask

1 Answers

quant

Recent Activity

Donate For Us