
How to wrap a custom TensorFlow loss function in Keras?

This is my third attempt to get a deep learning project off the ground. I'm working with protein sequences. First I tried TFLearn, then raw TensorFlow, and now I'm trying Keras.

The previous two attempts taught me a lot, and gave me some code and concepts that I can re-use. However, there has always been an obstacle: I've asked questions that the developers couldn't answer (in the case of TFLearn), or I've simply gotten bogged down (TensorFlow object introspection is tedious).

I have written this TensorFlow loss function, and I know it works:

import tensorflow as tf

def l2_angle_distance(pred, tgt):
    with tf.name_scope("L2AngleDistance"):
        # Scaling factor: the number of finite (non-NaN) entries in one slice of tgt
        count = tgt[...,0,0]
        scale = tf.to_float(tf.count_nonzero(tf.is_finite(count)))
        # Mask NaN in tgt: wherever tgt is NaN, substitute pred so that slot contributes zero loss
        tgt = tf.where(tf.is_nan(tgt), pred, tgt)
        # Calculate the elementwise cosine distances
        losses = tf.losses.cosine_distance(pred, tgt, -1, reduction=tf.losses.Reduction.NONE)
        # Square the losses, then sum, to get the L2 scalar loss.
        # Divide the loss result by the scaling factor.
        return tf.reduce_sum(losses * losses) / scale

My target values (tgt) can include NaN, because my protein sequences are passed in a 4D Tensor even though the individual sequences differ in length. Before you ask: the data can't be resampled like an image. So I use NaN in the tgt Tensor to mean "no prediction needed here." Before I calculate the L2 cosine loss, I replace every NaN with the matching value from the prediction (pred), so the loss contributed by every NaN position is always zero.
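To illustrate, here is the masking step in isolation (a minimal sketch using the same TF 1.x API as the loss above; the tensor values are made up):

import numpy as np
import tensorflow as tf

pred = tf.constant([[0.5, 0.8], [0.1, 0.9]])
tgt  = tf.constant([[1.0, np.nan], [np.nan, 1.0]])
# Wherever tgt is NaN, take the value from pred instead, so pred - tgt
# (and any distance between them) is exactly zero at those positions.
masked_tgt = tf.where(tf.is_nan(tgt), pred, tgt)

with tf.Session() as sess:
    print(sess.run(masked_tgt))  # [[1.0, 0.8], [0.1, 1.0]]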

Now, how can I re-use this function in Keras? It appears that the Keras Lambda core layer is not a good choice, because a Lambda only takes a single argument, and a loss function needs two arguments.

Alternatively, can I rewrite this function in Keras? I shouldn't ever need to use the Theano or CNTK backend, so rewriting my function in Keras isn't strictly necessary. I'll use whatever works.

I just looked at the Keras losses.py file to get some clues. I imported keras.backend and had a look around. I also found https://keras.io/backend/. I can't seem to find wrappers for ANY of the TensorFlow calls I happen to use: to_float(), count_nonzero(), is_finite(), where(), is_nan(), cosine_distance(), or reduce_sum().
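For the record, the closest counterparts I could identify cover only some of those calls (a partial, best-guess mapping; K is keras.backend):

import keras.backend as K

# tf.to_float(x)        -> K.cast(x, 'float32')
# tf.reduce_sum(x)      -> K.sum(x)
# tf.where(cond, a, b)  -> K.switch(cond, a, b)
# tf.is_nan(), tf.is_finite(), tf.count_nonzero(),
# tf.losses.cosine_distance() -> no backend wrapper that I can find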

Thanks for your suggestions!

asked Jan 01 '18 by John Ladasky


People also ask

How do I use a custom loss function in Keras?

A custom loss function can be created by defining a function that takes the true values and predicted values as required parameters. The function should return an array of losses, and it can then be passed to the model at the compile stage.
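A minimal sketch of that pattern (the loss name is a placeholder, and model is assumed to be an already-built Keras model):

import keras.backend as K

def my_custom_mse(y_true, y_pred):
    # Return one loss value per sample; Keras averages these over the batch.
    return K.mean(K.square(y_pred - y_true), axis=-1)

model.compile(loss=my_custom_mse, optimizer='adam')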

When should you create a custom layer versus a custom model?

If you are building a new model architecture out of existing Keras/TF layers, build a custom model. If you are implementing your own custom tensor operations within a layer, build a custom layer.
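For example, a custom tensor operation wrapped in a layer might look like this (a hypothetical sketch using the Keras 2 Layer API; the layer name is made up):

from keras.layers import Layer

class ScaleLayer(Layer):
    # A custom tensor op inside a layer: multiply inputs by one learned scalar.
    def build(self, input_shape):
        self.scale = self.add_weight(name='scale', shape=(1,),
                                     initializer='ones', trainable=True)
        super(ScaleLayer, self).build(input_shape)

    def call(self, inputs):
        return inputs * self.scale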

What is a loss function in Keras?

A loss function is one of the two arguments required for compiling a Keras model:

from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential()
model.add(layers.Dense(64, kernel_initializer='uniform', input_shape=(10,)))


2 Answers

I answered my own question. I'm posting the solution for anyone who may come across this same problem.

I tried using my TF loss function directly in Keras, as Matias Valdenegro independently suggested. Doing so did not provoke any errors from Keras; however, the loss value immediately went to NaN.

Eventually I identified the problem. The calling convention for a Keras loss function is y_true first (which I called tgt), then y_pred (my pred). But the calling convention for a TensorFlow loss function is pred first, then tgt. So if you want to keep a TensorFlow-native version of the loss function around, this fix works:

def keras_l2_angle_distance(tgt, pred):
    return l2_angle_distance(pred, tgt)

<snip>

model.compile(loss = keras_l2_angle_distance, optimizer = "something")
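Equivalently, the argument swap can be inlined with a lambda, though a named function (as above) is easier to refer to later, e.g. when reloading a saved model:

model.compile(loss = lambda y_true, y_pred: l2_angle_distance(y_pred, y_true), optimizer = "something")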

Maybe Theano or CNTK uses the same parameter order as Keras; I don't know. But I'm back in business.

answered Nov 05 '22 by John Ladasky


You don't need to use keras.backend: since your loss is written directly in TensorFlow, you can use it in Keras as-is. The backend functions are an abstraction layer that lets you write a loss/layer that will work with any of the multiple backends available to Keras.

You just have to put your loss in the model.compile call:

model.compile(loss = l2_angle_distance, optimizer = "something")
answered Nov 05 '22 by Dr. Snoopy