Implementing contrastive loss and triplet loss in Tensorflow

1 Answers

Update (2018/03/19): I wrote a blog post detailing how to implement triplet loss in TensorFlow.

You need to implement yourself the contrastive loss or the triplet loss, but once you know the pairs or triplets this is quite easy.

Contrastive Loss

Suppose you have as input the pairs of data and their label (positive or negative, i.e. same class or different class). For instance you have images as input of size 28x28x1:

left = tf.placeholder(tf.float32, [None, 28, 28, 1]) right = tf.placeholder(tf.float32, [None, 28, 28, 1]) label = tf.placeholder(tf.int32, [None, 1]). # 0 if same, 1 if different margin = 0.2  left_output = model(left)  # shape [None, 128] right_output = model(right)  # shape [None, 128]  d = tf.reduce_sum(tf.square(left_output - right_output), 1) d_sqrt = tf.sqrt(d)  loss = label * tf.square(tf.maximum(0., margin - d_sqrt)) + (1 - label) * d  loss = 0.5 * tf.reduce_mean(loss)

Triplet Loss

Same as with contrastive loss, but with triplets (anchor, positive, negative). You don't need labels here.

anchor_output = ...  # shape [None, 128] positive_output = ...  # shape [None, 128] negative_output = ...  # shape [None, 128]  d_pos = tf.reduce_sum(tf.square(anchor_output - positive_output), 1) d_neg = tf.reduce_sum(tf.square(anchor_output - negative_output), 1)  loss = tf.maximum(0., margin + d_pos - d_neg) loss = tf.reduce_mean(loss)

The real trouble when implementing triplet loss or contrastive loss in TensorFlow is how to sample the triplets or pairs. I will focus on generating triplets because it is harder than generating pairs.

The easiest way is to generate them outside of the Tensorflow graph, i.e. in python and feed them to the network through the placeholders. Basically you select images 3 at a time, with the first two from the same class and the third from another class. We then perform a feedforward on these triplets, and compute the triplet loss.

The issue here is that generating triplets is complicated. We want them to be valid triplets, triplets with a positive loss (otherwise the loss is 0 and the network doesn't learn).
To know whether a triplet is good or not you need to compute its loss, so you already make one feedforward through the network...

Clearly, implementing triplet loss in Tensorflow is hard, and there are ways to make it more efficient than sampling in python but explaining them would require a whole blog post !

122

answered Sep 19 '22 22:09

Olivier Moindrot

Related questions
                            
                                How to understand loss acc val_loss val_acc in Keras model fitting
                            
                                What is the meaning of the "None" in model.summary of KERAS?
                            
                                How to use tf.while_loop() in tensorflow
                            
                                What is the difference between model.fit() an model.evaluate() in Keras?
                            
                                Adam optimizer goes haywire after 200k batches, training loss grows
                            
                                TensorFlow 'module' object has no attribute 'global_variables_initializer'
                            
                                Illegal instruction (core dumped) after running import tensorflow
                            
                                What is the best way to implement weight constraints in TensorFlow?
                            
                                Keras: How to get layer shapes in a Sequential model
                            
                                Unknown initializer: GlorotUniform when loading Keras model
                            
                                Keras difference between generator and sequence
                            
                                What is the difference between Keras and tf.keras in TensorFlow 1.1+?
                            
                                What are the differences between all these cross-entropy losses in Keras and TensorFlow?
                            
                                looking for source code of from gen_nn_ops in tensorflow
                            
                                TensorFlow operator overloading
                            
                                TensorFlow wasn't compiled to use SSE (etc.) instructions, but these are available
                            
                                TensorFlow: questions regarding tf.argmax() and tf.equal()
                            
                                keras tensorboard: plot train and validation scalars in a same figure
                            
                                How do I split Tensorflow datasets?
                            
                                How to understand the term `tensor` in TensorFlow?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Implementing contrastive loss and triplet loss in Tensorflow

Tags:

tensorflow

deep-learning

Tiago Freitas Pereira

People also ask

1 Answers

Contrastive Loss

Triplet Loss

Olivier Moindrot

Recent Activity

Donate For Us