Cross Entropy Loss for Semantic Segmentation Keras

Tags:

I'm pretty sure this is a silly question but I can't find it anywhere else so I'm going to ask it here.

I'm doing semantic image segmentation using a cnn (unet) in keras with 7 labels. So my label for each image is (7,n_rows,n_cols) using the theano backend. So across the 7 layers for each pixel, it's one-hot encoded. In this case, is the correct error function to use categorical cross-entropy? It seems that way to me but the network seems to learn better with binary cross-entropy loss. Can someone shed some light on why that would be and what the principled objective is?

866

asked Feb 08 '17 16:02

TSW

1 Answers

Binary cross-entropy loss should be used with sigmod activation in the last layer and it severely penalizes opposite predictions. It does not take into account that the output is a one-hot coded and the sum of the predictions should be 1. But as mis-predictions are severely penalizing the model somewhat learns to classify properly.

Now to enforce the prior of one-hot code is to use softmax activation with categorical cross-entropy. This is what you should use.

Now the problem is using the softmax in your case as Keras don't support softmax on each pixel.

The easiest way to go about it is permute the dimensions to (n_rows,n_cols,7) using Permute layer and then reshape it to (n_rows*n_cols,7) using Reshape layer. Then you can added the softmax activation layer and use crossentopy loss. The data should also be reshaped accordingly.

The other way of doing so will be to implement depth-softmax :

def depth_softmax(matrix):
    sigmoid = lambda x: 1 / (1 + K.exp(-x))
    sigmoided_matrix = sigmoid(matrix)
    softmax_matrix = sigmoided_matrix / K.sum(sigmoided_matrix, axis=0)
    return softmax_matrix

and use it as a lambda layer:

model.add(Deconvolution2D(7, 1, 1, border_mode='same', output_shape=(7,n_rows,n_cols)))
model.add(Permute(2,3,1))
model.add(BatchNormalization())
model.add(Lambda(depth_softmax))

If tf image_dim_ordering is used then you can do way with the Permute layers.

For more reference check here.

198

answered Sep 30 '22 17:09

indraforyou

Related questions
                            
                                Real TIme Image Processing (OCR) [closed]
                            
                                iOS Custom UIImagePickerController Camera Crop to Square
                            
                                Find extreme outer points in image with Python OpenCV
                            
                                How to use .predict_generator() on new Images - Keras
                            
                                How to programmatically change the hue of UIImage?
                            
                                How to "smart resize" a displayed image to original aspect ratio
                            
                                Applying a coloured overlay to an image in either PIL or Imagemagik
                            
                                Read Multiple images on a folder in OpenCv (python)
                            
                                Sharpen on a Bitmap using C#
                            
                                With the Python Imaging Library (PIL), how does one compose an image with an alpha channel over another image?
                            
                                Camera Module Focus Adjust using Contrast Transfer Function
                            
                                Detect clusters of circular objects by iterative adaptive thresholding and shape analysis
                            
                                Improve Tesseract OCR results with blurred text
                            
                                Fast algorithm for finding a small picture in big picture?
                            
                                How to connect the ends of edges in order to close the holes between them?
                            
                                Image processing for OCR with leptonica (inverse color text)
                            
                                Suggestions for digit recognition
                            
                                Compute gradient vector field of an image
                            
                                How to identify CMYK images in ASP.NET using C#
                            
                                what's the best image input type for tesseract?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cross Entropy Loss for Semantic Segmentation Keras

Tags:

image-processing

deep-learning

keras

image-segmentation

TSW

People also ask

1 Answers

indraforyou

Recent Activity

Donate For Us