I'm trying to implement a custom loss function for my CNN model. I found an IPython notebook that implements a custom loss function named Dice, as follows:
from keras import backend as K
from keras.losses import binary_crossentropy  # needed by bce_dice below

smooth = 1.  # note: shadowed by the smooth parameter of dice_coef

def dice_coef(y_true, y_pred, smooth=1):
    intersection = K.sum(y_true * y_pred, axis=[1, 2, 3])
    union = K.sum(y_true, axis=[1, 2, 3]) + K.sum(y_pred, axis=[1, 2, 3])
    return K.mean((2. * intersection + smooth) / (union + smooth), axis=0)

def bce_dice(y_true, y_pred):
    return binary_crossentropy(y_true, y_pred) - K.log(dice_coef(y_true, y_pred))

def true_positive_rate(y_true, y_pred):
    return K.sum(K.flatten(y_true) * K.flatten(K.round(y_pred))) / K.sum(y_true)

seg_model.compile(optimizer='adam',
                  loss=bce_dice,
                  metrics=['binary_accuracy', dice_coef, true_positive_rate])
I have never used the Keras backend before and really get confused by its matrix calculations. So, I created some tensors to see what's happening in the code:
val1 = np.arange(24).reshape((4, 6))
y_true = K.variable(value=val1)
val2 = np.arange(10,34).reshape((4, 6))
y_pred = K.variable(value=val2)
Now I run the dice_coef function:
result = K.eval(dice_coef(y_true=y_true, y_pred=y_pred))
print('result is:', result)
But it gives me this error:
ValueError: Invalid reduction dimension 2 for input with 2 dimensions. for 'Sum_32' (op: 'Sum') with input shapes: [4,6], [3] and with computed input tensors: input[1] = <1 2 3>.
Then I changed all of the [1,2,3] to -1, just like below:
def dice_coef(y_true, y_pred, smooth=1):
    intersection = K.sum(y_true * y_pred, axis=-1)
    # intersection = K.sum(y_true * y_pred, axis=[1,2,3])
    # union = K.sum(y_true, axis=[1,2,3]) + K.sum(y_pred, axis=[1,2,3])
    union = K.sum(y_true, axis=-1) + K.sum(y_pred, axis=-1)
    return K.mean((2. * intersection + smooth) / (union + smooth), axis=0)
Now it gives me a value.
result is: 14.7911625
Questions:

1. What is [1,2,3]?
2. Why does the code work when I change [1,2,3] to -1?
3. What does this dice_coef function do?
What is a "backend"? Keras is a model-level library, providing high-level building blocks for developing deep learning models. It does not handle itself low-level operations such as tensor products, convolutions and so on.
Just like in numpy, you can define the axis along which you want to perform a certain operation. For example, for a 4D array, we can sum along a specific axis like this:
>>> a = np.arange(150).reshape((2, 3, 5, 5))
>>> a.sum(axis=0).shape
(3, 5, 5)
>>> a.sum(axis=0, keepdims=True).shape
(1, 3, 5, 5)
>>> a.sum(axis=1, keepdims=True).shape
(2, 1, 5, 5)
If we feed a tuple, we can perform this operation along multiple axes.
>>> a.sum(axis=(1, 2, 3), keepdims=True).shape
(2, 1, 1, 1)
If the argument is -1, it defaults to performing the operation over the last axis, regardless of how many axes there are.
>>> a.sum(axis=-1, keepdims=True).shape
(2, 3, 5, 1)
This should have clarified points 1 and 2. Since the axis argument is [1, 2, 3], you need a minimum of 4 axes for the operation to be valid. Try changing your variables to something like val1 = np.arange(24).reshape((2, 2, 2, 3)) and it all works.
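For reference, here is a quick sketch of that fix applied to the snippet from the question (it assumes dice_coef is the original version with axis=[1,2,3], and the same imports as in the question):

import numpy as np
from keras import backend as K

# same 24 values, but shaped as a 4D tensor: (batch, height, width, channels)
val1 = np.arange(24).reshape((2, 2, 2, 3))
y_true = K.variable(value=val1)
val2 = np.arange(10, 34).reshape((2, 2, 2, 3))
y_pred = K.variable(value=val2)

# axis=[1,2,3] is now a valid reduction over the three non-batch axes
result = K.eval(dice_coef(y_true=y_true, y_pred=y_pred))
print('result is:', result)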
The model seems to calculate a combined binary cross-entropy / Dice loss, and dice_coef(), as the name suggests, calculates the Dice coefficient. I'm not sure what the purpose of smooth is, but if it were there to avoid division by zero, you'd expect a small number, like 1e-6.
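For intuition, here is a rough worked example of the Dice computation itself, using plain numpy on a made-up pair of flat binary masks (not the author's data):

import numpy as np

y_true = np.array([1., 1., 0., 0.])  # ground-truth mask
y_pred = np.array([1., 0., 0., 0.])  # predicted mask

smooth = 1
intersection = np.sum(y_true * y_pred)           # 1.0
union = np.sum(y_true) + np.sum(y_pred)          # 3.0
dice = (2. * intersection + smooth) / (union + smooth)
print(dice)  # (2*1 + 1) / (3 + 1) = 0.75; without smooth it would be 2/3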
What is [1,2,3]?
These numbers specify the dimensions over which we want to do the summation. The smallest number refers to the outermost dimension and the biggest to the innermost. See the example:
import tensorflow as tf
tf.enable_eager_execution()
a = tf.constant([[[1, 2], [3, 4]], [[5, 6], [7, 8]]])
print(tf.reduce_sum(a, axis=2).numpy())
#[[ 3 7]
# [11 15]]
print(tf.reduce_sum(a, axis=1).numpy())
#[[ 4 6]
# [12 14]]
print(tf.reduce_sum(a, axis=0).numpy())
#[[ 6 8]
# [10 12]]
In the above example, axis = 2 refers to the innermost entries, which are [1, 2], [3, 4], [5, 6], and [7, 8]. After summing each of them, we get the tensor [[3, 7], [11, 15]]. The same idea applies to the other axes.
Why does the code work when I change [1,2,3] to -1?
When we do not specify any axis, or equivalently specify all axes, we sum over every element of the tensor. As a result, the tensor is reduced to a single scalar. See the example:
a = tf.constant([[[1, 2], [3, 4]], [[5, 6], [7, 8]]])
print(tf.reduce_sum(a).numpy()) # 36
print(tf.reduce_sum(a, axis=[0,1,2]).numpy()) # 36
If we have 3 dimensions [0, 1, 2], then axis = -1 is equal to axis = 2, i.e. the last axis. See here for a complete tutorial on Python indexing.
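A quick sanity check of that equivalence, using the same tensor a as above:

print(tf.reduce_sum(a, axis=-1).numpy())
#[[ 3  7]
# [11 15]]
print(tf.reduce_sum(a, axis=2).numpy())  # identical output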
What does this dice_coef function do?
See here for a complete explanation of dice_coef.
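In case that link goes stale, here is a brief commented reading of the function from the question (assuming the usual (batch, height, width, channels) layout for segmentation masks):

def dice_coef(y_true, y_pred, smooth=1):
    # per-sample overlap between the two masks, summed over H, W and C
    intersection = K.sum(y_true * y_pred, axis=[1, 2, 3])
    # per-sample total "area" of both masks combined
    union = K.sum(y_true, axis=[1, 2, 3]) + K.sum(y_pred, axis=[1, 2, 3])
    # Dice = 2*|A ∩ B| / (|A| + |B|), smoothed (presumably against division by zero),
    # then averaged over the batch
    return K.mean((2. * intersection + smooth) / (union + smooth), axis=0)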