 

How to initialize a convolution layer with an arbitrary kernel in Keras?

I want to initialize a convolution layer with a specific kernel that is not built into Keras. For instance, suppose I define the function below to initialize the kernel:

def init_f(shape):
      ker=np.zeros((shape,shape))
      ker[int(np.floor(shape/2)),int(np.floor(shape/2))]=1
      return ker

And the convolution layer is designed as follows:

model.add(Conv2D(filters=32, kernel_size=(3,3),
                      kernel_initializer=init_f(3)))

I get the error:

Could not interpret initializer identifier

I have followed a similar issue at https://groups.google.com/forum/#!topic/keras-users/J46pplO64-8, but I could not adapt it to my code. Could you please help me define an arbitrary kernel in Keras?

asked Jun 20 '17 by Alireza Esmailzehi


People also ask

What is kernel initializer in keras?

Initializers define the way to set the initial random weights of Keras layers. The keyword arguments used for passing initializers to layers depend on the layer; usually they are simply kernel_initializer and bias_initializer.
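
A minimal sketch of that usage, assuming TensorFlow 2.x (the Dense layer and the parameter values here are illustrative):

from tensorflow.keras import layers
from tensorflow.keras import initializers

# Pass an initializer object (or its string name) via the keyword arguments.
layer = layers.Dense(
    units=64,
    kernel_initializer=initializers.RandomNormal(stddev=0.01),
    bias_initializer=initializers.Zeros(),
)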

What is kernel initializer He_uniform?

The he_uniform initializer draws samples from a uniform distribution within [-limit, limit], where limit = sqrt(6 / fan_in) and fan_in is the number of input units in the weight tensor.
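
As a quick numeric sanity check (fan_in = 100 is just an assumed example):

import numpy as np

fan_in = 100                   # assumed example value
limit = np.sqrt(6.0 / fan_in)  # he_uniform bound
print(limit)                   # ~0.245: weights are drawn from U(-limit, limit)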

What is He_Normal kernel initializer?

The he_normal initializer draws samples from a truncated normal distribution centered on 0 with stddev = sqrt(2 / fan_in), where fan_in is the number of input units in the weight tensor.
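
A small sketch that checks this empirically, assuming TensorFlow 2.x (the shape (100, 64) is an arbitrary example):

import numpy as np
from tensorflow.keras import initializers

fan_in = 100  # assumed example value
w = initializers.HeNormal()(shape=(fan_in, 64)).numpy()
# The empirical std should be close to the theoretical sqrt(2 / fan_in).
print(w.std(), np.sqrt(2.0 / fan_in))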

What is the default weight initialization in keras?

The default is the Glorot uniform initializer (glorot_uniform). It draws samples from a uniform distribution within [-limit, limit], where limit = sqrt(6 / (fan_in + fan_out)), fan_in is the number of input units in the weight tensor, and fan_out is the number of output units.
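
You can confirm the default on your own install with a quick check, assuming TF 2.x (the exact serialized form varies across Keras versions):

from tensorflow.keras import layers

# A fresh layer's config records its kernel initializer; by default
# it is Glorot uniform.
print(layers.Dense(8).get_config()["kernel_initializer"])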


1 Answer

A few items to fix. Let's start with the kernel initializer. From the documentation:

If passing a custom callable, then it must take the argument shape (shape of the variable to initialize) and dtype (dtype of generated values)

So the signature should become:

def init_f(shape, dtype=None)

The function will work without dtype, but it's good practice to keep it there. That way you can pass the dtype on to the calls inside your function, e.g.:

np.zeros(shape, dtype=dtype)

This also addresses your second issue: the shape argument is a tuple, so you just need to pass it straight to np.zeros and don't need to make another tuple.
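
If you want to see exactly what the layer passes in, a throwaway initializer can print it (debug_init is just an illustrative name):

import numpy as np

def debug_init(shape, dtype=None):
    # For Conv2D(filters=32, kernel_size=(3, 3)) on a 1-channel input,
    # this prints the full kernel shape, e.g. (3, 3, 1, 32).
    print(shape)
    return np.zeros(shape, dtype=dtype)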

I'm guessing you're trying to initialize the kernel with a 1 in the middle, so you could also generalize your function to work with whatever shape it receives:

ker[tuple(map(lambda x: int(np.floor(x / 2)), ker.shape))] = 1

Putting it all together:

import numpy as np

def init_f(shape, dtype=None):
    # Zeros everywhere except a single 1 at the center of the kernel tensor.
    ker = np.zeros(shape, dtype=dtype)
    ker[tuple(map(lambda x: int(np.floor(x / 2)), ker.shape))] = 1
    return ker

One last problem: you need to pass the function itself to the layer, not the result of calling it:

model.add(Conv2D(filters=32, kernel_size=(3,3),
                  kernel_initializer=init_f))

The layer will call init_f with the appropriate arguments when it builds its weights.
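
A complete, runnable version, assuming TensorFlow 2.x (the 28x28x1 input shape is an arbitrary example):

import numpy as np
from tensorflow.keras import Input, Sequential
from tensorflow.keras.layers import Conv2D

def init_f(shape, dtype=None):
    ker = np.zeros(shape, dtype=dtype)
    ker[tuple(map(lambda x: int(np.floor(x / 2)), ker.shape))] = 1
    return ker

model = Sequential([
    Input(shape=(28, 28, 1)),  # assumed input size
    Conv2D(filters=32, kernel_size=(3, 3), kernel_initializer=init_f),
])

# The Conv2D kernel now holds the values produced by init_f.
print(model.layers[0].get_weights()[0].shape)  # (3, 3, 1, 32)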

answered Sep 28 '22 by ggallo