I am trying to change the activation function of the last layer of a Keras model without replacing the whole layer; in this case, only the softmax function:
import keras.backend as K
from keras.models import load_model
from keras.preprocessing.image import load_img, img_to_array
import numpy as np

model = load_model(model_path)  # load any model
img = load_img(img_path, target_size=(224, 224))
img = img_to_array(img)
img = np.expand_dims(img, axis=0)  # add a batch dimension: (1, 224, 224, 3)
print(model.predict(img))
My output:
array([[1.53172877e-07, 7.13159451e-08, 6.18941920e-09, 8.52070968e-07,
1.25813088e-07, 9.98970985e-01, 1.48254022e-08, 6.09538893e-06,
1.16236095e-07, 3.91888688e-10, 6.29304608e-08, 1.79565995e-09,
1.75571788e-08, 1.02110009e-03, 2.14380114e-09, 9.54465733e-08,
1.05938483e-07, 2.20544337e-07]], dtype=float32)
Then I do this to change the activation:
model.layers[-1].activation = custom_softmax
print(model.predict(img))
and the output I get is exactly the same. Any ideas how to fix this? Thanks!
You could try to use the custom_softmax below:
def custom_softmax(x, axis=-1):
    """Softmax activation function.

    # Arguments
        x: Tensor.
        axis: Integer, axis along which the softmax normalization is applied.

    # Returns
        Tensor, output of softmax transformation.

    # Raises
        ValueError: In case `dim(x) == 1`.
    """
    ndim = K.ndim(x)
    if ndim >= 2:
        # Deliberately returns zeros so a successful activation swap is obvious in the output
        return K.zeros_like(x)
    else:
        raise ValueError('Cannot apply softmax to a tensor that is 1D')
At the current state of things there's no official, clean way to do that. As pointed out by @layser in the comments, the TensorFlow graph isn't being updated, which is why your output doesn't change. One option is to use keras-vis' utils. My recommendation is to isolate that in your own utils.py, like so:
from vis.utils.utils import apply_modifications

def update_layer_activation(model, activation, index=-1):
    model.layers[index].activation = activation
    return apply_modifications(model)
Which you would then use like this:
model = update_layer_activation(model, custom_softmax)
If you follow the given link, you'll see what they do is quite simple: they save the model to a temporary path, load it back and return it, and finally delete the temp file.