TensorFlow 2.0 Keras layers with custom tensors as variables

Tags:

In TF 1.x, it was possible to build layers with custom variables. Here's an example:

import numpy as np
import tensorflow as tf

def make_custom_getter(custom_variables):
    def custom_getter(getter, name, **kwargs):
        if name in custom_variables:
            variable = custom_variables[name]
        else:
            variable = getter(name, **kwargs)
        return variable
    return custom_getter

# Make a custom getter for the dense layer variables.
# Note: custom variables can result from arbitrary computation;
#       for the sake of this example, we make them just constant tensors.
custom_variables = {
    "model/dense/kernel": tf.constant(
        np.random.rand(784, 64), name="custom_kernel", dtype=tf.float32),
    "model/dense/bias": tf.constant(
        np.random.rand(64), name="custom_bias", dtype=tf.float32),
}
custom_getter = make_custom_getter(custom_variables)

# Compute hiddens using a dense layer with custom variables.
x = tf.random.normal(shape=(1, 784), name="inputs")
with tf.variable_scope("model", custom_getter=custom_getter):
    Layer = tf.layers.Dense(64)
    hiddens = Layer(x)

print(Layer.variables)

The printed variables of the constructed dense layer will be custom tensors we specified in the custom_variables dict:

[<tf.Tensor 'custom_kernel:0' shape=(784, 64) dtype=float32>, <tf.Tensor 'custom_bias:0' shape=(64,) dtype=float32>]

This allows us to create layers/models that use provided tensors in custom_variables directly as their weights, so that we could further differentiate the output of the layers/models with respect to any tensors that custom_variables may depend on (particularly useful for implementing functionality in modulating sub-nets, parameter generation, meta-learning, etc.).

Variable scopes used to make it easy to nest all off graph-building inside scopes with custom getters and build models on top of the provided tensors as their parameters. Since sessions and variable scopes are no longer advisable in TF 2.0 (and all of that low-level stuff is moved to tf.compat.v1), what would be the best practice to implement the above using Keras and TF 2.0?

(Related issue on GitHub.)

424

asked Oct 07 '19 18:10

maruan

1 Answers

Answer based on the comment below

Given you have:

kernel = createTheKernelVarBasedOnWhatYouWant() #shape (784, 64)
bias = createTheBiasVarBasedOnWhatYouWant() #shape (64,)

Make a simple function copying the code from Dense:

def custom_dense(x):
    inputs, kernel, bias = x

    outputs = K.dot(inputs, kernel)
    outputs = K.bias_add(outputs, bias, data_format='channels_last')
    return outputs

Use the function in a Lambda layer:

layer = Lambda(custom_dense)
hiddens = layer([x, kernel, bias])

Warning: kernel and bias must be produced from a Keras layer, or come from an kernel = Input(tensor=the_kernel_var) and bias = Input(tensor=bias_var)

If the warning above is bad for you, you can always use kernel and bias "from outside", like:

def custom_dense(inputs):
    outputs = K.dot(inputs, kernel) #where kernel is not part of the arguments anymore
    outputs = K.bias_add(outputs, bias, data_format='channels_last')
    return outputs

layer = Lambda(custom_dense)
hiddens = layer(x)

This last option makes it a bit more complicated to save/load models.

Old answer

You should probably use a Keras Dense layer and set its weights in a standard way:

layer = tf.keras.layers.Dense(64, name='the_layer')
layer.set_weights([np.random.rand(784, 64), np.random.rand(64)])

If you need that these weights are not trainable, before compiling the keras model you set:

model.get_layer('the_layer').trainable=False

If you want direct access to the variables as tensors, they are:

kernel = layer.kernel    
bias = layer.bias

There are plenty of other options, but that depends on your exact intention, which is not clear in your question.

101

answered Nov 03 '22 00:11

Daniel Möller

Related questions
                            
                                Lenient JSON Parser for Python
                            
                                Pandas dataframe: How to set values after an index to 0
                            
                                python signalR - 500 Server Error when trying to connect
                            
                                Loop over a tensor and apply function to each element
                            
                                Create a new column based on previous row value and delete the current row
                            
                                Break on exception in Chrome using Selenium
                            
                                How to apply the describe function after grouping a PySpark DataFrame?
                            
                                "SyntaxError: Generator expression must be parenthesized" when trying to use conda
                            
                                How does pandas treat timezone when reading from a CSV file?
                            
                                How to get nodes coordinates in graph rendered with graphviz
                            
                                How to fix "Fatal error in launcher: Unable to create process using *path*/scrapy.exe" in anaconda? [duplicate]
                            
                                Django sessions: How to include session ID in every log record?
                            
                                Fill up missing datetime with NaN or supress straight line in line plot
                            
                                How to get the python import tree [closed]
                            
                                Celery Redis instance filling up despite queue looking empty
                            
                                Celery - How to update the state of a task after the worker's being shutdown?
                            
                                "Module not found" when importing a Python package within a plpython3u procedure
                            
                                Submit a Form in a Single Page Application using Flask Without Reloading
                            
                                Keras Autoencoder: Tying Weights from Encoder To Decoder not working
                            
                                String Matching with wildcard in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

TensorFlow 2.0 Keras layers with custom tensors as variables

Tags:

python

tensorflow

keras

tensorflow2.0