Average weights in keras models

Tags:

How to average weights in Keras models, when I train few models with the same architecture with different initialisations?

Now my code looks something like this?

datagen = ImageDataGenerator(rotation_range=15,
                             width_shift_range=2.0/28,
                             height_shift_range=2.0/28
                            )

epochs = 40 
lr = (1.234e-3)
optimizer = Adam(lr=lr)

main_input = Input(shape= (28,28,1), name='main_input')

sub_models = []

for i in range(5):

    x = Conv2D(32, kernel_size=(3,3), strides=1)(main_input)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = MaxPool2D(pool_size=2)(x)

    x = Conv2D(64, kernel_size=(3,3), strides=1)(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = MaxPool2D(pool_size=2)(x)

    x = Conv2D(64, kernel_size=(3,3), strides=1)(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)

    x = Flatten()(x)

    x = Dense(1024)(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Dropout(0.1)(x)

    x = Dense(256)(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    x = Dropout(0.4)(x)

    x = Dense(10, activation='softmax')(x)

    sub_models.append(x)

x = keras.layers.average(sub_models)

main_output = keras.layers.average(sub_models)

model = Model(inputs=[main_input], outputs=[main_output])

model.compile(loss='categorical_crossentropy', metrics=['accuracy'],
              optimizer=optimizer)

print(model.summary())

plot_model(model, to_file='model.png')

filepath="weights.best.hdf5"
checkpoint = ModelCheckpoint(filepath, monitor='val_acc', verbose=1, save_best_only=True, mode='max')
tensorboard = TensorBoard(log_dir='./Graph', histogram_freq=0, write_graph=True, write_images=True)
callbacks = [checkpoint, tensorboard]

model.fit_generator(datagen.flow(X_train, y_train, batch_size=128),
                    steps_per_epoch=len(X_train) / 128,
                    epochs=epochs,
                    callbacks=callbacks,
                    verbose=1,
                    validation_data=(X_test, y_test))

So now I average only last layer, but I want to average weights in all layers after training each one separately.

Thanks!

416

asked Jan 11 '18 16:01

Miłosz Bednarzak

Video Answer

1 Answers

So let's assume that models is a collection of your models. First - collect all weights:

weights = [model.get_weights() for model in models]

Now - create a new averaged weights:

new_weights = list()

for weights_list_tuple in zip(*weights):
    new_weights.append(
        [numpy.array(weights_).mean(axis=0)\
            for weights_ in zip(*weights_list_tuple)])

And what is left is to set these weights in a new model:

new_model.set_weights(new_weights)

Of course - averaging weights might be a bad idea, but in case you try - you should follow this approach.

128

answered Sep 27 '22 19:09

Marcin Możejko

Related questions
                            
                                TensorFlow 1.14.0 is not using GPU
                            
                                Fail to run word embedding example in tensorflow tutorial with GPUs
                            
                                tensorflow installation problems
                            
                                SGD with momentum in TensorFlow
                            
                                what is meaning of hook that used in tensorflow
                            
                                significance of "trainable" and "training" flag in tf.layers.batch_normalization
                            
                                get_config missing while loading previously saved model without custom layers
                            
                                Problems implementing an XOR gate with Neural Nets in Tensorflow
                            
                                Doing pairwise distance computation with TensorFlow
                            
                                using Tensorflow with Anaconda and PyCharm on Windows
                            
                                How to retrieve float_val from a PredictResponse object?
                            
                                How to change Keras/tensorflow version in Google colab?
                            
                                Is it possible to use TensorFlow C++ API on Windows?
                            
                                Tensorflow Object detection API: Print detected class as output to terminal
                            
                                tensorflow:Can save best model only with val_acc available, skipping
                            
                                TensorFlow (Mac OS X): can't determine number of CPU cores:
                            
                                Keras Custom loss function to pass arguments other than y_true and y_pred
                            
                                How does Keras 1d convolution layer work with word embeddings - text classification problem? (Filters, kernel size, and all hyperparameter)
                            
                                How can I convert TFRecords into numpy arrays?
                            
                                Keras Tensorflow - Exception while predicting from multiple threads

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Average weights in keras models

Tags:

neural-network

tensorflow

deep-learning

keras

keras-layer

Miłosz Bednarzak

People also ask

Video Answer

1 Answers

Marcin Możejko

Recent Activity

Donate For Us