How to train the network only on one output when there are multiple outputs?

I am using a multiple-output model in Keras:

model1 = Model(input=x, output=[y2, y3])

model1.compile(optimizer='sgd', loss=custom_loss)

My custom_loss function is:

from keras import backend as K

def custom_loss(y_true, y_pred):
    y2_pred = y_pred[0]
    y2_true = y_true[0]

    loss = K.mean(K.square(y2_true - y2_pred), axis=-1)
    return loss

I only want to train the network on output y2.

What is the shape/structure of the y_true and y_pred arguments in the loss function when multiple outputs are used? Can I access them as above? Is it y_pred[0] or y_pred[:,0]?

asked by shaaa


3 Answers

I only want to train the network on output y2.

Based on the Keras functional API guide, you can achieve that with:

model1 = Model(input=x, output=[y2, y3])
model1.compile(optimizer='sgd', loss=custom_loss,
               loss_weights=[1., 0.])
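
Depending on your Keras version, you may also be able to skip an output entirely by passing None as its loss; a minimal sketch, assuming Keras 2.x (this variant is not from the question):

# Outputs whose loss is None are skipped during training, so no
# targets need to be supplied for y3 when calling fit.
model1 = Model(input=x, output=[y2, y3])
model1.compile(optimizer='sgd', loss=[custom_loss, None])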

What is the shape/structure of the y_true and y_pred arguments in the loss function when multiple outputs are used? Can I access them as above? Is it y_pred[0] or y_pred[:,0]?

In Keras multi-output models, the loss function is applied to each output separately, so inside custom_loss the y_true and y_pred arguments are the tensors of a single output with shape (batch_size, ...), not a list of all outputs. That means y_pred[0] indexes into the batch, while y_pred[:,0] selects that output's first feature column. In pseudo-code:

loss = sum(loss_function(output_true, output_pred)
           for (output_true, output_pred) in zip(outputs_data, outputs_model))

As far as I can tell, Keras offers no way to apply one loss function jointly to multiple outputs. One could probably achieve that by incorporating the loss computation as a layer of the network, as sketched below.
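
A minimal sketch of that loss-in-the-graph idea, using add_loss on a tf.keras functional model; all shapes and layer sizes here are illustrative assumptions, not taken from the question:

from tensorflow.keras import layers, Model, backend as K

# Hypothetical network: one input, two heads y2 and y3.
x_in = layers.Input(shape=(16,))
h = layers.Dense(32, activation='relu')(x_in)
y2 = layers.Dense(1, name='y2')(h)
y3 = layers.Dense(4, name='y3')(h)

# Feed the ground truth for y2 in as an extra input, so the loss is
# computed inside the graph and may freely mix several outputs.
y2_true_in = layers.Input(shape=(1,))

model = Model([x_in, y2_true_in], [y2, y3])
model.add_loss(K.mean(K.square(y2_true_in - y2)))
model.compile(optimizer='sgd')  # no per-output loss needed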

answered by Sharapolas


The accepted answer won't work in general if the custom loss can't be applied to the outputs you're trying to ignore, e.g. if they have the wrong shapes. In that case you can assign a dummy loss function to those outputs:

labels = [labels_for_relevant_output, dummy_labels_for_ignored_output]

def dummy_loss(y_true, y_pred):
    return 0.0

model.compile(optimizer='sgd', loss=[custom_loss, dummy_loss])
model.fit(x, labels)
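
For concreteness, a sketch of how the dummy labels might be built; the (num_samples, 4) shape is purely an illustrative assumption:

import numpy as np

# The values never matter: dummy_loss always returns 0, so these
# targets cannot influence the gradients. Only the shape has to be
# one Keras accepts for the ignored output.
num_samples = len(labels_for_relevant_output)  # hypothetical
dummy_labels_for_ignored_output = np.zeros((num_samples, 4))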
answered by Elan


Sharapolas' answer is right.

However, there is a better way than using a layer to build custom loss functions with complex interdependence between several outputs of a model.

The method I know is used in practice never calls model.compile, only model._make_predict_function(). From there, you can build a custom optimizer routine by accessing model.output, which gives you all outputs, [y2, y3] in your case. Once you have done your magic with them, get an optimizer from keras.optimizers and use its get_updates method with your model.trainable_weights and your loss. Finally, build a keras.backend.function from a list of the required inputs (in your case only model.input) and the updates you just got from the optimizer.get_updates call. That function now replaces model.fit; see the sketch below.
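
A rough sketch of that recipe, assuming multi-backend Keras 2.x and a single-input model; every name below is illustrative rather than taken from the question:

from keras import backend as K
from keras.optimizers import SGD

# Placeholder for the ground truth of the one output we train on.
y2_true = K.placeholder(shape=(None, 1))

y2_pred = model1.outputs[0]                # first of [y2, y3]
loss = K.mean(K.square(y2_true - y2_pred))

optimizer = SGD(lr=0.01)
updates = optimizer.get_updates(loss=loss,
                                params=model1.trainable_weights)

# One call to train_step performs a single gradient update; this is
# what replaces model.fit in the scheme described above.
train_step = K.function([model1.input, y2_true], [loss], updates=updates)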

The above is often used in policy gradient algorithms such as A3C or PPO. Here is an example of what I tried to explain: https://github.com/Hyeokreal/Actor-Critic-Continuous-Keras/blob/master/a2c_continuous.py Look at the build_model and critic_optimizer methods, and read the keras.backend.function documentation to understand what happens.

I found this approach frequently runs into problems with session management, and it does not appear to work in tf-2.0 Keras at all currently. Hence, if anyone knows a method, please let me know. I came here looking for one :)

answered by Nric