Keras' Sequential vs Functional API for Multi-Task Learning Neural Network

Tags:

I would like to design a neural network for a multi-task deep learning task. Within the Keras API we can either use the "Sequential" or "Functional" approach to build such a neural network. Underneath I provide the code I used to build a network using both approaches to build a network with two outputs:

Sequential

seq_model = Sequential()
seq_model.add(LSTM(32, input_shape=(10,2)))
seq_model.add(Dense(8))
seq_model.add(Dense(2))
seq_model.summary()

Functional

input1 = Input(shape=(10,2))
lay1 = LSTM(32, input_shape=(10,2))(input1)
lay2 = Dense(8)(lay1)
out1 = Dense(1)(lay2)
out2 = Dense(1)(lay2)
func_model = Model(inputs=input1, outputs=[out1, out2])
func_model.summary()

When I look at both the summary outputs for the models, each of them contains identical number of trainable params:

Sequential and Functional .summary()

Up until now, this looks fine - however I start doubting myself when I plot both models (using keras.utils.plot_model) which results in the followings graphs: Sequential and Functional plot_model()

Personally I do not know how to interpret these. When using a multi-task learning approach, I want all neurons (in my case 8) of the layer before the output-layer to connect to both output neurons. For me this clearly shows in the Functional API (where I have two Dense(1) instances), but this is not very clear from the Sequential API. Nevertheless, the amount of trainable params is identical; suggesting that also the Sequential API the last layer is fully connected to both neurons in the Dense output layer.

Could anybody explain to me the differences between those two examples, or are those fully identical and result in the same neural network architecture? Also, which one would be preferred in this case?

Thank you a lot in advance.

745

asked Sep 25 '19 06:09

wptmdoorn

3 Answers

The difference between Sequential and functional keras API:

The sequential API allows you to create models layer-by-layer for most problems. It is limited in that it does not allow you to create models that share layers or have multiple inputs or outputs.

the functional API allows you to create models that have a lot more flexibility as you can easily define models where layers connect to more than just the previous and next layers. In fact, you can connect layers to (literally) any other layer. As a result, creating complex networks such as siamese networks and residual networks become possible.

To answer your question:

No these APIs are not the same and the number of layers is normal that are the same number. Which one to use? It depends on the use you want to make of this network. What are you doing the training for? What do you want the output to be?

I recommend this link to make the most of the concept.

Sequential Models & Functional Models

I hope I helped you understand better.

answered Nov 10 '22 09:11

Zrufy

Both models are (in theory) equivalent, as the two output nodes do not have any interaction between them.

It is just that the required outputs have a different shape

[(batch_size,2)]

[(batch_size,),(batch_size,)]

and thus, the loss will be different.

The total loss is averaged for the sequential model in this example, whereas it is summed up for the functional model with two outputs (at least with a default loss such as MSE).

Of course, you can also adapt the functional model to be exactly equivalent to the sequential model:

out1 = Dense(2)(lay2)
#out2 = Dense(1)(lay2)
func_model = Model(inputs=input1, outputs=out1)

Maybe you will also need some activations after the Dense layers.

answered Nov 10 '22 08:11

Max

Both networks are functionally equivalent. Dense layers are fully connected by definition, which is considered to be the most basic and simple design that can be assumed for "normal" neural networks not otherwise specified. The exact learned parameters and behavior may vary slightly based on the implementation. The graph presented is ambiguous only because it does not show the connection of the neurons (which may number in the millions), but rather provides a symbolic representation of the connectivity with its name (Dense), in this case indicating a fully connected layer.

I expect that the sequential model (or equivalent functional model using one dense layer with two neurons as the output) would be faster because it can use a simplified optimization path, but I have not tested this and I have no knowledge of the compile time optimizations performed by Tensorflow.

answered Nov 10 '22 08:11

11_22_33

Related questions
                            
                                What does the 'm' in a Python ABI tag mean?
                            
                                What is the difference between MLP implementation from scratch and in PyTorch?
                            
                                How to redirect -progress option output of ffmpeg to stderr?
                            
                                How to add calculated column to Dataframe counting frequency in column in pandas
                            
                                What is a time complexity of move_to_end operation for OrderedDict in Python 3?
                            
                                Multivariate polynomial regression with Python
                            
                                How to join nearby bounding boxes in OpenCV Python
                            
                                How to plot a vertical line at the x-axis range median position using plotly in Python API?
                            
                                Count of values grouped per month, year - Pandas
                            
                                Python: Dynamically import module's code from string with importlib
                            
                                gyp ERR! stack Error: Can't find Python executable
                            
                                Multiple aggregated Counting in Pandas
                            
                                How to keep only the consecutive values in a Pandas dataframe using Python
                            
                                AttributeError: module 'torch' has no attribute '_six'. Bert model in Pytorch
                            
                                Snakemake using a rule in a loop
                            
                                How do I upload to a shared drive in Python with Google Drive API v3?
                            
                                pytables writes much faster than h5py. Why?
                            
                                How to convert NumPy arrays obtained from cv2.findContours to Shapely polygons?
                            
                                Why is Altair returning an empty chart when using log scale?
                            
                                Python: how to override type hint on an instance attribute in a subclass?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Keras' Sequential vs Functional API for Multi-Task Learning Neural Network

Tags:

python

functional-programming

neural-network

keras

sequential

wptmdoorn

People also ask

3 Answers

Zrufy

Max

11_22_33

Recent Activity

Donate For Us