Reading through the documentation of implementing custom layers with <code>tf.keras</code>, they specify two options to inherit from, <code>tf.keras.Layer</code> and <code>tf.keras.Model</code>. Under the context of creating custom layers, I'm asking myself what is the difference between these two? Technically what is different? If I were to implement the transformer encoder for example, which one would be more suitable? (assuming the transformer is a only a "layer" in my full model)

In the documentation: <blockquote> The Model class has the same API as Layer, with the following differences: - It exposes built-in training, evaluation, and prediction loops (model.fit(), model.evaluate(), model.predict()). - It exposes the list of its inner layers, via the model.layers property. - It exposes saving and serialization APIs. Effectively, the "Layer" class corresponds to what we refer to in the literature as a "layer" (as in "convolution layer" or "recurrent layer") or as a "block" (as in "ResNet block" or "Inception block"). Meanwhile, the "Model" class corresponds to what is referred to in the literature as a "model" (as in "deep learning model") or as a "network" (as in "deep neural network"). </blockquote> So if you want to be able to call <code>.fit()</code>, <code>.evaluate()</code>, or <code>.predict()</code> on those blocks or you want to be able to save and load those blocks separately or something you should use the Model class. The Layer class is leaner so you won't bloat the layers with unnecessary functionality...but I would guess that that generally wouldn't be a big problem.

TensorFlow - Difference between tf.keras.layers.Layer vs tf.keras.Model

1 Answers

In the documentation:

The Model class has the same API as Layer, with the following differences: - It exposes built-in training, evaluation, and prediction loops (model.fit(), model.evaluate(), model.predict()). - It exposes the list of its inner layers, via the model.layers property. - It exposes saving and serialization APIs.

Effectively, the "Layer" class corresponds to what we refer to in the literature as a "layer" (as in "convolution layer" or "recurrent layer") or as a "block" (as in "ResNet block" or "Inception block").

Meanwhile, the "Model" class corresponds to what is referred to in the literature as a "model" (as in "deep learning model") or as a "network" (as in "deep neural network").

So if you want to be able to call .fit(), .evaluate(), or .predict() on those blocks or you want to be able to save and load those blocks separately or something you should use the Model class. The Layer class is leaner so you won't bloat the layers with unnecessary functionality...but I would guess that that generally wouldn't be a big problem.

127

answered Oct 20 '22 16:10

enumaris

Related questions
                            
                                Run Flask dev server over HTTPS using CLI
                            
                                module 'snappy' has no attribute 'decompress'
                            
                                How to extract countries from a text?
                            
                                How to delete last n rows from Numpy array?
                            
                                Manager isn't available; 'auth.User' has been swapped for 'polls.User'
                            
                                Why heappop time complexity is O(logn) (not O(n)) in python?
                            
                                How to add progress bar?
                            
                                Decode UTF-8 encoding in JSON string
                            
                                dateutil 2.5.0 is the minimum required version
                            
                                What it really is @client.event? discord.py
                            
                                Analyzing seasonality of Google trend time series using FFT
                            
                                Python 2d array boolean reduction
                            
                                Python: How to read and load an excel file from AWS S3?
                            
                                Pandas convert strings to numeric if possible; else keep string values
                            
                                "Failed building wheel for regex" while installing pip package
                            
                                Get multiple Key/Values in Redis with Python
                            
                                How to get apache beam for dataflow GCP on Python 3.x
                            
                                Printing contents of a Queue in Python
                            
                                How to install dlib for python on mac?
                            
                                Redis - Python example of xadd and xread

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

TensorFlow - Difference between tf.keras.layers.Layer vs tf.keras.Model

Tags:

python

tensorflow

keras

bluesummers

People also ask

1 Answers

enumaris

Recent Activity

Donate For Us