
What is "metrics" in Keras?

It is not yet clear to me what metrics are (as given in the code below). What exactly do they evaluate? Why do we need to define them in the model? Why can we have multiple metrics in one model? And, more importantly, what are the mechanics behind all this? Any scientific reference is also appreciated.

```python
model.compile(loss='mean_squared_error',
              optimizer='sgd',
              metrics=['mae', 'acc'])
```
asked Nov 15 '17 by DragonKnight



1 Answer

So, in order to understand what metrics are, it's good to start by understanding what a loss function is. Neural networks are mostly trained with gradient methods, through an iterative process of decreasing a loss function.

A loss is designed to have two crucial properties: first, the smaller its value, the better your model fits your data; second, it should be differentiable. Knowing this, we can define what a metric is: a function that, given predicted values and ground-truth values from your examples, provides you with a scalar measure of the "fitness" of your model to your data. So, as you may see, a loss function is a metric, but the opposite doesn't always hold. To understand these differences, let's look at the most common uses of metrics:

  1. Measure the performance of your network using non-differentiable functions: e.g. accuracy is not differentiable (not even continuous), so you cannot directly optimize your network w.r.t. it. However, you can use it to choose the model with the best accuracy.

  2. Obtain the values of the individual loss functions when your final loss is a combination of several of them: let's assume that your loss has a regularization term which measures how far your weights are from 0, and a term which measures the fitness of your model. In this case, you could use a metric to separately track how the fitness of your model changes across epochs.

  3. Track a measure with respect to which you don't want to directly optimize your model: let's assume that you are solving a multidimensional regression problem where you are mostly concerned about MSE, but at the same time you are interested in how the cosine distance of your solution changes over time. Then it's best to use a metric.
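The uses above can be sketched in a single `compile` call. This is a minimal, hedged example with a toy regression model and random data; `mae` and `cosine_similarity` are standard Keras metric names, and the gradient step only ever minimizes the MSE loss:

```python
import numpy as np
from tensorflow import keras

# Toy regression model: 4 inputs, 3 outputs
model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(8, activation="relu"),
    keras.layers.Dense(3),
])

model.compile(
    loss="mean_squared_error",              # what SGD actually minimizes
    optimizer="sgd",
    metrics=["mae", "cosine_similarity"],   # tracked per epoch, never optimized
)

# Random data, just to show the mechanics
x = np.random.rand(32, 4).astype("float32")
y = np.random.rand(32, 3).astype("float32")
history = model.fit(x, y, epochs=2, verbose=0)

# history.history has one list per loss/metric, one value per epoch
print(sorted(history.history.keys()))
```

The printed keys include `loss` alongside each metric, which is exactly the "separate track" described above: the loss drives training, while the metrics are merely recorded.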

I hope the explanation above makes it clear what metrics are used for, and why you might want multiple metrics in one model. So now, let's say a few words about the mechanics of their usage in Keras. There are two ways of computing them during training:

  1. Using metrics defined at compilation: this is what you directly asked about. In this case, Keras defines a separate tensor for each metric you specify, and computes it during training. This usually makes computation faster, but it comes at the cost of additional compilation, and the metrics must be defined in terms of keras.backend functions.

  2. Using a keras.callbacks.Callback: you can also compute metrics in callbacks. Since each callback has a default model attribute, you can compute a variety of metrics using model.predict or the model's parameters during training. Moreover, this makes it possible to compute them not only epoch-wise, but also batch-wise or training-wise. This comes at the cost of slower computation and more complicated logic, as you need to define the metrics yourself.
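Both mechanisms can be sketched side by side. This is a hedged example: `max_abs_error` is a custom compile-time metric written with keras.backend tensor ops, and `MedianAbsErrorCallback` is an illustrative callback (not a Keras built-in) that recomputes predictions with `model.predict` at the end of every epoch:

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import backend as K

# Mechanism 1: metric defined at compile time, using backend tensor ops.
# Keras evaluates it on each batch alongside the loss.
def max_abs_error(y_true, y_pred):
    return K.max(K.abs(y_true - y_pred))

# Mechanism 2: metric computed in a callback via model.predict.
# Slower (a full extra forward pass per epoch), but free to use
# arbitrary numpy code that need not be differentiable or tensor-based.
class MedianAbsErrorCallback(keras.callbacks.Callback):
    def __init__(self, x_val, y_val):
        super().__init__()
        self.x_val, self.y_val = x_val, y_val

    def on_epoch_end(self, epoch, logs=None):
        preds = self.model.predict(self.x_val, verbose=0)
        med = float(np.median(np.abs(preds - self.y_val)))
        print(f"epoch {epoch}: median abs error = {med:.4f}")

model = keras.Sequential([
    keras.Input(shape=(4,)),
    keras.layers.Dense(1),
])
model.compile(loss="mse", optimizer="sgd", metrics=[max_abs_error])

x = np.random.rand(64, 4).astype("float32")
y = np.random.rand(64, 1).astype("float32")
history = model.fit(x, y, epochs=2, verbose=0,
                    callbacks=[MedianAbsErrorCallback(x, y)])
```

Note how the compile-time metric shows up in `history.history` under its function name, while the callback metric only exists in whatever the callback does with it (here, printing).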

The Keras documentation provides a list of the available metrics, as well as an example of how to define your own.

answered Oct 03 '22 by Marcin Możejko