 

What is a loss function in simple words?

Can anyone please explain in simple words and possibly with some examples what is a loss function in the field of machine learning/neural networks?

This came out while I was following a Tensorflow tutorial: https://www.tensorflow.org/get_started/get_started

Federico asked Mar 18 '17 18:03


People also ask

What is a loss function in simple terms?

In mathematical optimization and decision theory, a loss function or cost function (sometimes also called an error function) is a function that maps an event or values of one or more variables onto a real number intuitively representing some "cost" associated with the event.

What is a loss function in statistics?

A loss function specifies a penalty for an incorrect estimate from a statistical model. Typical loss functions might specify the penalty as a function of the difference between the estimate and the true value, or simply as a binary value depending on whether the estimate is accurate within a certain range.

Why is loss function used?

At its core, a loss function is a measure of how well your prediction model predicts the expected outcome (or value). We convert the learning problem into an optimization problem: define a loss function, then optimize the algorithm to minimize that loss.

Which of the following defines a loss function?

1. Mean Square Error / Quadratic Loss / L2 Loss. The MSE loss is defined as the average of the squared differences between the actual and the predicted values. It is the most commonly used regression loss function; the corresponding cost function is the mean of these squared errors (MSE).
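As a minimal sketch, the MSE described above can be written in plain Python (the function name `mse` and the sample values are just for illustration):

```python
def mse(y_true, y_pred):
    """Mean of squared differences between actual and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

# ((3-2)^2 + (5-7)^2) / 2 = (1 + 4) / 2 = 2.5
print(mse([3.0, 5.0], [2.0, 7.0]))  # 2.5
```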


2 Answers

It describes how far off the result your network produced is from the expected result - it indicates the magnitude of the error your model made in its prediction.

You can then take that error and 'backpropagate' it through your model, adjusting its weights and making it get closer to the truth the next time around.

Piotr Trochim answered Oct 22 '22 00:10


The loss function is how you're penalizing your output.

The following example is for a supervised setting, i.e. when you know what the correct result should be. That said, loss functions can also be applied in unsupervised settings.

Suppose you have a model that always predicts 1. Just the scalar value 1.

You can apply many different loss functions to this model. One common choice is the L2 loss, the squared Euclidean distance between the prediction and the target.

If I pass in some value, say 2, and I want my model to learn the x**2 function, then the correct result should be 4 (because 2**2 = 4). Applying the L2 loss, the error is computed as ||4 - 1||^2 = 9.
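The arithmetic in that example can be checked in a couple of lines (the `l2_loss` helper is just an illustrative name, not a library function):

```python
def l2_loss(target, prediction):
    """Squared distance between a scalar target and prediction."""
    return (target - prediction) ** 2

x = 2
target = x ** 2   # the model should learn x**2, so the target is 4
prediction = 1    # the model always predicts the constant 1

print(l2_loss(target, prediction))  # ||4 - 1||^2 = 9
```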

We can also make up our own loss function. For example, we could say the loss is always 10: no matter what the model outputs, the loss is constant. (Of course, a constant loss has zero gradient everywhere, so the model would never learn anything from it.)

Why do we care about loss functions? They measure how poorly the model did, and in the context of backpropagation and neural networks, they also determine the gradients propagated back from the final layer so the model can learn.

As other comments have suggested, I think you should start with basic material. Here's a good link to start with: http://neuralnetworksanddeeplearning.com/

Steven answered Oct 21 '22 22:10