I would like to know what techniques and metrics are used to evaluate how accurate/good an algorithm is, and how to use a given metric to draw a conclusion about an ML model.
One way to do this is to use precision and recall, as defined here on Wikipedia. Another way is to use the accuracy metric, as explained here. So, what I would like to know is: are there other metrics for evaluating an ML model?
Metrics like accuracy, precision, and recall are good ways to evaluate classification models on balanced datasets, but if the data is imbalanced and there is a class disparity, then other methods like ROC/AUC and the Gini coefficient do a better job of evaluating model performance.
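One way to see why AUC handles imbalance well: ROC AUC equals the probability that a randomly chosen positive example is scored higher than a randomly chosen negative one, so the class proportions cancel out. A minimal pure-Python sketch on made-up imbalanced data (the labels and scores are illustrative, not from any real model):

```python
def auc(labels, scores):
    """ROC AUC via pairwise comparison: fraction of positive/negative
    pairs where the positive is ranked higher (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Imbalanced toy data: 2 positives, 6 negatives.
y_true = [1, 1, 0, 0, 0, 0, 0, 0]
y_score = [0.9, 0.4, 0.5, 0.3, 0.2, 0.2, 0.1, 0.05]
print(auc(y_true, y_score))  # 11 of 12 pairs ranked correctly
```

In practice you would call a library routine such as scikit-learn's `roc_auc_score`; the point of the hand-rolled version is the ranking interpretation.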
- Classification metrics: accuracy, precision, recall, F1-score, ROC, AUC, …
- Regression metrics: MSE, MAE
- Ranking metrics: MRR, DCG, NDCG
- Statistical metrics: correlation
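For the regression metrics in the list above, a minimal sketch computing MSE and MAE by hand (the target and predicted values are made up for illustration):

```python
def mse(y_true, y_pred):
    """Mean squared error: average of squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def mae(y_true, y_pred):
    """Mean absolute error: average of absolute residuals."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

y_true = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.0, 8.0]
print(mse(y_true, y_pred))  # 0.375
print(mae(y_true, y_pred))  # 0.5
```

MSE penalizes large errors more heavily than MAE, which is why the choice between them matters when outliers are present.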
Accuracy, confusion matrix, log-loss, and AUC-ROC are some of the most popular metrics.
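Of the metrics just named, log-loss is the least self-explanatory: it scores the predicted probabilities rather than the hard predictions. A minimal sketch of binary log-loss on made-up probabilities:

```python
import math

def log_loss(y_true, y_prob, eps=1e-15):
    """Binary cross-entropy: penalizes confident wrong predictions."""
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)

# Toy example: mostly-correct, moderately confident probabilities.
print(log_loss([1, 0, 1, 1], [0.9, 0.1, 0.8, 0.6]))
```

A lower log-loss is better; a model that is confidently wrong (e.g. predicting 0.99 for a true negative) is punished far more than one that hedges at 0.5.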
A while ago I compiled a list of metrics used to evaluate classification and regression algorithms, in the form of a cheat sheet. Some metrics for classification: precision, recall, sensitivity, specificity, F-measure, Matthews correlation coefficient, etc. They are all based on the confusion matrix. Other metrics exist for regression (continuous output variable).
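A sketch of how several of those classification metrics fall out of the confusion-matrix counts (the toy labels and predictions are made up):

```python
import math

def confusion_counts(y_true, y_pred):
    """Return (TP, TN, FP, FN) for binary labels in {0, 1}."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 0, 1, 0, 1]
tp, tn, fp, fn = confusion_counts(y_true, y_pred)

precision = tp / (tp + fp)
recall = tp / (tp + fn)          # a.k.a. sensitivity
specificity = tn / (tn + fp)
f1 = 2 * precision * recall / (precision + recall)
mcc = (tp * tn - fp * fn) / math.sqrt(
    (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
```

Every one of these is a different ratio over the same four counts, which is why the confusion matrix is the common starting point.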
The technique is mostly to run an algorithm on some data to obtain a model, then apply that model to new, previously unseen data, evaluate the metric on that data set, and repeat.
Some techniques (actually resampling techniques from statistics) include cross-validation and bootstrapping.
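A minimal sketch of one such resampling technique, k-fold cross-validation: shuffle the data, split it into k folds, train on k−1 folds, evaluate on the held-out fold, and average the scores. The "model" here is a trivial majority-class classifier, chosen only to keep the example self-contained:

```python
import random

def kfold_indices(n, k, seed=0):
    """Shuffle indices 0..n-1 and deal them into k folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def majority_class(labels):
    """Trivial 'model': always predict the most frequent label."""
    return max(set(labels), key=labels.count)

def cross_validate(y, k=4):
    """Average held-out accuracy of the majority-class model."""
    folds = kfold_indices(len(y), k)
    accs = []
    for test_idx in folds:
        test_set = set(test_idx)
        train_y = [y[j] for j in range(len(y)) if j not in test_set]
        pred = majority_class(train_y)          # "fit" on training folds
        acc = sum(y[j] == pred for j in test_idx) / len(test_idx)
        accs.append(acc)
    return sum(accs) / len(accs)

y = [1, 1, 1, 1, 1, 0, 0, 0]
print(cross_validate(y, k=4))
```

With a real learner you would fit on the training folds instead of taking the majority class, but the split/fit/score/average loop is the same.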