Why is the F-Measure a harmonic mean and not an arithmetic mean of the Precision and Recall measures?

3 Answers

To explain, consider for example, what the average of 30mph and 40mph is? if you drive for 1 hour at each speed, the average speed over the 2 hours is indeed the arithmetic average, 35mph.

However if you drive for the same distance at each speed -- say 10 miles -- then the average speed over 20 miles is the harmonic mean of 30 and 40, about 34.3mph.

The reason is that for the average to be valid, you really need the values to be in the same scaled units. Miles per hour need to be compared over the same number of hours; to compare over the same number of miles you need to average hours per mile instead, which is exactly what the harmonic mean does.

Precision and recall both have true positives in the numerator, and different denominators. To average them it really only makes sense to average their reciprocals, thus the harmonic mean.

119

answered Oct 16 '22 09:10

Sean Owen

Because it punishes extreme values more.

Consider a trivial method (e.g. always returning class A). There are infinite data elements of class B, and a single element of class A:

Precision: 0.0
Recall:    1.0

When taking the arithmetic mean, it would have 50% correct. Despite being the worst possible outcome! With the harmonic mean, the F1-measure is 0.

Arithmetic mean: 0.5
Harmonic mean:   0.0

In other words, to have a high F1, you need to both have a high precision and recall.

answered Oct 16 '22 07:10

Has QUIT--Anony-Mousse

The above answers are well explained. This is just for a quick reference to understand the nature of the arithmetic mean and the harmonic mean with plots. As you can see from the plot, consider the X axis and Y axis as precision and recall, and the Z axis as the F1 Score. So, from the plot of the harmonic mean, both the precision and recall should contribute evenly for the F1 score to rise up unlike the Arithmetic mean.

This is for the arithmetic mean.

enter image description here

This is for the Harmonic mean.

enter image description here

answered Oct 16 '22 07:10

gadde saikumar

Related questions
                            
                                Accuracy Score ValueError: Can't Handle mix of binary and continuous target
                            
                                cocktail party algorithm SVD implementation ... in one line of code?
                            
                                scikit-learn .predict() default threshold
                            
                                What is the intuition of using tanh in LSTM? [closed]
                            
                                RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same
                            
                                How to create a new gym environment in OpenAI?
                            
                                keras: how to save the training history attribute of the history object
                            
                                How to get Tensorflow tensor dimensions (shape) as int values?
                            
                                What is machine learning? [closed]
                            
                                What is the difference between np.mean and tf.reduce_mean?
                            
                                Keras: Difference between Kernel and Activity regularizers
                            
                                Understanding min_df and max_df in scikit CountVectorizer
                            
                                What is the role of TimeDistributed layer in Keras?
                            
                                Error in Python script "Expected 2D array, got 1D array instead:"?
                            
                                What is the mAP metric and how is it calculated? [closed]
                            
                                Common causes of nans during training
                            
                                Python: tf-idf-cosine: to find document similarity
                            
                                word2vec: negative sampling (in layman term)?
                            
                                How to concatenate two layers in keras?
                            
                                multi-layer perceptron (MLP) architecture: criteria for choosing number of hidden layers and size of the hidden layer? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is the F-Measure a harmonic mean and not an arithmetic mean of the Precision and Recall measures?

Tags:

machine-learning

classification

data-mining

London guy

People also ask

3 Answers

Sean Owen

Has QUIT--Anony-Mousse

gadde saikumar

Recent Activity

Donate For Us