Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

difference between baseline and benchmark in performance of an application

what is a baseline and what is a benchmark? what is the best definition for these and how do you baseline a set of numbers and benchmark another set?

like image 554
gagneet Avatar asked Dec 06 '08 21:12

gagneet


People also ask

What is the difference between benchmark and baseline in performance testing?

Baseline Testing is specific to an individual software application. Benchmark Testing is often applicable to all the software applications belong to an organization. Baseline Testing is done from the application and user experience point of view. Benchmark testing is done from business and SLA point of view.

What is a baseline benchmark?

A benchmark is a standard point of reference within your industry against which things may be compared or assessed. A baseline is the starting point used to compare your historical performance. Both are connected in the world of business analysis, sometimes interchangeable. The definition can change based on context.

What is a performance benchmark?

What is performance benchmarking? Performance benchmarking is the process of measuring and analyzing an organization's performance of products, services, operations, and other business processes against other companies, competitors, or industry leaders. It helps businesses identify and understand areas for improvement.

What is the difference between benchmarking and baselining in an IT risk management context?

Difference between benchmark and baseline is that benchmark is comparing the performance of a company with the best practices in the industry; baseline is setting up a framework before any project starts, that can be used as a basis for implementation. Both these techniques are performance measuring tools.


2 Answers

In scientific research, a benchmark is a kind of test and a baseline is a kind of result.

Let's look at an example of a benchmark test: we might take a collection of 5,000 sentences in English and use the lab's four-core Dell machine to translate them into Spanish using various algorithms. Because we've kept the data and the machine constant, we can meaningfully compare the time taken by the different algorithms to complete the task, as well as their relative accuracy (measured against gold-standard human translations).

To find a baseline for this benchmark test, we might write a very naive translation algorithm that just finds the commonest translation for each individual word, with no regard for the context. Measuring the accuracy of this algorithm against our human translations gives us an idea of the minimum score - the baseline - that the others must beat, and gives us a feel for what level of accuracy counts as "good".

At the other end of the scale from a baseline, an upper bound is a useful yardstick too. In the translation example, we might find the upper bound by measuring the accuracy of one of our human translations with respect to the others. This gives us an idea of how high it's possible to get on our "accuracy" measure before you hit the ceiling of human disagreement. We expect our machine translation algorithms to perform at a level between the baseline and the upper bound.

like image 185
Tommy Herbert Avatar answered Oct 23 '22 20:10

Tommy Herbert


Interesting definitions from SPR (Software Productivity Research)

Baseline and benchmark are similar but distinct activities.

Figuratively, a baseline is a "line in the sand" for an organization whereby it measures important performance characteristics for future reference.

This is not necessarily a "good" state", just a reference.

A benchmark is best understood by way of the original derivation of the word itself:

Tradesmen engaged in repetitive tasks, such as sawing lumber to consistent lengths, often placed notches on their workbenches to indicate placement of boards prior to cutting. Literally, a benchmark became a standard for comparison and an indicator of past success.

Basically:

  • baseline is about identification of a significant state, meaning your set of numbers met an approval status, publicly recognized.
  • a benchmark is about assessing the relative performance of an application.
like image 30
VonC Avatar answered Oct 23 '22 18:10

VonC