Assume we have a long array of doubles, say, N == 1000000:

array<double, N> arr;
There are two naive approaches to computing the average. The first:

double result = 0;
for (double x : arr) {
    result += x;
}
result /= arr.size();
This may be inaccurate when the sum of the values gets very big: once the running sum is large, adding each comparatively small element loses low-order bits of precision.
Another approach is:

double result = 0;
for (double x : arr) {
    result += x / arr.size();
}
This may lose precision when the individual terms x / arr.size() are small, and each of the N divisions introduces its own rounding error.
Is there any fail-safe way to calculate a simple average of floating-point numbers? Solutions that use only the standard library are appreciated.
If you have a large number of values to average (which is the only case in which you would have the problem that the sum overflows a double), then this algorithm will have severe underflow issues. Essentially, at some point, (x-avg) becomes zero. – Martin B
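For context, the comment above appears to refer to the incremental running-mean update; a minimal sketch of that approach (the helper name running_mean is just illustrative, not from the thread):

#include <cstddef>
#include <vector>

// Running mean: avg_n = avg_(n-1) + (x_n - avg_(n-1)) / n.
// Once avg is large and x is close to it, (x - avg) / n can round to zero,
// which is the underflow the comment describes.
double running_mean(const std::vector<double>& values) {
    double avg = 0.0;
    std::size_t n = 0;
    for (double x : values) {
        ++n;
        avg += (x - avg) / static_cast<double>(n);
    }
    return avg;
}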
Since all input numbers fit into double range, the mean will also fit into double range, thus a solution using doubles only is possible. – akuhn Dec 19, 2009 at 21:47
Of course. I did propose using the better approaches already suggested. (Some of those objects should be garbage-collected at some point.) – Bozho Dec 19, 2009 at 22:46
If you want to squeeze more accuracy out of doubles, you can use Kahan summation and then divide by the number of elements at the end. There is, however, no standard-library implementation of Kahan summation that I know of.
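A minimal sketch of Kahan (compensated) summation followed by the final division; this is hand-rolled, not a standard-library facility, and the name kahan_mean is just illustrative:

#include <cstddef>
#include <vector>

double kahan_mean(const std::vector<double>& values) {
    double sum = 0.0;
    double c = 0.0;               // running compensation for lost low-order bits
    for (double x : values) {
        double y = x - c;         // apply the compensation to the next term
        double t = sum + y;       // low-order bits of y may be lost here
        c = (t - sum) - y;        // algebraically zero; numerically the lost part
        sum = t;
    }
    return sum / static_cast<double>(values.size());
}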
An easy, standard way (almost like cheating) would of course be to do the calculation in long double, basically using your first implementation and converting the result back to double precision at the end.
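A sketch of that idea, assuming a wider long double is actually available (on some compilers, e.g. MSVC, long double is the same 64 bits as double, so it gains nothing there); the name long_double_mean is illustrative:

#include <cstddef>
#include <vector>

double long_double_mean(const std::vector<double>& values) {
    long double sum = 0.0L;       // accumulate in extended precision
    for (double x : values) {
        sum += x;
    }
    return static_cast<double>(sum / static_cast<long double>(values.size()));
}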
The so-called naive ways are not naive. What do the data mean, and how accurately can you measure those values? Unless the answer is something very unusual, the simple method with doubles is fine. However, single-precision floats are a bit under-powered for general use.
If you add the smallest absolute values first, you might get an extra bit or so of precision. That requires a sort. If the data are all above a certain threshold, subtracting the minimum may also gain you another bit.
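As an illustration of the subtract-the-minimum idea (the helper name shifted_mean is hypothetical): shift by the smallest value, average the residuals, and add the shift back. This mainly helps when the values are large but tightly clustered.

#include <algorithm>
#include <cstddef>
#include <vector>

double shifted_mean(const std::vector<double>& values) {
    // Assumes values is non-empty.
    double lo = *std::min_element(values.begin(), values.end());
    double sum = 0.0;
    for (double x : values) {
        sum += x - lo;            // residuals are smaller, so less rounding error
    }
    return lo + sum / static_cast<double>(values.size());
}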
You can also store a partial total and a partial mean, and check at each stage that partial mean * number processed is within a certain tolerance of the partial total. That won't give you any extra accuracy, but it will tell you if the floating-point arithmetic is too inaccurate for your purposes.
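A sketch of that consistency check, keeping both a running total and a running mean; the tolerance tol and the name checked_mean are arbitrary illustrative choices:

#include <cmath>
#include <cstddef>
#include <cstdio>
#include <vector>

double checked_mean(const std::vector<double>& values, double tol = 1e-9) {
    double total = 0.0;
    double mean = 0.0;
    std::size_t n = 0;
    for (double x : values) {
        ++n;
        total += x;
        mean += (x - mean) / static_cast<double>(n);
        // mean * n should track total; a widening gap flags precision trouble.
        if (std::fabs(mean * static_cast<double>(n) - total) > tol * std::fabs(total)) {
            std::fprintf(stderr, "precision warning after %zu elements\n", n);
        }
    }
    return mean;
}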
You can also use long double, or even code your own extended-precision floating-point library (or use someone else's). However, the solutions get increasingly heroic.
One way to reduce the loss of precision is to sort the doubles and add them in ascending order, starting with the smallest values, and then divide the final sum by the number of doubles at the end.
So the tools you need would be std::sort, std::accumulate, and plain old division (/).
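Putting those three tools together, a minimal sketch (the name sorted_mean is just for illustration); for mixed-sign data you might sort by absolute value instead:

#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>

double sorted_mean(std::vector<double> values) {    // taken by value so the caller's data stays unsorted
    std::sort(values.begin(), values.end());         // smallest values first
    double sum = std::accumulate(values.begin(), values.end(), 0.0);
    return sum / static_cast<double>(values.size());
}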