How can I get consistent program behavior when using floats?

I am writing a simulation program that proceeds in discrete steps. The simulation consists of many nodes, each of which has a floating-point value associated with it that is re-calculated on every step. The result can be positive, negative or zero.

In the case where the result is zero or less, something happens. So far this seems straightforward - I can just do something like this for each node:

if (value <= 0.0f) something_happens();

A problem has arisen, however, after some recent changes I made to the program in which I re-arranged the order in which certain calculations are done. In a perfect world the values would still come out the same after this re-arrangement, but because of the imprecision of floating point representation they come out very slightly different. Since the calculations for each step depend on the results of the previous step, these slight variations in the results can accumulate into larger variations as the simulation proceeds.

Here's a simple example program that demonstrates the phenomenon I'm describing:

#include <stdio.h>

int main(void)
{
    float f1 = 0.000001f, f2 = 0.000002f;
    f1 += 0.000004f;         // This part happens first here
    f1 += (f2 * 0.000003f);
    printf("%.16f\n", f1);

    f1 = 0.000001f, f2 = 0.000002f;
    f1 += (f2 * 0.000003f);
    f1 += 0.000004f;         // This time this happens second
    printf("%.16f\n", f1);

    return 0;
}

The output of this program is

0.0000050000057854
0.0000050000062402

even though, in exact arithmetic, the order of the additions would not matter and both results would be the same. Note: I understand perfectly well why this is happening - that's not the issue. The problem is that these variations mean that a value which used to come out negative on step N, triggering something_happens(), may now come out negative a step or two earlier or later, which can lead to very different overall simulation results because something_happens() has a large effect.

What I want to know is whether there is a good way to decide when something_happens() should be triggered - one that is not affected by the tiny variations in results caused by re-ordering operations - so that the behavior of newer versions of my program stays consistent with the older versions.

The only solution I've so far been able to think of is to use some value epsilon like this:

if (value < epsilon) something_happens();

but because the tiny variations in the results accumulate over time I need to make epsilon quite large (relatively speaking) to ensure that the variations don't result in something_happens() being triggered on a different step. Is there a better way?

I've read this excellent article on floating point comparison, but I don't see how any of the comparison methods described could help me in this situation.

Note: Using integer values instead is not an option.


Edit: The possibility of using doubles instead of floats has been raised. This wouldn't solve my problem, since the variations would still be there; they'd just be of a smaller magnitude.

asked Mar 08 '12 by sjs



2 Answers

I've worked with simulation models for 2 years, and the epsilon approach is the sanest way to compare your floats.
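
A minimal sketch of such an epsilon comparison, combining an absolute and a relative tolerance; the threshold values and the helper name are only illustrative, not something prescribed by this answer:

#include <algorithm>
#include <cmath>

// Treat "close enough to zero, or below it" as a trigger. The absolute
// tolerance handles values near zero; the relative tolerance handles
// larger magnitudes. Both constants are tuning knobs, not fixed rules.
bool should_trigger(float value, float abs_eps = 1e-5f, float rel_eps = 1e-4f)
{
    float tol = std::max(abs_eps, rel_eps * std::fabs(value));
    return value <= tol;
}

The caveat from the question still applies: if the accumulated differences between program versions grow larger than the tolerance, the trigger can still move to a different step.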

answered Oct 20 '22 by Pepe


Generally, using suitable epsilon values is the way to go if you need to use floating point numbers. Here are a few things which may help:

  • If your values are in a known range and you don't need divisions, you may be able to scale the problem and use exact operations on integers (a sketch of the scaling idea appears below the list). In general, though, these conditions don't apply.
  • A variation is to use rational numbers to do exact computations. This still has restrictions on the operations available and it typically has severe performance implications: you trade performance for accuracy.
  • The rounding mode can be changed. This can be used to compute an interval rather than an individual value (possibly with three values resulting from rounding up, rounding down, and rounding to nearest). Again, it won't work for everything, but you may get an error estimate out of it (also sketched below the list).
  • Keeping track of the value and the number of operations (possibly multiple counters) may also be used to estimate the current size of the error.
  • To experiment with different numeric representations (float, double, interval, etc.), you might want to implement your simulation as templates parameterized on the numeric type (sketched at the end of this answer).
  • There are many books written on estimating and minimizing errors when using floating point arithmetic. This is the topic of numerical mathematics.
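
A rough illustration of the first point, assuming the values can be held as fixed-point integers; the scale and the deltas are made up for the example:

#include <cstdint>
#include <cstdio>

// Store node values as integer multiples of 1/2^20. Integer addition is
// exact, so re-ordering the additions cannot change the result and the
// "<= 0" test becomes deterministic across program versions.
typedef std::int64_t fixed;            // value represented = fixed / 2^20
const fixed ONE = 1 << 20;

int main()
{
    fixed value  = ONE / 2;            // 0.5
    fixed delta1 = -ONE / 3;           // roughly -0.333
    fixed delta2 = -ONE / 5;           // -0.2

    fixed a = (value + delta1) + delta2;   // one evaluation order
    fixed b = (value + delta2) + delta1;   // the other order
    std::printf("%d\n", a == b);           // always prints 1

    if (a <= 0) std::printf("something_happens()\n");
    return 0;
}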
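
A sketch of the rounding-mode idea, using the question's example expression. Note that compilers may need a flag such as -frounding-math (or an FENV_ACCESS pragma) before fesetround is honoured, so treat this as an experiment rather than a guarantee:

#include <cfenv>
#include <cstdio>

// Evaluate the same expression under round-down and round-up to get a
// crude interval around the exact result. If the whole interval is <= 0,
// the trigger is unambiguous; if only part of it is, the decision is
// known to be sensitive to rounding.
static float step(float f1, float f2)
{
    f1 += 0.000004f;
    f1 += f2 * 0.000003f;
    return f1;
}

int main()
{
    std::fesetround(FE_DOWNWARD);
    float lo = step(0.000001f, 0.000002f);

    std::fesetround(FE_UPWARD);
    float hi = step(0.000001f, 0.000002f);

    std::fesetround(FE_TONEAREST);     // restore the default mode
    std::printf("[%.16f, %.16f]\n", lo, hi);
    return 0;
}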

In most cases I'm aware of, people experiment briefly with some of the methods mentioned above, conclude that the model is imprecise anyway, and don't bother with the effort. Also, doing something other than using float may yield better results but is just too slow; even double can be too slow, due to the doubled memory footprint and the reduced opportunity to use SIMD operations.
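
Finally, a sketch of the template parameterization mentioned in the last bullet; the function and parameter names are invented for the illustration:

#include <cstdio>

// Parameterize the per-node update on the numeric type so the same
// simulation code can be compiled with float, double, or a user-defined
// interval/rational type for comparison runs.
template <typename Real>
Real update_node(Real value, Real delta, Real factor, Real other)
{
    value += delta;
    value += other * factor;
    return value;
}

int main()
{
    float  vf = update_node<float>(0.000001f, 0.000004f, 0.000003f, 0.000002f);
    double vd = update_node<double>(0.000001, 0.000004, 0.000003, 0.000002);
    std::printf("%.16f\n%.16f\n", vf, vd);   // float is promoted to double for %f
    return 0;
}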

answered Oct 20 '22 by Dietmar Kühl