How to avoid floating point exceptions in unused SIMD lanes

Tags:

I like to run my code with floating point exceptions enabled. I do this under Linux using:

feenableexcept( FE_DIVBYZERO | FE_INVALID | FE_OVERFLOW );

So far so good.

The issue I am having, is that sometimes the compiler (I use clang8) decides to use SIMD instructions to do a scalar division. Fine, if that is faster, even for a single scalar, why not.

But the result is that an unused lane in the SIMD register can contain a zero.

And when the SIMD division is executed, a floating point exception is thrown.

Does that mean that floating point exceptions cannot be used at all if you allow the compiler to use sse/avx extensions?

In my case, this line of C code:

float a0, min, a, d;
...
a0 = (min - a) / (d);

...is exectuted as:

divps  %xmm2,%xmm3

Which then throws a:

Thread 1 "noisetuner" received signal SIGFPE, Arithmetic exception.

826

asked Jul 28 '20 01:07

Bram

1 Answers

I think you have found a bug in clang or maybe in llvm.

Here’s how I have reproduced, clang 10.0 emits the same code i.e. has that bug as well. Clearly, that vdivps instruction only has valid data in the initial 2 lanes of the vectors, and in the higher 2 lanes it will run 0.0 / 0.0, thus you’ll get a runtime exception if you enable these interrupts in mxcsr register like you’re doing.

Microsoft, Intel and gcc don’t emit divps for that code. If you can, switch to gcc and it should be good.

Update: Clang 10+ has an option controlling such optimizations, -ffp-exception-behavior=maytrap, take a look: https://godbolt.org/z/WG7bEE

114

answered Oct 17 '22 21:10

Soonts

Related questions
                            
                                Range of Float in Java
                            
                                decimal.InvalidOperation error when rounding values in Series
                            
                                Why does python round(np.float16(np.pi),5) return infinity? Bug, limitation, or expected?
                            
                                How does Python compare 'int' to 'float' objects?
                            
                                OpenGL: How do I avoid rounding errors when specifying UV co-ordinates
                            
                                Floating point limits
                            
                                Cosine in floating point
                            
                                Are there float and double types with fixed sizes in C99?
                            
                                How to compute a double precision float score from the first 8 bytes of a string in Python?
                            
                                Using depth buffer for layering 2D sprites
                            
                                Do floats, doubles, and long doubles have a guaranteed minimum precision?
                            
                                How can I decode f16 to f32 using only the stable standard library?
                            
                                Why does Excel not round according to 8-byte IEEE 754
                            
                                Compare a 32 bit float and a 32 bit integer without casting to double, when either value could be too large to fit the other type exactly
                            
                                Comparing sqrt(n) with the rational p/q
                            
                                C float literal translation
                            
                                What is going on with floating point precision here?
                            
                                Is floating point expression contraction allowed in C++?
                            
                                PHP floating point precision: Is var_dump secretly rounding and how can I debug precisley then?
                            
                                Check if a number is exactly representable as `f32`

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to avoid floating point exceptions in unused SIMD lanes

Tags:

floating-point

clang

simd

sigfpe

floating-point-exceptions

Bram

People also ask

1 Answers

Soonts

Recent Activity

Donate For Us