According to the documentation, there is an fma() function in math.h. That is very nice, and I know how FMA works and what to use it for. However, I am not so certain how this is implemented in practice; I'm mostly interested in the x86 and x86_64 architectures.
Is there a floating-point (non-vector) instruction for FMA, perhaps as defined by IEEE 754-2008?
Is the FMA3 or FMA4 instruction used?
Is there an intrinsic to make sure that a real FMA is used, when the precision is relied upon?
Since 1990 the FMA instruction has been supported by several processors, such as the HP/Intel Itanium, which has been used as a testing system by many algorithm implementers in the past [Brisebarre2010] (Chapter 5). For examples see [Ogita2005] and [Graillat2007].
Thus, it is important to check whether the system in use provides a hardware-implemented FMA operation, in order to avoid the slow software emulation the C11 standard remarks on [ISO-IEC-9899-2011] (Chapter 7.12).
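C11's compile-time way to ask this question is the FP_FAST_FMA macro family from math.h (Chapter 7.12). A minimal sketch; note the macro only promises that fma() is roughly as fast as a separate multiply and add, which in practice usually means hardware support:

```c
#include <math.h>

/* C11 7.12: the implementation defines FP_FAST_FMA when fma(x, y, z)
   executes about as fast as, or faster than, x * y + z — i.e. typically
   when a hardware FMA instruction backs it. FP_FAST_FMAF and
   FP_FAST_FMAL exist analogously for float and long double. */
int fma_is_fast(void) {
#if defined(FP_FAST_FMA)
    return 1;
#else
    return 0;
#endif
}
```

This is a property of the C library build, not of the CPU you happen to run on, so a runtime CPU-feature check can still disagree with it.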
The actual implementation varies from platform to platform, but speaking very broadly:
If you tell your compiler to target a machine with hardware FMA instructions (PowerPC, ARM with VFPv4 or AArch64, Intel Haswell or AMD Bulldozer and onwards), the compiler may replace calls to fma() by just dropping the appropriate instruction into your code. This is not guaranteed, but is generally good practice. Otherwise you will get a call to the math library, and:
When running on a processor that has hardware FMA, those instructions should be used to implement the function. However, if you have an older version of your operating system, or an older version of the math library, it may not take advantage of those instructions.
If you are running on a processor that does not have hardware FMA, or you are using an older (or just not very good) math library, then a software implementation of FMA will be used instead. This might be implemented using clever extended-precision floating-point tricks, or with integer arithmetic.
The result of the fma() function should always be correctly rounded (i.e. a "real fma"). If it is not, that's a bug in your system's math library. Unfortunately, fma() is one of the more difficult math library functions to implement correctly, so many implementations have bugs. Please report them to your library vendor so they get fixed!
Is there an intrinsic to make sure that a real FMA is used, when the precision is relied upon?
Given a good compiler, this shouldn't be necessary; it should suffice to use the fma() function and tell the compiler which architecture you are targeting. However, compilers are not perfect, so you may need to use the _mm_fmadd_sd() and related intrinsics on x86 (but report the bug to your compiler vendor!).
One way to implement FMA in software is by splitting the significand into high and low bits. I use Dekker's algorithm:
typedef struct { float hi; float lo; } doublefloat;

doublefloat split(float a) {
    float t = ((1 << 12) + 1) * a;  /* Dekker's splitting constant 2^12 + 1 for 24-bit floats */
    float hi = t - (t - a);
    float lo = a - hi;
    return (doublefloat){hi, lo};
}
Once you split the float, you can calculate a*b - c with a single rounding like this:
float fmsub(float a, float b, float c) {
    doublefloat as = split(a), bs = split(b);
    return ((as.hi * bs.hi - c) + as.hi * bs.lo + as.lo * bs.hi) + as.lo * bs.lo;
}
This basically subtracts c from (ahi,alo)*(bhi,blo) = ahi*bhi + ahi*blo + alo*bhi + alo*blo.
I got this idea from the twoProd function in the paper Extended-Precision Floating-Point Numbers for GPU Computation and from the mul_sub_x function in Agner Fog's vector class library. He uses a different splitting function for vectors of floats, which masks off the low significand bits instead. I tried to reproduce a scalar version here:
typedef union { float f; int i; } u;

doublefloat split2(float a) {
    u lo, hi = {a};
    hi.i &= -(1 << 12);  /* clear the low 12 bits of the significand */
    lo.f = a - hi.f;
    return (doublefloat){hi.f, lo.f};
}
In any case, using split or split2 in fmsub agrees well with fma(a,b,-c) from the math library in glibc. For whatever reason my version is significantly faster than fma, except on a machine that has hardware FMA (in which case I use _mm_fmsub_ss anyway).