What are the applications/benefits of an 80-bit extended precision data type?

Tags:

Yeah, I meant to say 80-bit. That's not a typo...

My experience with floating point variables has always involved 4-byte multiples, like singles (32 bit), doubles (64 bit), and long doubles (which I've seen referred to as either 96-bit or 128-bit). That's why I was a bit confused when I came across an 80-bit extended precision data type while I was working on some code to read and write to AIFF (Audio Interchange File Format) files: an extended precision variable was chosen to store the sampling rate of the audio track.

When I skimmed through Wikipedia, I found the link above along with a brief mention of 80-bit formats in the IEEE 754-1985 standard summary (but not in the IEEE 754-2008 standard summary). It appears that on certain architectures "extended" and "long double" are synonymous.

One thing I haven't come across are specific applications that make use of extended precision data types (except for, of course, AIFF file sampling rates). This led me to wonder:

Has anyone come across a situation where extended precision was necessary/beneficial for some programming application?
What are the benefits of an 80-bit floating point number, other than the obvious "it's a little more precision than a double but fewer bytes than most implementations of a long double"?
Is its applicability waning?

349

asked Mar 04 '09 21:03

gnovice

1 Answers

Intel's FPUs use the 80-bit format internally to get more precision for intermediate results.

That is, you may have 32-bit or 64-bit variables, but when they are loaded into the FPU registers, they are converted to 80 bit; the FPU then (by default) performs all calculations in 80 but; after the calculation, the result is stored back into a 32-bit or 64-bit variables.

BTW - A somewhat unfortunate consequence of this is that debug and release builds may produce slightly different results: in the release build, the optimizer may keep an intermediate variable in an 80-bit FPU register, while in the debug build, it will be stored in a 64-bit variable, causing loss of precision. You can avoid this by using 80-bit variables, or use an FPU switch (or compiler option) to perform all calculations in 64 bit.

answered Sep 19 '22 00:09

oefe

Related questions
                            
                                PHP float modulus not working
                            
                                What's the difference between the classes Floating and Fractional in Haskell?
                            
                                How to set precision of a float
                            
                                Convert float to plain string representation
                            
                                Is indexing of Data.Vector.Unboxed.Mutable.MVector really this slow?
                            
                                Why is there no ceil(float) in Java?
                            
                                How can I compare the performance of log() and fp division in C++?
                            
                                What does clang's `-Ofast` option do in practical terms especially for any differences from gcc?
                            
                                Should we compare floating point numbers for equality against a *relative* error?
                            
                                C: Casting minimum 32-bit integer (-2147483648) to float gives positive number (2147483648.0)
                            
                                Trapping quiet NaN
                            
                                How do you get the next value in the floating-point sequence? [duplicate]
                            
                                Are there any whole numbers which the double cannot represent within the MIN/MAX range of a double?
                            
                                Problem with Precision floating point operation in C
                            
                                Why is BigDecimal returning a weird value?
                            
                                std::is_floating_point returns false for float in some cases
                            
                                Integer division always zero [duplicate]
                            
                                C# float infinite loop
                            
                                F# converting a string to a float
                            
                                Convert floating point variable to integer?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What are the applications/benefits of an 80-bit extended precision data type?

Tags:

floating-point

ieee-754

long-double

x87

extended-precision

gnovice

People also ask

1 Answers

oefe

Recent Activity

Donate For Us