Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Floating point support in 64-bit compiler

What should we expect from the floating point support in 64-bit Delphi compiler?

  • Will 64-bit compiler use SSE to implement floating point arithmetic?

  • Will 64-bit compiler support the current 80-bit floating type (Extended)?

These questions are closely related, so I ask them as a single question.

like image 740
kludg Avatar asked Oct 31 '10 17:10

kludg


People also ask

What is the range of float with 64 bits?

A double precision, floating-point number is a 64-bit approximation of a real number. The number can be zero or can range from -1.797693134862315E+308 to -2.225073858507201E-308, or from 2.225073858507201E-308 to 1.797693134862315E+308.

Is 64-bit double precision?

The XDR standard defines the encoding for the double-precision floating-point data type as a double. The length of a double is 64 bits or 8 bytes. Doubles are encoded using the IEEE standard for normalized double-precision floating-point numbers.

What is the bit size for floating-point?

According to this standard, floating point numbers are represented with 32 bits (single precision) or 64 bits (double precision).

What is the difference between a 32-bit and 64-bit floating-point value?

Floating Point Numbers Floats generally come in two flavours: “single” and “double” precision. Single precision floats are 32-bits in length while “doubles” are 64-bits. Due to the finite size of floats, they cannot represent all of the real numbers - there are limitations on both their precision and range.


3 Answers

I made two posts on the subject (here and there), to summarize, yes, the 64bit compiler uses SSE2 (double precision), but it doesn't use SSE (single precision). Everything is converted to double precision floats, and computed using SSE2 (edit: however there is an option to control that)

This means f.i. that if Maths on double precision floats is fast, maths on single precision is slow (lots of redundant conversions between single and double precisions are thrown in), "Extended" is aliased to "Double", and intermediate computations precision is limited to double precision.

Edit: There was an undocumented (at the time) directive that controls SSE code generation, {$EXCESSPRECISION OFF} activates SSE code generation, which brings back performance within expectations.

like image 55
Eric Grange Avatar answered Sep 21 '22 06:09

Eric Grange


According to Marco van de Voort in his answer to: How should I prepare my 32-bit Delphi programs for an eventual 64-bit compiler:

x87 FPU is deprecated on x64, and in general SSE2 will be used for florating point. so floating point and its exception handling might work slightly differently, and extended might not be 80-bit (but 64-bit or, less likely 128-bit). This also relates to the usual rounding (copro controlwork) changes when interfacing wiht C code that expects a different fpu word.

PHis commented on that answer with:

I wouldn't say that the x87 FPU is deprecated, but it is certainly the case that Microsoft have decided to do their best to make it that way (and they really don't seem to like 80-bit FP values), although it is clearly technically possible to use the FPU/80-bit floats on Win64.

like image 39
lkessler Avatar answered Sep 19 '22 06:09

lkessler


I just posted an answer to your other question, but I guess it actually should go here:

Obviously, nobody except for Embarcadero can answer this for sure before the product is released.

It is very likely that any decent x64 compiler will use the SSE2 instruction set as a baseline and therefore attempt to do as much floating point computation using SSE features as possible, minimising the use of the x87 FPU. However, it should also be said that there is no technical reason that would prevent the use of the x87 FPU in x64 application code (despite rumours to the contrary which have been around for some time; if you want more info on that point, please have a look at Agner Fog's Calling Convention Manual, specifically chapter 6.1 "Can floating point registers be used in 64-bit Windows?").

Edit 1: Delphi XE2 Win64 indeed does not support 80-bit floating-point calculations out of the box (see e.g. discussuion here (although it allows one to read/write such values). One can bring such capabilities back to Delphi Win64 using a record + class operators, as is done in this TExtendedX87 type (although caveats apply).

like image 29
PhiS Avatar answered Sep 20 '22 06:09

PhiS