Why do printf and isnan disagree whether a long double value is a NaN?

Tags:

c++

c

floating-point

clang

Consider the following code:

#include <math.h>
#include <stdio.h>
#include <string.h>

int main() {
  __uint128_t n = (__uint128_t(0x00007ffff7dd6e65ULL) << 64) |
                               0x63696c400a2d2d21ULL;
  long double d = 0;
  memcpy(&d, &n, sizeof(long double));
  printf("%d\n", isnan(d));
  printf("%Le\n", d);
}

When compiled with clang version 12.0.0 (clang-1200.0.32.29) and run on macOS, it produces the following output:

1
3.345927e+3575

Why does isnan report this long double as NaN while printf prints it as 3.345927e+3575?

The same happens with iostreams and clang++:

std::cout << d; // prints 3.34593e+3575

Specifically, why is there a difference in behavior between different C and C++ APIs when handling this number (which appears to be an unnormal extended-precision number)?


asked Feb 20 '21 19:02

vitaut

1 Answer

The object formed by the initialization from the 128-bit integer is invalid because its explicit significand bit does not match the other bits.

Apple Clang is using Intel’s 80-bit floating-point format. According to Intel 64 and IA-32 Architectures Software Developer’s Manual (December 2017) 4.2.2, the “Integer” bit is explicitly set to 1 for infinities, normal numbers, and NaNs and to 0 for subnormals and zeros. The Integer¹ bit is the leading bit of the significand, bit 63 in the encoding.

The 64 bits of the significand are in the lower bits of the 128-bit integer, and the code in the question sets them to 63696C400A2D2D21₁₆. In this value, bit 63 is 0 (the high digit, 6, is 0110₂). Since the exponent field is 6E65₁₆ (in bits 79 to 64), this should be a normal number, so bit 63 should be 1.
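To make the bit positions concrete, here is a minimal sketch that splits the two 64-bit halves from the question into the fields of the 80-bit format and checks the Integer bit against the rule quoted above. The names X87Fields, decode_x87, and integer_bit_consistent are hypothetical helpers invented for this illustration; they are not part of any library, and the code works on plain integers only, so it does not depend on how long double is implemented on your platform.

```cpp
#include <cstdint>

// Hypothetical helper type: the fields of an x87 80-bit encoding.
struct X87Fields {
  bool sign;               // bit 79
  std::uint16_t exponent;  // bits 78..64 (15-bit biased exponent)
  bool integer_bit;        // bit 63, the explicit leading significand bit
  std::uint64_t fraction;  // bits 62..0
};

// high holds bits 127..64 of the 128-bit integer, low holds bits 63..0.
// Only the low 16 bits of high belong to the 80-bit encoding.
constexpr X87Fields decode_x87(std::uint64_t high, std::uint64_t low) {
  return X87Fields{((high >> 15) & 1) != 0,
                   static_cast<std::uint16_t>(high & 0x7FFF),
                   (low >> 63) != 0,
                   low & 0x7FFFFFFFFFFFFFFFULL};
}

// Rule as quoted from the Intel manual: the Integer bit is 1 when the
// exponent field is nonzero (normals, infinities, NaNs) and 0 when the
// exponent field is zero (zeros, subnormals).
constexpr bool integer_bit_consistent(const X87Fields& f) {
  return f.integer_bit == (f.exponent != 0);
}
```

Calling decode_x87(0x00007ffff7dd6e65ULL, 0x63696c400a2d2d21ULL) yields an exponent field of 0x6E65 with the Integer bit clear, so integer_bit_consistent returns false for the question's value.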

I do not see a specification of the behavior when the Integer bit is improperly set, so we may expect it is not defined. (One might wonder whether this is intentional behavior of isnan, as it is “correctly” reporting that an invalid encoding is not a number.)

When 0x6369… is corrected to 0xE369…, the program correctly reports the value is not a NaN.
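The correction amounts to setting bit 63 of the significand and leaving every other bit alone. A small sketch of that arithmetic follows; set_integer_bit is a hypothetical name chosen for this illustration, and the function operates on the integer encoding only.

```cpp
#include <cstdint>

// Hypothetical helper: force the explicit Integer bit (bit 63) to 1.
// The remaining 63 fraction bits are left unchanged.
constexpr std::uint64_t set_integer_bit(std::uint64_t significand) {
  return significand | (std::uint64_t{1} << 63);
}

// The leading hex digit changes from 6 (0110) to E (1110);
// all other digits stay the same.
static_assert(set_integer_bit(0x63696c400a2d2d21ULL) == 0xE3696c400a2d2d21ULL,
              "setting bit 63 turns 0x6369... into 0xE369...");
```

Substituting the result for the low 64 bits in the question's program gives the corrected encoding described above.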

Footnote

¹ So-called because the significand is commonly represented as b.bbb…bbb, where the leading bit is the only one left of the radix point and hence is the only bit representing an integer value. The remaining bits of the significand are fraction bits.


answered Sep 27 '22 23:09

Eric Postpischil
