Is round-trip through floating point always defined behavior if floating point range is bigger?

Tags:

Let's say I have two arithmetic types, an integer one, I, and a floating point one, F. I also assume that std::numeric_limits<I>::max() is smaller than std::numeric_limits<F>::max().

Now, let's say I have a positive integer value i. Because the representable range of F is larger than I, F(i) should always be defined behavior.

However, if I have a floating point value f such that f == F(i), is I(f) well defined? In other words, is I(F(i)) always defined behavior?

Relevant section from the C++14 standard:

4.9 Floating-integral conversions [conv.fpint]

A prvalue of a floating point type can be converted to a prvalue of an integer type. The conversion truncates; that is, the fractional part is discarded. The behavior is undefined if the truncated value cannot be represented in the destination type. [ Note: If the destination type is bool, see 4.12. — end note ]

A prvalue of an integer type or of an unscoped enumeration type can be converted to a prvalue of a floating point type. The result is exact if possible. If the value being converted is in the range of values that can be represented but the value cannot be represented exactly, it is an implementation-defined choice of either the next lower or higher representable value. [ Note: Loss of precision occurs if the integral value cannot be represented exactly as a value of the floating type. — end note ] If the value being converted is outside the range of values that can be represented, the behavior is undefined. If the source type is bool, the value false is converted to zero and the value true is converted to one.

796

asked Apr 29 '15 00:04

orlp

1 Answers

However, if I have a floating point value f such that f == F(i), is I(f) well defined? In other words, is I(F(i)) always defined behavior?

No.

Suppose that I is a signed two's complement 32 bit integer type, F is a 32 bit single precision floating point type, and i is the maximum positive integer. This is within the range of the floating point type, but it cannot be represented exactly as a floating point number. Some of those 32 bits are used for the exponent.

Instead, the conversion from integer to floating point is implementation dependent, but typically is done by rounding to the closest representable value. That rounded value is one beyond the range of the integer type. The conversion back to integer fails (better said, it's undefined behavior).

130

answered Sep 24 '22 00:09

David Hammen

Related questions
                            
                                Why do C++ data structures for graphs hide contiguous integer indices?
                            
                                How do you compile WebkitGTK on windows for MinGW
                            
                                Boost variant ambiguous construction [duplicate]
                            
                                When does a static constexpr class member need an out-of-class definition?
                            
                                CMake: How to create a file with make command
                            
                                Concurrent std::call_once calls
                            
                                Resolving conversion warnings with compound assignment operators
                            
                                Forcing inline with a single macro in GCC, Clang and Intel Compiler?
                            
                                Set UTF-8 pathname header in libarchive
                            
                                How to make Visual Studio 2013 show unhandled exception message?
                            
                                Overload between rvalue reference and const lvalue reference in template
                            
                                Copying words from one file to another in cpp
                            
                                Why is std::mutex twice as slow as CRITICAL_SECTION
                            
                                Reference to a partial segment of a vector?
                            
                                Deducing the selected overloaded function type for given argument types
                            
                                C++ A nonstatic member reference must be relative to a specific object
                            
                                C++ equivalent to Python's time.time() in Linux? [duplicate]
                            
                                Default Constructor, Java vs C++
                            
                                Standards compliant way to compare float to integral?
                            
                                Botan SSL/TLS example or starting point

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is round-trip through floating point always defined behavior if floating point range is bigger?

Tags:

c++

language-lawyer

c++14

floating-point-conversion

orlp

People also ask

1 Answers

David Hammen

Recent Activity

Donate For Us