<pre class="prettyprint"><code>float fv = orginal_value; // original_value may be any float value ... double dv = (double)fv; ... fv = (float)dv; </code></pre> SHOULD fv be equal to original_value exactly? Any precision may be lost?

<blockquote> SHOULD fv be equal to original_value exactly? Any precision may be lost? </blockquote> Yes, if the value of <code>dv</code> did not change in between. From section Conversion 6.3.1.5 Real Floating types in C99 specs: <blockquote> <ol> <li>When a float is promoted to double or long double, or a double is promoted to long double, its value is unchanged.</li> <li>When a double is demoted to float, a long double is demoted to double or float, or a value being represented in greater precision and range than required by its semantic type (see 6.3.1.8) is explicitly converted to its semantic type, if the value being converted can be represented exactly in the new type, it is unchanged. If the value being converted is in the range of values that can be represented but cannot be represented exactly, the result is either the nearest higher or nearest lower representable value, chosen in an implementation-defined manner. If the value being converted is outside the range of values that can be represented, the behavior is undefined</li> </ol> </blockquote> For C++, from section 4.6 aka conv.fpprom (draft used: n337 and I believe similar lines are available in final specs) <blockquote> A prvalue of type float can be converted to a prvalue of type double. The value is unchanged. This conversion is called floating point promotion. </blockquote> And section 4.8 aka conv.double <blockquote> A prvalue of floating point type can be converted to a prvalue of another floating point type. If the source value can be exactly represented in the destination type, the result of the conversion is that exact representation. If the source value is between two adjacent destination values, the result of the conversion is an implementation-defined choice of either of those values. Otherwise, the behavior is undefined. The conversions allowed as floating point promotions are excluded from the set of floating point conversions </blockquote> So the values should be equal exactly.

Precision loss from float to double, and from double to float?

Tags:

c++

c

floating-accuracy

float fv = orginal_value;  // original_value may be any float value
...
double dv = (double)fv;
...
fv = (float)dv;

SHOULD fv be equal to original_value exactly? Any precision may be lost?

545

asked Apr 25 '16 12:04

ravin.wang

1 Answers

SHOULD fv be equal to original_value exactly? Any precision may be lost?

Yes, if the value of dv did not change in between.

From section Conversion 6.3.1.5 Real Floating types in C99 specs:

When a float is promoted to double or long double, or a double is promoted to long double, its value is unchanged.

When a double is demoted to float, a long double is demoted to double or float, or a value being represented in greater precision and range than required by its semantic type (see 6.3.1.8) is explicitly converted to its semantic type, if the value being converted can be represented exactly in the new type, it is unchanged. If the value being converted is in the range of values that can be represented but cannot be represented exactly, the result is either the nearest higher or nearest lower representable value, chosen in an implementation-defined manner. If the value being converted is outside the range of values that can be represented, the behavior is undefined

For C++, from section 4.6 aka conv.fpprom (draft used: n337 and I believe similar lines are available in final specs)

A prvalue of type float can be converted to a prvalue of type double. The value is unchanged. This conversion is called floating point promotion.

And section 4.8 aka conv.double

A prvalue of floating point type can be converted to a prvalue of another floating point type. If the source value can be exactly represented in the destination type, the result of the conversion is that exact representation. If the source value is between two adjacent destination values, the result of the conversion is an implementation-defined choice of either of those values. Otherwise, the behavior is undefined. The conversions allowed as floating point promotions are excluded from the set of floating point conversions

So the values should be equal exactly.

161

answered Sep 18 '22 14:09

Mohit Jain

Related questions
                            
                                What is the sizeof std::array<char, N>? [duplicate]
                            
                                Is it safe to use std::prev(vector.begin()) or std::next(vector.begin(), -1) like some_container.rend() as reversed sentry?
                            
                                C++ cout side-effect sequencing
                            
                                why std::sort() requires static Compare function? [duplicate]
                            
                                Compiler does not deduce template parameters (map std::vector -> std::vector)
                            
                                Is it compiler bug or my bug when using boost::tribool in a conditional?
                            
                                enable conversion operator using SFINAE
                            
                                Creating unordered_set of unordered_set
                            
                                typedef and template parameter with same name
                            
                                Cython/Python/C++ - Inheritance: Passing Derived Class as Argument to Function expecting base class
                            
                                What is (+0)+(-0) by IEEE floating point standard?
                            
                                What is the use for buckets interface in std::unordered_map?
                            
                                Non-static member initialization of char array with brace gives an error in gcc while not in clang
                            
                                Passing std::integer_sequence as template parameter to a meta function
                            
                                implicit conversions from and to class types
                            
                                Wrapping each type in a variadic template in a templated class
                            
                                In c++ 11, how to invoke an arbitrary callable object?
                            
                                Understanding the warning: binding r-value to l-value reference
                            
                                Are close() and closesocket() interchangable?
                            
                                Why does copying a const shared_ptr& not violate const-ness?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With