What is the next normalised floating point number after (before) a normalised floating point number f?

Tags:

floating-point

Given a normalized floating point number f, what is the next normalized floating point number after/before f?

With bit twiddling, extracting the mantissa and exponent, I have:

next_normalized(double&){
    if mantissa is not all ones
        maximally denormalize while maintaining equality
        add 1 to mantissa
        normalize
    else
        check overflow
        set mantissa to 1
        add (mantissa size in bits) to exponent
    endif
}
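
For positive inputs, the mantissa and exponent handling in the pseudocode above can be collapsed into a single integer increment on the raw bit pattern, because IEEE 754 doubles of the same sign are ordered the same way as their bits. The following is only a sketch under that assumption: `next_up_bits` is a hypothetical name, not something from the post, and it deliberately handles only positive normalized values.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>
#include <limits>

// Hypothetical helper (not from the post): next representable double above a
// positive, normalized x, found by incrementing the raw bit pattern.
// Assumes double is IEEE 754 binary64 with the same byte order as uint64_t.
inline double next_up_bits(double x) {
    static_assert(std::numeric_limits<double>::is_iec559, "needs IEEE 754 doubles");
    static_assert(sizeof(double) == sizeof(std::uint64_t), "needs 64-bit doubles");
    assert(std::isnormal(x) && x > 0.0);  // sketch covers positive normals only

    std::uint64_t bits;
    std::memcpy(&bits, &x, sizeof bits);  // well-defined way to read the bits
    ++bits;  // a carry out of the 52 fraction bits bumps the exponent field
    std::memcpy(&x, &bits, sizeof x);
    return x;  // incrementing the largest finite double yields +infinity
}
```

The "mantissa is all ones" branch needs no special case here: the carry from the fraction field into the exponent field does the same job. Negative inputs would need a decrement instead, which this sketch leaves out.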

But rather than do that, can it be done with floating-point operations? As

std::numeric_limits<double>::epsilon()

is only an error difference in a "neighborhood" of 1, e.g.:

normalized(d+=std::numeric_limits<double>::epsilon()) = d for d large
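
The observation above can be checked directly. The sketch below assumes IEEE 754 doubles, the default round-to-nearest mode, and no extended-precision intermediates; `epsilon_is_absorbed` and `ulp_above` are hypothetical helper names introduced only for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical helper: true when adding epsilon leaves d unchanged,
// i.e. epsilon is at most half the gap to the next double above d.
inline bool epsilon_is_absorbed(double d) {
    assert(std::isfinite(d));
    const double eps = std::numeric_limits<double>::epsilon();
    return d + eps == d;
}

// Hypothetical helper: gap between d and the next representable double above it.
inline double ulp_above(double d) {
    assert(std::isfinite(d));
    return std::nextafter(d, std::numeric_limits<double>::infinity()) - d;
}
```

With these, `epsilon_is_absorbed(1.0)` is false while `epsilon_is_absorbed(1e10)` is true, and `ulp_above` doubles each time d crosses a power of two. That is the sense in which epsilon behaves like a ratio rather than a fixed difference.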

It seems to be more of an error ratio than an error difference, so my naive intuition is:

(1.+std::numeric_limits<double>::epsilon())*f //should be the next.

And

(1.-std::numeric_limits<double>::epsilon())*f //should be the previous.

In particular, I have 3 questions. Has anyone done any of the following (for IEEE 754):

1) done the error analysis on this issue?

2) proved (or can prove) that for any normalized double d

    (1.+std::numeric_limits<double>::epsilon())*d != d ?

3) proved that for any normalized double d, no double f exists such that

    d < f < (1.+std::numeric_limits<double>::epsilon())*d ?
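
Questions 2 and 3 can at least be probed empirically by counting how many representable steps the scaled value is away from d. This is a sketch, not a proof, and it assumes IEEE 754 doubles with round-to-nearest-even; `scaled_up`, `scaled_down` and `steps_between` are hypothetical names.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical helpers for testing the epsilon-scaling idea from the question.
inline double scaled_up(double d) {
    return (1.0 + std::numeric_limits<double>::epsilon()) * d;
}
inline double scaled_down(double d) {
    return (1.0 - std::numeric_limits<double>::epsilon()) * d;
}

// Number of nextafter steps needed to walk from lo up to hi.
// 1 means hi is the immediate successor of lo.
inline int steps_between(double lo, double hi) {
    assert(lo <= hi);
    int steps = 0;
    while (lo < hi) {
        lo = std::nextafter(lo, hi);
        ++steps;
    }
    return steps;
}
```

For d = 1.0 the scaled-up value is one step away, but for d = 1.5 the exact product 1.5 + 1.5*epsilon lies halfway between two doubles and rounds to the one two steps up, so a double f with d < f < (1.+epsilon)*d does exist there. Likewise `scaled_down(1.0)` is two steps below 1.0, because the spacing just below 1 is epsilon/2. For question 2, d*epsilon is at least one spacing unit for any normalized d, so the product should never round back to d.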

asked Aug 26 '09 18:08

pgast

1 Answer

I’m not sure what you mean by “normalized double number”, but getting the next representable double number is done with the nextafter() function in most C standard math libraries.
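
In C++ the same function is available as `std::nextafter` in `<cmath>` (since C++11). A minimal sketch of wrapping it for both directions follows; the wrapper names `next_double` and `prev_double` are hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical wrappers around std::nextafter: the second argument only
// gives the direction to step in.
inline double next_double(double d) {
    assert(!std::isnan(d));  // NaN has no neighbours
    return std::nextafter(d, std::numeric_limits<double>::infinity());
}
inline double prev_double(double d) {
    assert(!std::isnan(d));
    return std::nextafter(d, -std::numeric_limits<double>::infinity());
}
```

Note that `std::nextafter` steps through every representable value, so near zero it returns subnormal numbers as well; if only normalized results are wanted, the result can be tested with `std::isnormal`.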

answered Nov 03 '22 00:11

Robert Kern
