I want to round big double number (>1e6) to the closest but bigger float using c/c++. I tried this but I'm not sure it is always correct and there is maybe a fastest way to do that : <pre class="prettyprint"><code>int main() { // x is the double we want to round double x = 100000000005.0; double y = log10(x) - 7.0; float a = pow(10.0, y); float b = (float)x; //c the closest round up float float c = a + b; printf("%.12f %.12f %.12f\n", c, b, x); return 0; } </code></pre> Thank you.

If you use c99, you can use the nextafterf function. <pre class="prettyprint"><code>#include <stdio.h> #include <math.h> #include <float.h> int main(){ // x is the double we want to round double x=100000000005.0; float c = x; if ((double)c <= x) c = nextafterf(c, FLT_MAX); //c the closest round up float printf("%.12f %.12f\n",c,x); return 0; } </code></pre>

C has a nice <code>nextafter</code> function which will help here; <pre class="prettyprint"><code>float toBiggerFloat( const double a ) { const float test = (float) a; return ((double) test < a) ? nextafterf( test, INFINITY ) : test; } </code></pre> Here's a test script which shows it on all classes of number (positive/negative, normal/subnormal, infinite, nan, -0): http://codepad.org/BQ3aqbae (it works fine on anything is the result)

Round a double to the closest and greater float

Tags:

c++

c

floating-point

rounding

I want to round big double number (>1e6) to the closest but bigger float using c/c++. I tried this but I'm not sure it is always correct and there is maybe a fastest way to do that :

int main() {
    // x is the double we want to round
    double x = 100000000005.0;
    double y = log10(x) - 7.0;
    float a = pow(10.0, y);
    float b = (float)x;

    //c the closest round up float
    float c = a + b;
    printf("%.12f %.12f %.12f\n", c, b, x);
    return 0;
}

Thank you.

455

asked Mar 08 '13 12:03

user1482030

3 Answers

Simply assigning a double to float and back should tell, if the float is larger. If it's not, one should simply increment the float by one unit. (for positive floats). If this doesn't still produce expected result, then the double is larger than supported by a float, in which case float should be assigned to Inf.

float next(double a) {
    float b=a;
    if ((double)b > a) return b;
    return std::nextafter(b, std::numeric_limits<float>::infinity());
}

[Hack] C-version of next_after (on selected architectures would be)

float next_after(float a) {
    *(int*)&a += a < 0 ? -1 : 1;
    return a;
}

Better way to do it is:

float next_after(float a) {
   union { float a; int b; } c = { .a = a };
   c.b += a < 0 ? -1 : 1;
   return c.a;
}

Both of these self-made hacks ignore Infs and NaNs (and work on non-negative floats only). The math is based on the fact, that the binary representations of floats are ordered. To get to next representable float, one simply increments the binary representation by one.

159

answered Oct 08 '22 16:10

Aki Suihkonen

If you use c99, you can use the nextafterf function.

#include <stdio.h>
#include <math.h>
#include <float.h>

int main(){
  // x is the double we want to round
  double x=100000000005.0;

  float c = x;

  if ((double)c <= x)
    c = nextafterf(c, FLT_MAX);

  //c the closest round up float
  printf("%.12f %.12f\n",c,x);
  return 0;
}

answered Oct 08 '22 16:10

Mankka

C has a nice nextafter function which will help here;

float toBiggerFloat( const double a ) {
    const float test = (float) a;
    return ((double) test < a) ? nextafterf( test, INFINITY ) : test;
}

Here's a test script which shows it on all classes of number (positive/negative, normal/subnormal, infinite, nan, -0): http://codepad.org/BQ3aqbae (it works fine on anything is the result)

answered Oct 08 '22 16:10

Dave

Related questions
                            
                                Comparing two TCHAR's with same value results false
                            
                                Can list initialization not be used for private members?
                            
                                Zero a 2d array in C++. Do I need 2 for loops?
                            
                                Using a make file string variable in CPP file
                            
                                CUDA performance improves when running more threads than there are cores
                            
                                boost::spirit::qi duplicate parsing on the output
                            
                                Access violation. when using GLEW and GLFW
                            
                                error: ambiguates old declaration ‘double round(double)’
                            
                                DllImport decorated name issue - Unable to find entry point
                            
                                integer indexed with a string in c++ [duplicate]
                            
                                Code runs perfect in g++ but not in Xcode - Cannot find File
                            
                                Error building simple Qt5 application
                            
                                What is the Delphi equivalent to C++ reference parameters?
                            
                                Whole screen capture and render in DirectX [PERFORMANCE]
                            
                                Increase compile-time variable with every instantiation of a generic class
                            
                                deleted default constructor headache
                            
                                Why std::sort doesn't accept Compare classes declared within a function
                            
                                Does returning a dynamically-allocated array from a function cause a memory leak?
                            
                                How I can pass callable object to function as parameter
                            
                                What is the difference between these two ways to compare STL vectors?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Round a double to the closest and greater float

Tags:

c++

c

floating-point

rounding

user1482030

People also ask

3 Answers

Aki Suihkonen

Mankka

Dave

Recent Activity

Donate For Us