Could someone give me an explanation why I get two different numbers, resp. 14 and 15, as an output from the following code? <pre class="prettyprint"><code>#include <stdio.h> int main() { double Vmax = 2.9; double Vmin = 1.4; double step = 0.1; double a =(Vmax-Vmin)/step; int b = (Vmax-Vmin)/step; int c = a; printf("%d %d",b,c); // 14 15, why? return 0; } </code></pre> I expect to get 15 in both cases but it seems I'm missing some fundamentals of the language. I am not sure if it's relevant but I was doing the test in CodeBlocks. However, if I type the same lines of code in some on-line compiler ( this one for example) I get an answer of 15 for the two printed variables.

<blockquote> ... why I get two different numbers ... </blockquote> Aside from the usual float-point issues, the computation paths to <code>b</code> and <code>c</code> are arrived in different ways. <code>c</code> is calculated by first saving the value as <code>double a</code>. <pre class="prettyprint"><code>double a =(Vmax-Vmin)/step; int b = (Vmax-Vmin)/step; int c = a; </code></pre> <hr> C allows intermediate floating-point math to be computed using wider types. Check the value of <code>FLT_EVAL_METHOD</code> from <code><float.h></code>. <blockquote> Except for assignment and cast (which remove all extra range and precision), ... -1 indeterminable; 0 evaluate all operations and constants just to the range and precision of the type; 1 evaluate operations and constants of type <code>float</code> and <code>double</code> to the range and precision of the <code>double</code> type, evaluate <code>long double</code> operations and constants to the range and precision of the <code>long double</code> type; 2 evaluate all operations and constants to the range and precision of the <code>long double</code> type. C11dr §5.2.4.2.2 9 </blockquote> OP reported 2 By saving the quotient in <code>double a = (Vmax-Vmin)/step;</code>, precision is forced to <code>double</code> whereas <code>int b = (Vmax-Vmin)/step;</code> could compute as <code>long double</code>. This subtle difference results from <code>(Vmax-Vmin)/step</code> (computed perhaps as <code>long double</code>) being saved as a <code>double</code> versus remaining a <code>long double</code>. One as 15 (or just above), and the other just under 15. <code>int</code> truncation amplifies this difference to 15 and 14. On another compiler, the results may both have been the same due to <code>FLT_EVAL_METHOD < 2</code> or other floating-point characteristics. <hr> Conversion to <code>int</code> from a floating-point number is severe with numbers near a whole number. Often better to <code>round()</code> or <code>lround()</code>. The best solution is situation dependent.

Nonintuitive result of the assignment of a double precision number to an int variable in C

Tags:

c

type-conversion

floating-point

implicit-conversion

Could someone give me an explanation why I get two different numbers, resp. 14 and 15, as an output from the following code?

#include <stdio.h>    int main() {     double Vmax = 2.9;      double Vmin = 1.4;      double step = 0.1;       double a =(Vmax-Vmin)/step;     int b = (Vmax-Vmin)/step;     int c = a;      printf("%d  %d",b,c);  // 14 15, why?     return 0; }

I expect to get 15 in both cases but it seems I'm missing some fundamentals of the language.

I am not sure if it's relevant but I was doing the test in CodeBlocks. However, if I type the same lines of code in some on-line compiler ( this one for example) I get an answer of 15 for the two printed variables.

254

asked Feb 27 '18 15:02

GeorgiD

1 Answers

... why I get two different numbers ...

Aside from the usual float-point issues, the computation paths to b and c are arrived in different ways. c is calculated by first saving the value as double a.

double a =(Vmax-Vmin)/step; int b = (Vmax-Vmin)/step; int c = a;

C allows intermediate floating-point math to be computed using wider types. Check the value of FLT_EVAL_METHOD from <float.h>.

Except for assignment and cast (which remove all extra range and precision), ...

-1 indeterminable;

0 evaluate all operations and constants just to the range and precision of the type;

1 evaluate operations and constants of type float and double to the range and precision of the double type, evaluate long double operations and constants to the range and precision of the long double type;

2 evaluate all operations and constants to the range and precision of the long double type.

C11dr §5.2.4.2.2 9

OP reported 2

By saving the quotient in double a = (Vmax-Vmin)/step;, precision is forced to double whereas int b = (Vmax-Vmin)/step; could compute as long double.

This subtle difference results from (Vmax-Vmin)/step (computed perhaps as long double) being saved as a double versus remaining a long double. One as 15 (or just above), and the other just under 15. int truncation amplifies this difference to 15 and 14.

On another compiler, the results may both have been the same due to FLT_EVAL_METHOD < 2 or other floating-point characteristics.

Conversion to int from a floating-point number is severe with numbers near a whole number. Often better to round() or lround(). The best solution is situation dependent.

121

answered Oct 18 '22 17:10

chux - Reinstate Monica

Related questions
                            
                                What is -ffreestanding option in gcc?
                            
                                comparing int with size_t
                            
                                What is activation record in the context of C and C++?
                            
                                Purpose of LDA argument in BLAS dgemm?
                            
                                Elegantly call C++ from C
                            
                                Is NULL in C required/defined to be zero?
                            
                                Is there any overhead for using variable-length arrays?
                            
                                difference between <stdlib.h> and <malloc.h>
                            
                                Operation on ... may be undefined?
                            
                                Why is int rather than unsigned int used for C and C++ for loops?
                            
                                Why use array size 1 instead of pointer?
                            
                                Fortran vs C++, does Fortran still hold any advantage in numerical analysis these days? [closed]
                            
                                How to alter a float by its smallest increment (or close to it)?
                            
                                How to printf a memory address in C
                            
                                munmap_chunk(): invalid pointer
                            
                                Double precision - decimal places
                            
                                fatal error: mpi.h: No such file or directory #include <mpi.h>
                            
                                Getting started with Intel x86 SSE SIMD instructions
                            
                                warning: incompatible implicit declaration of built-in function ‘printf’ [enabled by default]
                            
                                Linking against older symbol version in a .so file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With