The type of a floating point literal with exponent

Tags:

c

floating-point

What is the type of a floating-point literal having an exponent part, such as 123456e-3, in C (C99 and later)? Is it of type float or double? When used as a float initializer in float f = 123456e-3; does it need to have an f suffix?


asked Jun 26 '20 14:06

Ron

2 Answers

By default, all floating point literals, with or without an exponent part, have type double. You can add the f suffix to make the type float or L to make the type long double.

In the case of float f = 123456e-3;, you're initializing a float with a double constant, so there is the possibility of loss of precision. However, this particular constant has only 6 significant decimal digits, which is within what a float is guaranteed to preserve (FLT_DIG is at least 6), so it should be OK.


answered Oct 29 '22 03:10

dbush

What is the type of a floating-point literal?

Floating constants

C defines these as floating constants, not literals. Default type is double.
An f or F suffix makes it a float.
An l or L suffix makes it a long double.

[edit] FLT_EVAL_METHOD

C has FLT_EVAL_METHOD which allows constants to be interpreted as a wider type.

Example: FLT_EVAL_METHOD == 2

evaluate all operations and constants to the range and precision of the long double type.

In this case, I'd expect v1 and v2 to have the same value when FLT_EVAL_METHOD == 2, but different values when FLT_EVAL_METHOD == 0.

long double v1 = 0.1;
long double v2 = 0.1L;

When used as a float initializer in float f = 123456e-3; does it need to have an f suffix?

For the best conversion of the text to float, yes, use an f suffix.

float f = 123456e-3 incurs double rounding. Two roundings occur: text to double, then double to float.

With select values, g may get a different value with float g = x.xxx; than with float g = x.xxxf;. See the following.

Double rounding example

Notice that f2 and f4 have the same constant except for the f suffix. The compiler warns about f4:

warning: conversion from 'double' to 'float' changes value from '9.9999997019767761e-1' to '1.0e+0f' [-Wfloat-conversion]

#include <stdio.h>
int main(void) {
  // float has 24 bit significand, double has 53
  float f1 = 0x0.FFFFFFp0f;         // code with 24 bit significand, exact as a float
  printf("%-20a %.17e\n", f1, f1);
  float f2 = 0x0.FFFFFF7FFFFFFCp0f; // code with 54 bit significand, rounds down to nearest float
  printf("%-20a %.17e\n", f2, f2);
  float f3 = 0x0.FFFFFF80000000p0f; // code with 25 bit significand, rounds up to nearest float
  printf("%-20a %.17e\n", f3, f3);
  puts("");
  double d1 = 0x0.FFFFFF7FFFFFF8p0; // code constant with 53 bit significand, exact as a double
  printf("%-20a %.17e\n", d1, d1);
  double d2 = 0x0.FFFFFF7FFFFFFCp0; // code constant with 54 bit significand, rounds up to nearest double
  printf("%-20a %.17e\n", d2, d2);
  float f4 = 0x0.FFFFFF7FFFFFFCp0;  // code constant with 54 bit significand, rounds up to nearest double
                                    // then rounds up again when double converted to float
  printf("%-20a %.17e\n", f4, f4);
  return 0;
}

Output

0x1.fffffep-1        9.99999940395355225e-01
0x1.fffffep-1        9.99999940395355225e-01  f2
0x1p+0               1.00000000000000000e+00

0x1.fffffefffffffp-1 9.99999970197677501e-01
0x1.ffffffp-1        9.99999970197677612e-01
0x1p+0               1.00000000000000000e+00  f4 Double Rounding!

For the best conversion of the text to long double, definitely use an L suffix; otherwise the constant is only a double, with less precision. (The output below is from a platform where long double is wider than double.)

long double ld1 = 0x1.00000000000001p1;
printf("%.20Le\n", ld1);
long double ld2 = 0x1.00000000000001p1L; // "Same" constant as above with an 'L'
printf("%.20Le\n", ld2);

Output

2.00000000000000000000e+00
2.00000000000000002776e+00

answered Oct 29 '22 05:10

chux - Reinstate Monica
