In numerical computation, it is often needed to scale numbers to be in safe range. For example, computing Euclidean distance: <code>sqrt(a^2+b^2)</code>. Here, if magnitude of <code>a</code> or <code>b</code> is too small/large, then underflow/overflow can happen. A common approach to solve this is to divide numbers by the largest magnitude number. However, this solution is: <ul> <li>slow (division is slow)</li> <li>causes a little extra inaccuracy</li> </ul> So I thought that instead of dividing by the largest magnitude number, let's multiply it with a close power-of-2 reciprocal number. This seems a better solution, as: <ul> <li>multiplication is much faster than division</li> <li>better accuracy, as multiplying with a power-of-2 number is exact</li> </ul> So, I'd like to create a small utility function, which has a logic like this (by <code>^</code>, I mean exponentiation): <pre class="prettyprint"><code>void getScaler(double value, double &scaler, double &scalerReciprocal) { int e = <exponent of value>; if (e<-1022) { scaler=2^-1022; scalerReciprocal = 2^1022; } } else if (e>1022) { scaler=2^1022; scalerReciprocal = 2^-1022; } } else { scaler=2^e; scalerReciprocal = 2^(2046-e); } } </code></pre> This function should return a normalized <code>scaler</code> & <code>scalerReciprocal</code>, both are power-of-2 numbers, where <code>scaler</code> is near to <code>value</code>, and <code>scalerReciprocal</code> is the reciprocal of <code>scaler</code>. The maximum allowed exponents for <code>scaler</code>/<code>scaleReciprocal</code> are <code>-1022..1022</code> (I don't want to work with subnormal <code>scaler</code>, as subnormal numbers can be slow). What would be a fast way to do this? Can this be done with pure floating-point operations? Or should I extract the exponent from <code>value</code>, and use simple <code>if</code>s to do the logic? Is there some kind of trick to do the comparison with (-)1022 fast (as the range is symmetric)? Note: <code>scaler</code> doesn't need to be the closest possible power-of-2. If some logic needs it, <code>scaler</code> can be some small power-of-2 away from the closest value.

Function <code>s = get_scale(z)</code> computes the "close power of 2". Since the fraction bits of <code>s</code> are zero, the inverse of <code>s</code> is just an (inexpensive) integer subtraction: see function <code>inv_of_scale</code>. On x86 <code>get_scale</code> and <code>inv_of_scale</code> compile to quite efficient assembly with clang. Compiler clang translates the ternary operators to <code>minsd</code> and <code>maxsd</code>, see also Peter Cordes' comment. With gcc, it is slightly more efficient to translate these functions to x86 intrinsics code (<code>get_scale_x86</code> and <code>inv_of_scale_x86</code>), see Godbolt. Note that <a href="https://stackoverflow.com/a/11996970">C explicitly permits type-punning through a union, whereas C++ (c++11) has no such permission</a> Although gcc 8.2 and clang 7.0 do not complain about the union, you can improve the C++ portabily by using the <code>memcpy</code> trick instead of the union trick. Such a modification of the code should be trivial. The code should handle subnormals correctly. <pre class="prettyprint"><code>#include<stdio.h> #include<stdint.h> #include<immintrin.h> /* gcc -Wall -m64 -O3 -march=sandybridge dbl_scale.c */ union dbl_int64{ double d; uint64_t i; }; double get_scale(double t){ union dbl_int64 x; union dbl_int64 x_min; union dbl_int64 x_max; uint64_t mask_i; /* 0xFEDCBA9876543210 */ x_min.i = 0x0010000000000000ull; x_max.i = 0x7FD0000000000000ull; mask_i = 0x7FF0000000000000ull; x.d = t; x.i = x.i & mask_i; /* Set fraction bits to zero, take absolute value */ x.d = (x.d < x_min.d) ? x_min.d : x.d; /* If subnormal: set exponent to 1 */ x.d = (x.d > x_max.d) ? x_max.d : x.d; /* If exponent is very large: set exponent to 7FD, otherwise the inverse is a subnormal */ return x.d; } double get_scale_x86(double t){ __m128d x = _mm_set_sd(t); __m128d x_min = _mm_castsi128_pd(_mm_set1_epi64x(0x0010000000000000ull)); __m128d x_max = _mm_castsi128_pd(_mm_set1_epi64x(0x7FD0000000000000ull)); __m128d mask = _mm_castsi128_pd(_mm_set1_epi64x(0x7FF0000000000000ull)); x = _mm_and_pd(x, mask); x = _mm_max_sd(x, x_min); x = _mm_min_sd(x, x_max); return _mm_cvtsd_f64(x); } /* Compute the inverse 1/t of a double t with all zero fraction bits */ /* and exponent between the limits of function get_scale */ /* A single integer subtraction is much less expensive than a */ /* floating point division. */ double inv_of_scale(double t){ union dbl_int64 x; /* 0xFEDCBA9876543210 */ uint64_t inv_mask = 0x7FE0000000000000ull; x.d = t; x.i = inv_mask - x.i; return x.d; } double inv_of_scale_x86(double t){ __m128i inv_mask = _mm_set1_epi64x(0x7FE0000000000000ull); __m128d x = _mm_set_sd(t); __m128i x_i = _mm_sub_epi64(inv_mask, _mm_castpd_si128(x)); return _mm_cvtsd_f64(_mm_castsi128_pd(x_i)); } int main(){ int n = 14; int i; /* Several example values, 4.94e-324 is the smallest subnormal */ double y[14] = { 4.94e-324, 1.1e-320, 1.1e-300, 1.1e-5, 0.7, 1.7, 123.1, 1.1e300, 1.79e308, -1.1e-320, -0.7, -1.7, -123.1, -1.1e307}; double z, s, u; printf("Portable code:\n"); printf(" x pow_of_2 inverse pow2*inv x*inverse \n"); for (i = 0; i < n; i++){ z = y[i]; s = get_scale(z); u = inv_of_scale(s); printf("%14e %14e %14e %14e %14e\n", z, s, u, s*u, z*u); } printf("\nx86 specific SSE code:\n"); printf(" x pow_of_2 inverse pow2*inv x*inverse \n"); for (i = 0; i < n; i++){ z = y[i]; s = get_scale_x86(z); u = inv_of_scale_x86(s); printf("%14e %14e %14e %14e %14e\n", z, s, u, s*u, z*u); } return 0; } </code></pre> The output looks fine: <pre class="prettyprint"><code>Portable code: x pow_of_2 inverse pow2*inv x*inverse 4.940656e-324 2.225074e-308 4.494233e+307 1.000000e+00 2.220446e-16 1.099790e-320 2.225074e-308 4.494233e+307 1.000000e+00 4.942713e-13 1.100000e-300 7.466109e-301 1.339386e+300 1.000000e+00 1.473324e+00 1.100000e-05 7.629395e-06 1.310720e+05 1.000000e+00 1.441792e+00 7.000000e-01 5.000000e-01 2.000000e+00 1.000000e+00 1.400000e+00 1.700000e+00 1.000000e+00 1.000000e+00 1.000000e+00 1.700000e+00 1.231000e+02 6.400000e+01 1.562500e-02 1.000000e+00 1.923437e+00 1.100000e+300 6.696929e+299 1.493222e-300 1.000000e+00 1.642544e+00 1.790000e+308 4.494233e+307 2.225074e-308 1.000000e+00 3.982882e+00 -1.099790e-320 2.225074e-308 4.494233e+307 1.000000e+00 -4.942713e-13 -7.000000e-01 5.000000e-01 2.000000e+00 1.000000e+00 -1.400000e+00 -1.700000e+00 1.000000e+00 1.000000e+00 1.000000e+00 -1.700000e+00 -1.231000e+02 6.400000e+01 1.562500e-02 1.000000e+00 -1.923437e+00 -1.100000e+307 5.617791e+306 1.780059e-307 1.000000e+00 -1.958065e+00 x86 specific SSE code: x pow_of_2 inverse pow2*inv x*inverse 4.940656e-324 2.225074e-308 4.494233e+307 1.000000e+00 2.220446e-16 1.099790e-320 2.225074e-308 4.494233e+307 1.000000e+00 4.942713e-13 1.100000e-300 7.466109e-301 1.339386e+300 1.000000e+00 1.473324e+00 1.100000e-05 7.629395e-06 1.310720e+05 1.000000e+00 1.441792e+00 7.000000e-01 5.000000e-01 2.000000e+00 1.000000e+00 1.400000e+00 1.700000e+00 1.000000e+00 1.000000e+00 1.000000e+00 1.700000e+00 1.231000e+02 6.400000e+01 1.562500e-02 1.000000e+00 1.923437e+00 1.100000e+300 6.696929e+299 1.493222e-300 1.000000e+00 1.642544e+00 1.790000e+308 4.494233e+307 2.225074e-308 1.000000e+00 3.982882e+00 -1.099790e-320 2.225074e-308 4.494233e+307 1.000000e+00 -4.942713e-13 -7.000000e-01 5.000000e-01 2.000000e+00 1.000000e+00 -1.400000e+00 -1.700000e+00 1.000000e+00 1.000000e+00 1.000000e+00 -1.700000e+00 -1.231000e+02 6.400000e+01 1.562500e-02 1.000000e+00 -1.923437e+00 -1.100000e+307 5.617791e+306 1.780059e-307 1.000000e+00 -1.958065e+00 </code></pre> <hr> Vectorization Function <code>get_scale</code> should vectorize with compilers that support auto-vectorization. The following piece of code vectorizes very well with clang (no need to write SSE/AVX intrinsics code). <pre class="prettyprint"><code>/* Test how well get_scale vectorizes: */ void get_scale_vec(double * __restrict__ t, double * __restrict__ x){ int n = 1024; int i; for (i = 0; i < n; i++){ x[i] = get_scale(t[i]); } } </code></pre> Unfortunately gcc doesn't find the <code>vmaxpd</code> and <code>vminpd</code> instructions.

You can use <pre class="prettyprint"><code>double frexp (double x, int* exp); </code></pre> Returned value is the fractional part of of x and exp is the exponent (minus the offset). Alternatively, the following code gets the exponent part of a double. <pre class="prettyprint"><code>int get_exp(double *d) { long long *l = (long long *) d; return ((*l & (0x7ffLL << 52) )>> 52)-1023 ; } </code></pre>

Fast way to get a close power-of-2 number (floating-point)

Tags:

c++

floating-point

x86

ieee-754

In numerical computation, it is often needed to scale numbers to be in safe range.

For example, computing Euclidean distance: sqrt(a^2+b^2). Here, if magnitude of a or b is too small/large, then underflow/overflow can happen.

A common approach to solve this is to divide numbers by the largest magnitude number. However, this solution is:

slow (division is slow)
causes a little extra inaccuracy

So I thought that instead of dividing by the largest magnitude number, let's multiply it with a close power-of-2 reciprocal number. This seems a better solution, as:

multiplication is much faster than division
better accuracy, as multiplying with a power-of-2 number is exact

So, I'd like to create a small utility function, which has a logic like this (by ^, I mean exponentiation):

void getScaler(double value, double &scaler, double &scalerReciprocal) {
    int e = <exponent of value>;
    if (e<-1022) { scaler=2^-1022; scalerReciprocal = 2^1022; }
    } else if (e>1022) { scaler=2^1022; scalerReciprocal = 2^-1022; }
    } else { scaler=2^e; scalerReciprocal = 2^(2046-e); }
}

This function should return a normalized scaler & scalerReciprocal, both are power-of-2 numbers, where scaler is near to value, and scalerReciprocal is the reciprocal of scaler.

The maximum allowed exponents for scaler/scaleReciprocal are -1022..1022 (I don't want to work with subnormal scaler, as subnormal numbers can be slow).

What would be a fast way to do this? Can this be done with pure floating-point operations? Or should I extract the exponent from value, and use simple ifs to do the logic? Is there some kind of trick to do the comparison with (-)1022 fast (as the range is symmetric)?

Note: scaler doesn't need to be the closest possible power-of-2. If some logic needs it, scaler can be some small power-of-2 away from the closest value.

433

asked Jan 21 '19 20:01

geza

3 Answers

Function s = get_scale(z) computes the "close power of 2". Since the fraction bits of s are zero, the inverse of s is just an (inexpensive) integer subtraction: see function inv_of_scale.

On x86 get_scale and inv_of_scale compile to quite efficient assembly with clang. Compiler clang translates the ternary operators to minsd and maxsd, see also Peter Cordes' comment. With gcc, it is slightly more efficient to translate these functions to x86 intrinsics code (get_scale_x86 and inv_of_scale_x86), see Godbolt.

Note that C explicitly permits type-punning through a union, whereas C++ (c++11) has no such permission Although gcc 8.2 and clang 7.0 do not complain about the union, you can improve the C++ portabily by using the memcpy trick instead of the union trick. Such a modification of the code should be trivial. The code should handle subnormals correctly.

#include<stdio.h>
#include<stdint.h>
#include<immintrin.h>
/* gcc -Wall -m64 -O3 -march=sandybridge dbl_scale.c */

union dbl_int64{
    double d;
    uint64_t i;
};

double get_scale(double t){
    union dbl_int64 x;
    union dbl_int64 x_min;
    union dbl_int64 x_max;
    uint64_t mask_i;
           /* 0xFEDCBA9876543210 */
    x_min.i = 0x0010000000000000ull;
    x_max.i = 0x7FD0000000000000ull;
    mask_i =  0x7FF0000000000000ull;
    x.d = t;
    x.i = x.i & mask_i;                    /* Set fraction bits to zero, take absolute value */
    x.d = (x.d < x_min.d) ? x_min.d : x.d; /* If subnormal: set exponent to 1                */
    x.d = (x.d > x_max.d) ? x_max.d : x.d; /* If exponent is very large: set exponent to 7FD, otherwise the inverse is a subnormal */
    return x.d;
}

double get_scale_x86(double t){
    __m128d x = _mm_set_sd(t);
    __m128d x_min = _mm_castsi128_pd(_mm_set1_epi64x(0x0010000000000000ull));
    __m128d x_max = _mm_castsi128_pd(_mm_set1_epi64x(0x7FD0000000000000ull));
    __m128d mask  = _mm_castsi128_pd(_mm_set1_epi64x(0x7FF0000000000000ull));
            x     = _mm_and_pd(x, mask);
            x     = _mm_max_sd(x, x_min);
            x     = _mm_min_sd(x, x_max);
    return _mm_cvtsd_f64(x);
}

/* Compute the inverse 1/t of a double t with all zero fraction bits     */
/* and exponent between the limits of function get_scale                 */
/* A single integer subtraction is much less expensive than a            */
/* floating point division.                                               */
double inv_of_scale(double t){
    union dbl_int64 x;
                     /* 0xFEDCBA9876543210 */
    uint64_t inv_mask = 0x7FE0000000000000ull;
    x.d = t;
    x.i = inv_mask - x.i;
    return x.d;
}

double inv_of_scale_x86(double t){
    __m128i inv_mask = _mm_set1_epi64x(0x7FE0000000000000ull);
    __m128d x        = _mm_set_sd(t);
    __m128i x_i      = _mm_sub_epi64(inv_mask, _mm_castpd_si128(x));
    return _mm_cvtsd_f64(_mm_castsi128_pd(x_i));
}


int main(){
    int n = 14;
    int i;
    /* Several example values, 4.94e-324 is the smallest subnormal */
    double y[14] = { 4.94e-324, 1.1e-320,  1.1e-300,  1.1e-5,  0.7,  1.7,  123.1, 1.1e300,  
                     1.79e308, -1.1e-320,    -0.7, -1.7, -123.1,  -1.1e307};
    double z, s, u;

    printf("Portable code:\n");
    printf("             x       pow_of_2        inverse       pow2*inv      x*inverse \n");
    for (i = 0; i < n; i++){  
        z = y[i];
        s = get_scale(z);
        u = inv_of_scale(s);
        printf("%14e %14e %14e %14e %14e\n", z, s, u, s*u, z*u);
    }

    printf("\nx86 specific SSE code:\n");
    printf("             x       pow_of_2        inverse       pow2*inv      x*inverse \n");
    for (i = 0; i < n; i++){  
        z = y[i];
        s = get_scale_x86(z);
        u = inv_of_scale_x86(s);
        printf("%14e %14e %14e %14e %14e\n", z, s, u, s*u, z*u);
    }

    return 0;
}

The output looks fine:

Portable code:
             x       pow_of_2        inverse       pow2*inv      x*inverse 
 4.940656e-324  2.225074e-308  4.494233e+307   1.000000e+00   2.220446e-16
 1.099790e-320  2.225074e-308  4.494233e+307   1.000000e+00   4.942713e-13
 1.100000e-300  7.466109e-301  1.339386e+300   1.000000e+00   1.473324e+00
  1.100000e-05   7.629395e-06   1.310720e+05   1.000000e+00   1.441792e+00
  7.000000e-01   5.000000e-01   2.000000e+00   1.000000e+00   1.400000e+00
  1.700000e+00   1.000000e+00   1.000000e+00   1.000000e+00   1.700000e+00
  1.231000e+02   6.400000e+01   1.562500e-02   1.000000e+00   1.923437e+00
 1.100000e+300  6.696929e+299  1.493222e-300   1.000000e+00   1.642544e+00
 1.790000e+308  4.494233e+307  2.225074e-308   1.000000e+00   3.982882e+00
-1.099790e-320  2.225074e-308  4.494233e+307   1.000000e+00  -4.942713e-13
 -7.000000e-01   5.000000e-01   2.000000e+00   1.000000e+00  -1.400000e+00
 -1.700000e+00   1.000000e+00   1.000000e+00   1.000000e+00  -1.700000e+00
 -1.231000e+02   6.400000e+01   1.562500e-02   1.000000e+00  -1.923437e+00
-1.100000e+307  5.617791e+306  1.780059e-307   1.000000e+00  -1.958065e+00

x86 specific SSE code:
             x       pow_of_2        inverse       pow2*inv      x*inverse 
 4.940656e-324  2.225074e-308  4.494233e+307   1.000000e+00   2.220446e-16
 1.099790e-320  2.225074e-308  4.494233e+307   1.000000e+00   4.942713e-13
 1.100000e-300  7.466109e-301  1.339386e+300   1.000000e+00   1.473324e+00
  1.100000e-05   7.629395e-06   1.310720e+05   1.000000e+00   1.441792e+00
  7.000000e-01   5.000000e-01   2.000000e+00   1.000000e+00   1.400000e+00
  1.700000e+00   1.000000e+00   1.000000e+00   1.000000e+00   1.700000e+00
  1.231000e+02   6.400000e+01   1.562500e-02   1.000000e+00   1.923437e+00
 1.100000e+300  6.696929e+299  1.493222e-300   1.000000e+00   1.642544e+00
 1.790000e+308  4.494233e+307  2.225074e-308   1.000000e+00   3.982882e+00
-1.099790e-320  2.225074e-308  4.494233e+307   1.000000e+00  -4.942713e-13
 -7.000000e-01   5.000000e-01   2.000000e+00   1.000000e+00  -1.400000e+00
 -1.700000e+00   1.000000e+00   1.000000e+00   1.000000e+00  -1.700000e+00
 -1.231000e+02   6.400000e+01   1.562500e-02   1.000000e+00  -1.923437e+00
-1.100000e+307  5.617791e+306  1.780059e-307   1.000000e+00  -1.958065e+00

Vectorization

Function get_scale should vectorize with compilers that support auto-vectorization. The following piece of code vectorizes very well with clang (no need to write SSE/AVX intrinsics code).

/* Test how well get_scale vectorizes: */
void get_scale_vec(double * __restrict__ t, double * __restrict__ x){
    int n = 1024;
    int i;
    for (i = 0; i < n; i++){
        x[i] = get_scale(t[i]);
    }
}

Unfortunately gcc doesn't find the vmaxpd and vminpd instructions.

137

answered Sep 30 '22 15:09

wim

Based on wim's answer, here's another solution, which can be faster, as it has one less instruction. The output is a little bit different, but still fulfills the requirements.

The idea is to use bit operations to fix border cases: put a 01 to the lsb of the exponent, no matter of its value. So, exponent:

0 becomes 1 (-1023 becomes -1022)
2046 becomes 2045 (1023 becomes 1022)
other exponents modified as well, but just slightly: the number can become two times larger compared to wim's solution (when exponent lsb changes from 00 to 01), or halved (when 10->01) or 1/4 (when 11->01)

So, this modified routine works (and I think that it's pretty cool that the problem can be solved with only 2 fast asm instructions):

#include<stdio.h>
#include<stdint.h>
#include<immintrin.h>
/* gcc -Wall -m64 -O3 -march=sandybridge dbl_scale.c */

union dbl_int64{
    double d;
    uint64_t i;
};

double get_scale(double t){
    union dbl_int64 x;
    uint64_t and_i;
    uint64_t or_i;
         /* 0xFEDCBA9876543210 */
    and_i = 0x7FD0000000000000ull;
    or_i =  0x0010000000000000ull;
    x.d = t;
    x.i = (x.i & and_i)|or_i;                     /* Set fraction bits to zero, take absolute value */
    return x.d;
}

double get_scale_x86(double t){
    __m128d x = _mm_set_sd(t);
    __m128d x_and = _mm_castsi128_pd(_mm_set1_epi64x(0x7FD0000000000000ull));
    __m128d x_or  = _mm_castsi128_pd(_mm_set1_epi64x(0x0010000000000000ull));
            x     = _mm_and_pd(x, x_and);
            x     = _mm_or_pd(x, x_or);
    return _mm_cvtsd_f64(x);
}

/* Compute the inverse 1/t of a double t with all zero fraction bits     */
/* and exponent between the limits of function get_scale                 */
/* A single integer subtraction is much less expensive than a            */
/* floating point division.                                               */
double inv_of_scale(double t){
    union dbl_int64 x;
                     /* 0xFEDCBA9876543210 */
    uint64_t inv_mask = 0x7FE0000000000000ull;
    x.d = t;
    x.i = inv_mask - x.i;
    return x.d;
}

double inv_of_scale_x86(double t){
    __m128i inv_mask = _mm_set1_epi64x(0x7FE0000000000000ull);
    __m128d x        = _mm_set_sd(t);
    __m128i x_i      = _mm_sub_epi64(inv_mask, _mm_castpd_si128(x));
    return _mm_cvtsd_f64(_mm_castsi128_pd(x_i));
}


int main(){
    int n = 14;
    int i;
    /* Several example values, 4.94e-324 is the smallest subnormal */
    double y[14] = { 4.94e-324, 1.1e-320,  1.1e-300,  1.1e-5,  0.7,  1.7,  123.1, 1.1e300,  
                     1.79e308, -1.1e-320,    -0.7, -1.7, -123.1,  -1.1e307};
    double z, s, u;

    printf("Portable code:\n");
    printf("             x       pow_of_2        inverse       pow2*inv      x*inverse \n");
    for (i = 0; i < n; i++){  
        z = y[i];
        s = get_scale(z);
        u = inv_of_scale(s);
        printf("%14e %14e %14e %14e %14e\n", z, s, u, s*u, z*u);
    }

    printf("\nx86 specific SSE code:\n");
    printf("             x       pow_of_2        inverse       pow2*inv      x*inverse \n");
    for (i = 0; i < n; i++){  
        z = y[i];
        s = get_scale_x86(z);
        u = inv_of_scale_x86(s);
        printf("%14e %14e %14e %14e %14e\n", z, s, u, s*u, z*u);
    }

    return 0;
}

answered Sep 30 '22 15:09

geza

You can use

double frexp (double x, int* exp);

Returned value is the fractional part of of x and exp is the exponent (minus the offset).

Alternatively, the following code gets the exponent part of a double.

int get_exp(double *d) {
  long long *l = (long long *) d;
  return ((*l & (0x7ffLL << 52) )>> 52)-1023 ;
}

answered Sep 30 '22 16:09

Alain Merigot

Related questions
                            
                                Why is {} used to access operator() in std::hash?
                            
                                C++ How to properly copy container(vector) of pointers?
                            
                                Can't pass std::min to function, copy of std::min works
                            
                                C++ "size_t" doesn't need "cstddef" header?
                            
                                Is returning a vector slower than passing by reference?
                            
                                C++ async + future (deferred vs async)
                            
                                error C2440: 'initializing': cannot convert from 'initializer list' to 'std::vector<char *,std::allocator<_Ty>>'
                            
                                Can't catch class derived from std::exception by reference to std::exception
                            
                                YUV420 to BGR image from pixel pointers
                            
                                dynamic exception specifications are deprecated
                            
                                Good or bad: Calling destructor in constructor [closed]
                            
                                Read from file and write to cout in one line
                            
                                How to fill STL containers by means of generate_n with index increment
                            
                                Is atomic<T*> always lock free?
                            
                                Given Concepts, are SFINAE helpers still in the spec as non-deprecated?
                            
                                No matching member function for call to 'push_back' error
                            
                                Fastest method to deal with string
                            
                                Brace elision in std::array<std::vector>
                            
                                One template specialization for several enum values
                            
                                C++14 Create vector from variadic templates

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fast way to get a close power-of-2 number (floating-point)

Tags:

c++

floating-point

x86

ieee-754

geza

People also ask

3 Answers

wim

geza

Alain Merigot

Recent Activity

Donate For Us