Assuming that <code>uint</code> is the largest integral type on my fixed-point platform, I have: <pre class="prettyprint"><code>uint func(uint a, uint b, uint c); </code></pre> Which needs to return a good approximation of <code>a * b / c</code>. The value of <code>c</code> is greater than both the value of <code>a</code> and the value of <code>b</code>. So we know for sure that the value of <code>a * b / c</code> would fit in a <code>uint</code>. However, the value of <code>a * b</code> itself overflows the size of a <code>uint</code>. So one way to compute the value of <code>a * b / c</code> would be: <pre class="prettyprint"><code>return a / c * b; </code></pre> Or even: <pre class="prettyprint"><code>if (a > b) return a / c * b; return b / c * a; </code></pre> However, the value of <code>c</code> is greater than both the value of <code>a</code> and the value of <code>b</code>. So the suggestion above would simply return zero. I need to reduce <code>a * b</code> and <code>c</code> proportionally, but again - the problem is that <code>a * b</code> overflows. Ideally, I would be able to: <ul> <li>Replace <code>a * b</code> with <code>uint(-1)</code> </li> <li>Replace <code>c</code> with <code>uint(-1) / a / b * c</code>.</li> </ul> But no matter how I order the expression <code>uint(-1) / a / b * c</code>, I encounter a problem: <ul> <li> <code>uint(-1) / a / b * c</code> is truncated to zero because of <code>uint(-1) / a / b</code> </li> <li> <code>uint(-1) / a * c / b</code> overflows because of <code>uint(-1) / a * c</code> </li> <li> <code>uint(-1) * c / a / b</code> overflows because of <code>uint(-1) * c</code> </li> </ul> How can I tackle this scenario in order to find a good approximation of <code>a * b / c</code>? <hr> <h3>Edit 1</h3> I do not have things such as <code>_umul128</code> on my platform, when the largest integral type is <code>uint64</code>. My largest type is <code>uint</code>, and I have no support for anything larger than that (neither on the HW level, nor in some pre-existing standard library). My largest type is <code>uint</code>. <h3>Edit 2</h3> In response to numerous duplicate suggestions and comments: I do not have some "larger type" at hand, which I can use for solving this problem. That is why the opening statement of the question is: <blockquote> Assuming that <code>uint</code> is the largest integral type on my fixed-point platform </blockquote> I am assuming that no other type exists, neither on the SW layer (via some built-in standard library) nor on the HW layer.

<blockquote> needs to return a good approximation of <code>a * b / c</code> My largest type is <code>uint</code> both a and b are smaller than c </blockquote> Variation on this 32-bit problem: <pre class="prettyprint"><code>Algorithm: Scale a, b to not overflow SQRT_MAX_P1 as a compile time constant of sqrt(uint_MAX + 1) sh = 0; if (c >= SQRT_MAX_P1) { while (|a| >= SQRT_MAX_P1) a/=2, sh++ while (|b| >= SQRT_MAX_P1) b/=2, sh++ while (|c| >= SQRT_MAX_P1) c/=2, sh-- } result = a*b/c shift result by sh. </code></pre> With an n-bit <code>uint</code>, I expect the result to be correct to at least about <code>n/2</code> significant digits. Could improve things by taking advantage of the smaller of <code>a,b</code> being less than <code>SQRT_MAX_P1</code>. More on that later if interested. <hr> Example <pre class="prettyprint"><code>#include <inttypes.h> #define IMAX_BITS(m) ((m)/((m)%255+1) / 255%255*8 + 7-86/((m)%255+12)) // https://stackoverflow.com/a/4589384/2410359 #define UINTMAX_WIDTH (IMAX_BITS(UINTMAX_MAX)) #define SQRT_UINTMAX_P1 (((uintmax_t)1ull) << (UINTMAX_WIDTH/2)) uintmax_t muldiv_about(uintmax_t a, uintmax_t b, uintmax_t c) { int shift = 0; if (c > SQRT_UINTMAX_P1) { while (a >= SQRT_UINTMAX_P1) { a /= 2; shift++; } while (b >= SQRT_UINTMAX_P1) { b /= 2; shift++; } while (c >= SQRT_UINTMAX_P1) { c /= 2; shift--; } } uintmax_t r = a * b / c; if (shift > 0) r <<= shift; if (shift < 0) r >>= shift; return r; } #include <stdio.h> int main() { uintmax_t a = 12345678; uintmax_t b = 4235266395; uintmax_t c = 4235266396; uintmax_t r = muldiv_about(a,b,c); printf("%ju\n", r); } </code></pre> Output with 32-bit math (Precise answer is 12345677) <pre class="prettyprint"><code>12345600 </code></pre> Output with 64-bit math <pre class="prettyprint"><code>12345677 </code></pre>

Here is another approach that uses recursion and minimal approximation to achieve high precision. First the code and below an explanation. Code: <pre class="prettyprint"><code>uint32_t bp(uint32_t a) { uint32_t b = 0; while (a!=0) { ++b; a >>= 1; }; return b; } int mul_no_ovf(uint32_t a, uint32_t b) { return ((bp(a) + bp(b)) <= 32); } uint32_t f(uint32_t a, uint32_t b, uint32_t c) { if (mul_no_ovf(a, b)) { return (a*b) / c; } uint32_t m = c / b; ++m; uint32_t x = m*b - c; // So m * b == c + x where x = 2 uint32_t n = a/m; uint32_t r = a % m; // So a*b == n * (c + x) + r*b == n*c + n*x + r*b where r*b < c // Approximation: get rid of the r*b part uint32_t res = n; if (r*b > c/2) ++res; return res + f(n, x, c); } </code></pre> Explanation: <pre class="prettyprint"><code>The multiplication a * b can be written as a sum of b a * b = b + b + .... + b Since b < c we can take a number m of these b so that (m-1)*b < c <= m*b, like (b + b + ... + b) + (b + b + ... + b) + .... + b + b + b \---------------/ \---------------/ + \-------/ m*b + m*b + .... + r*b \-------------------------------------/ n times m*b so we have a*b = n*m*b + r*b where r*b < c and m*b > c. Consequently, m*b is equal to c + x, so we have a*b = n*(c + x) + r*b = n*c + n*x + r*b Divide by c : a*b/c = (n*c + n*x + r*b)/c = n + n*x/c + r*b/c The values m, n, x, r can all be calculated from a, b and c without any loss of precision using integer division (/) and remainder (%). The approximation is to look at r*b (which is less than c) and "add zero" when r*b<=c/2 and "add one" when r*b>c/2. So now there are two possibilities: 1) a*b = n + n*x/c 2) a*b = (n + 1) + n*x/c So the problem (i.e. calculating a*b/c) has been changed to the form MULDIV(a1,b1,c) = NUMBER + MULDIV(a2,b2,c) where a2,b2 is less than a1,b2. Consequently, recursion can be used until a2*b2 no longer overflows (and the calculation can be done directly). </code></pre>

How can I compute a * b / c when both a and b are smaller than c, but a * b overflows?

Tags:

c

integer

integer-overflow

integer-arithmetic

Assuming that uint is the largest integral type on my fixed-point platform, I have:

uint func(uint a, uint b, uint c);

Which needs to return a good approximation of a * b / c.

The value of c is greater than both the value of a and the value of b.

So we know for sure that the value of a * b / c would fit in a uint.

However, the value of a * b itself overflows the size of a uint.

So one way to compute the value of a * b / c would be:

return a / c * b;

Or even:

if (a > b)
    return a / c * b;
return b / c * a;

However, the value of c is greater than both the value of a and the value of b.

So the suggestion above would simply return zero.

I need to reduce a * b and c proportionally, but again - the problem is that a * b overflows.

Ideally, I would be able to:

Replace a * b with uint(-1)
Replace c with uint(-1) / a / b * c.

But no matter how I order the expression uint(-1) / a / b * c, I encounter a problem:

uint(-1) / a / b * c is truncated to zero because of uint(-1) / a / b
uint(-1) / a * c / b overflows because of uint(-1) / a * c
uint(-1) * c / a / b overflows because of uint(-1) * c

How can I tackle this scenario in order to find a good approximation of a * b / c?

Edit 1

I do not have things such as _umul128 on my platform, when the largest integral type is uint64. My largest type is uint, and I have no support for anything larger than that (neither on the HW level, nor in some pre-existing standard library).

My largest type is uint.

Edit 2

In response to numerous duplicate suggestions and comments:

I do not have some "larger type" at hand, which I can use for solving this problem. That is why the opening statement of the question is:

Assuming that uint is the largest integral type on my fixed-point platform

I am assuming that no other type exists, neither on the SW layer (via some built-in standard library) nor on the HW layer.

522

asked Oct 28 '20 07:10

goodvibration

2 Answers

needs to return a good approximation of a * b / c
My largest type is uint
both a and b are smaller than c

Variation on this 32-bit problem:

Algorithm: Scale a, b to not overflow

SQRT_MAX_P1 as a compile time constant of sqrt(uint_MAX + 1)
sh = 0;
if (c >= SQRT_MAX_P1) {
  while (|a| >= SQRT_MAX_P1) a/=2, sh++
  while (|b| >= SQRT_MAX_P1) b/=2, sh++
  while (|c| >= SQRT_MAX_P1) c/=2, sh--
}
result = a*b/c

shift result by sh.

With an n-bit uint, I expect the result to be correct to at least about n/2 significant digits.

Could improve things by taking advantage of the smaller of a,b being less than SQRT_MAX_P1. More on that later if interested.

Example

#include <inttypes.h>

#define IMAX_BITS(m) ((m)/((m)%255+1) / 255%255*8 + 7-86/((m)%255+12))
// https://stackoverflow.com/a/4589384/2410359

#define UINTMAX_WIDTH (IMAX_BITS(UINTMAX_MAX))
#define SQRT_UINTMAX_P1 (((uintmax_t)1ull) << (UINTMAX_WIDTH/2))

uintmax_t muldiv_about(uintmax_t a, uintmax_t b, uintmax_t c) {
  int shift = 0;
  if (c > SQRT_UINTMAX_P1) {
    while (a >= SQRT_UINTMAX_P1) {
      a /= 2; shift++;
    }
    while (b >= SQRT_UINTMAX_P1) {
      b /= 2; shift++;
    }
    while (c >= SQRT_UINTMAX_P1) {
      c /= 2; shift--;
    }
  }
  uintmax_t r = a * b / c;
  if (shift > 0) r <<= shift;
  if (shift < 0) r >>= shift;
  return r;
}



#include <stdio.h>

int main() {
  uintmax_t a = 12345678;
  uintmax_t b = 4235266395;
  uintmax_t c = 4235266396;
  uintmax_t r = muldiv_about(a,b,c);
  printf("%ju\n", r);
}

Output with 32-bit math (Precise answer is 12345677)

12345600

Output with 64-bit math

12345677

156

answered Sep 20 '22 12:09

chux - Reinstate Monica

Here is another approach that uses recursion and minimal approximation to achieve high precision.

First the code and below an explanation.

Code:

uint32_t bp(uint32_t a) {
  uint32_t b = 0;
  while (a!=0)
  {
    ++b;
    a >>= 1;
  };
  return b;
}

int mul_no_ovf(uint32_t a, uint32_t b)
{
  return ((bp(a) + bp(b)) <= 32);
}

uint32_t f(uint32_t a, uint32_t b, uint32_t c)
{
  if (mul_no_ovf(a, b))
  {
    return (a*b) / c;
  }

  uint32_t m = c / b;
  ++m;
  uint32_t x = m*b - c;
  // So m * b == c + x where x < b and m >= 2

  uint32_t n = a/m;
  uint32_t r = a % m;
  // So a*b == n * (c + x) + r*b == n*c + n*x + r*b where r*b < c

  // Approximation: get rid of the r*b part
  uint32_t res = n;
  if (r*b > c/2) ++res;

  return res + f(n, x, c);
}

Explanation:

The multiplication a * b can be written as a sum of b

a * b = b + b + .... + b

Since b < c we can take a number m of these b so that (m-1)*b < c <= m*b, like

(b + b + ... + b) + (b + b + ... + b) + .... + b + b + b
\---------------/   \---------------/ +        \-------/
       m*b        +        m*b        + .... +     r*b
     \-------------------------------------/
            n times m*b

so we have

a*b = n*m*b + r*b

where r*b < c and m*b > c. Consequently, m*b is equal to c + x, so we have

a*b = n*(c + x) + r*b = n*c + n*x + r*b

Divide by c :

a*b/c = (n*c + n*x + r*b)/c = n + n*x/c + r*b/c

The values m, n, x, r can all be calculated from a, b and c without any loss of 
precision using integer division (/) and remainder (%).

The approximation is to look at r*b (which is less than c) and "add zero" when r*b<=c/2
and "add one" when r*b>c/2.

So now there are two possibilities:

1) a*b = n + n*x/c

2) a*b = (n + 1) + n*x/c

So the problem (i.e. calculating a*b/c) has been changed to the form

MULDIV(a1,b1,c) = NUMBER + MULDIV(a2,b2,c)

where a2,b2 is less than a1,b2. Consequently, recursion can be used until 
a2*b2 no longer overflows (and the calculation can be done directly).

answered Sep 19 '22 12:09

Support Ukraine

Related questions
                            
                                ifreq's ifr_names are incorrect?
                            
                                Windows 10: Clang, "stdio.h" not found [duplicate]
                            
                                How to get source code of .so file in android
                            
                                A faster way to get a value based on condition then ternary operator?
                            
                                _InterlockedCompareExchange optimization
                            
                                Can't use pread on a file descriptor for a vfio pci device
                            
                                Visual Studio - Compiling 32-bit code inside 64-bit project
                            
                                An efficient method for calculating log base 2 of a number between 1 and 2
                            
                                Integer powers in C
                            
                                Unable to include ASM header file in C without losing preprocessor
                            
                                Is it possible to attach a callback to be executed on a request completion?
                            
                                android NDK fatal error: stdio.h: No such file or directory #include <stdio.h>
                            
                                clang-analyze: how to avoid "garbage value" warning?
                            
                                Redefining functions lin libC
                            
                                C - Using enum for bit flags - warning: enumerated type mixed with another type
                            
                                Rendering an EGL Image to a frame buffer with GTK
                            
                                Does discarding a value result in reading it?
                            
                                getpgid not implemented with valgrind
                            
                                Expected behaviour of freopen() with regards to buffering (setvbuf())?
                            
                                Is there an easy way to analyse the header files and get a resulting list of all #defines? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With