I am currently learning D from the book 'Programming in D'. I tried to solve the problem of summing up the squares of the numbers from 1 to 10000000. I first took a functional approach with map and reduce, but as the numbers get bigger I have to cast them to BigInt to get the correct output.
import std.algorithm, std.bigint, std.conv, std.range, std.stdio;

long num = 10000001;
BigInt result;
result = iota(1, num).map!(a => to!BigInt(a * a)).reduce!((a, b) => (a + b));
writeln("The sum is : ", result);
The above takes 7 seconds to finish when compiled with dmd -O. I profiled the program, and most of the time is wasted in the BigInt calls. Although the square of each number fits into a long, they have to be cast to BigInt so that reduce sums them and returns the appropriate total. The Python program takes only 3 seconds to finish. When num = 100000000, the D program takes 1 minute 13 seconds to finish. Is there a way to optimize the calls to BigInt? The products themselves fit in a long, but they have to be cast to BigInt objects so that reduce gives the right results. I tried pushing the squares of the numbers into a BigInt array, but that is also slower. I also tried casting all the numbers to BigInt up front:
auto bigs_map_nums = iota(1,num).map!(a => to!BigInt(a)).array;
auto bigs_map = sum(bigs_map_nums.map!(a => (a * a)).array);
But that is also slower. I read the answers at How to optimize this short factorial function in scala? (Creating 50000 BigInts). Is it a problem with the implementation of multiplication for bigger integers in D too? Is there a way to optimize the function calls to BigInt?
Python code:
timeit.timeit('print sum(map(lambda num : num * num, range(1,10000000)))',number=1)
333333283333335000000
3.58552622795105
The code was executed on a dual-core 64-bit Linux laptop with 2 GB RAM. Python: 2.7.4; dmd: DMD64 D Compiler v2.066.1.
BigInt represents numbers with arbitrary precision, meaning it uses as much space as needed to store and represent large numbers, instead of forcefully trying to represent them using a fixed amount of memory the way the Number integer type does. You can think of Number and BigInt as being like static and dynamic arrays.
The BigInt global function provides two static methods that restrict a BigInt representation to a number of bits, specified as the first parameter of both methods. Once the BigInt is within the specified space limit, it is returned as either a signed or an unsigned integer, depending on the method used. The first method, BigInt.asIntN(bits, <bigInt-number>), returns the <bigInt-number> as a signed integer; the second, BigInt.asUintN(bits, <bigInt-number>), returns it as an unsigned integer.
"From the research, BigInteger operations are memory intensive and hence slow. Hence, the processing was divided into BigInteger processing and long processing. A check was made at first to see if the number is long, and the processing was divided based on the number.
Without range coolness: foreach(x; 0 .. num) result += x * x;
With range cool(?)ness:
import std.functional: reverseArgs;
result = iota(1, num)
.map!(a => a * a)
.reverseArgs!(reduce!((a, b) => a + b))(BigInt(0) /* seed */);
The key is to avoid BigInting every element, of course.
The range version is a little slower than the non-range one. Both are significantly faster than the Python version.
Edit: Oh! Oh! It can be made much more pleasant with std.algorithm.sum:
result = iota(1, num)
.map!(a => a * a)
.sum(BigInt(0));
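For reference, here is the snippet above as a complete program, with what I assume are the needed Phobos imports:

import std.algorithm : map, sum;
import std.bigint : BigInt;
import std.range : iota;
import std.stdio : writeln;

void main()
{
    long num = 10_000_001;
    BigInt result = iota(1, num)
        .map!(a => a * a)    // squares computed in native long arithmetic
        .sum(BigInt(0));     // only the additions touch BigInt
    writeln("The sum is : ", result);
}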
The Python code is not equivalent to the D code; in fact, it does a lot less.
Python uses an int, then promotes it to long() when the result is bigger than what can be stored in an int() type. Internally, CPython represents an int() with a C long, which is at least 32 bits (small integers up to 256 are additionally cached). Up until that overflow, normal CPU instructions can be used for the multiplication, and they are quite a bit faster than bigint multiplication.
D's BigInt implementation treats the numbers as BigInts from the start, so it uses the expensive multiplication operation from 1 until the end. Much more work to be done there.
It's interesting how complicated the multiplication can be when we talk about BigInts.
The D implementation is at
https://github.com/D-Programming-Language/phobos/blob/v2.066.1/std/internal/math/biguintcore.d#L1246
Python starts by doing:
static PyObject *
int_mul(PyObject *v, PyObject *w)
{
    long a, b;
    long longprod;           /* a*b in native long arithmetic */
    double doubled_longprod; /* (double)longprod */
    double doubleprod;       /* (double)a * (double)b */

    CONVERT_TO_LONG(v, a);
    CONVERT_TO_LONG(w, b);
    /* casts in the next line avoid undefined behaviour on overflow */
    longprod = (long)((unsigned long)a * b);
    ... /* check if we have overflowed */
    {
        const double diff = doubled_longprod - doubleprod;
        const double absdiff = diff >= 0.0 ? diff : -diff;
        const double absprod = doubleprod >= 0.0 ? doubleprod :
                               -doubleprod;
        /* absdiff/absprod <= 1/32 iff
           32 * absdiff <= absprod -- 5 good bits is "close enough" */
        if (32.0 * absdiff <= absprod)
            return PyInt_FromLong(longprod);
        else
            return PyLong_Type.tp_as_number->nb_multiply(v, w);
    }
}
and if the number is bigger than what a long can hold, it does a Karatsuba multiplication (which splits the operands in half and replaces the four half-width multiplications of schoolbook multiplication with three, for O(n^1.585) instead of O(n^2)). The implementation is in
http://svn.python.org/projects/python/trunk/Objects/longobject.c (the k_mul function)
The equivalent D code would wait to use BigInts until there is no native data type that can hold the number in question.
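As a sketch of what that could look like in D (core.checkedint is a real druntime module; the function name and the fallback logic here are my own assumptions, not tested against the benchmark): do the arithmetic in long and only drop into BigInt once an operation actually overflows.

import core.checkedint : adds, muls;
import std.bigint : BigInt;

BigInt sumOfSquaresNativeFirst(long n)
{
    long acc = 0;
    foreach (x; 1 .. n)
    {
        bool overflow = false;
        immutable sq  = muls(x, x, overflow);    // checked x * x
        immutable nxt = adds(acc, sq, overflow); // checked acc + sq
        if (overflow)
        {
            // Native long arithmetic no longer suffices: finish in BigInt.
            BigInt big = acc;
            foreach (y; x .. n)
                big += BigInt(y) * y;
            return big;
        }
        acc = nxt;
    }
    return BigInt(acc);
}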