I chose these numbers randomly, but these results seem to be consistent --- a float exponent is 25%-50% faster than an integer one. How are these handled differently? <pre class="prettyprint"><code>In [209]: %timeit -n 100000 -r 100 np.power(3.71242, 7) 100000 loops, best of 100: 3.45 µs per loop In [210]: %timeit -n 100000 -r 100 np.power(3.71242, 7.0) 100000 loops, best of 100: 1.98 µs per loop </code></pre>

<code>np.power</code> is a universal function (ufunc). These functions can be used on scalars and arrays which have a variety of different datatypes, but must first check the type of input values so that they can determine which internal loop to use to generate suitable output values. If the input types do not map to any of the ufunc's predefined loops, the ufunc will try to cast the input values to suitable types (unless it is told otherwise). This checking and conversion of input values has a performance cost associated with it, explaining the timings observed in the question. The <code>types</code> attribute of a ufunc shows how input datatypes will map to an output datatype. Below is the list of mappings for <code>np.power</code>: <pre class="prettyprint"><code>>>> np.power.types # 'input input -> output' ['bb->b', 'BB->B', 'hh->h', 'HH->H', 'ii->i', 'II->I', 'll->l', 'LL->L', 'qq->q', 'QQ->Q', 'ee->e', 'ff->f', 'dd->d', 'gg->g', 'FF->F', 'DD->D', 'GG->G', 'OO->O'] </code></pre> Floating-point numbers belong to character code <code>'g'</code>, Python integers belong to <code>'l'</code>. A full list of these character codes can be found here. Note that for this ufunc, the datatypes of the two input values must be the same. There is no mapping for a mix of <code>float</code> and <code>int</code> input datatypes, for example. But we can still give <code>np.power</code> different datatypes and let it cast the values to appropriate datatypes. For a <code>float</code> and an <code>int</code>, a <code>float64</code> number is returned: <pre class="prettyprint"><code>>>> np.power(3.71242, 7).dtype dtype('float64') </code></pre> Above you can see that the only input which maps to the <code>float64</code> character code <code>g</code> is two other <code>g</code> values: <code>'gg->g'</code>. So, behind the scenes, <code>np.power(3.71242, 7)</code> took a Python <code>float</code> and a Python <code>int</code> and had to decide which it could safely recast and to what type. The <code>int</code> value was safely promoted to a float type <code>g</code>. The ufunc then knew which loop to run and returned another <code>g</code> value. For this reason, not mixing input datatypes results in better performance for <code>np.power</code>.

Why is numpy.power slower for integer exponents?

Tags:

performance

python

types

numpy

exponentiation

I chose these numbers randomly, but these results seem to be consistent --- a float exponent is 25%-50% faster than an integer one. How are these handled differently?

In [209]: %timeit -n 100000 -r 100 np.power(3.71242, 7)
100000 loops, best of 100: 3.45 µs per loop

In [210]: %timeit -n 100000 -r 100 np.power(3.71242, 7.0)
100000 loops, best of 100: 1.98 µs per loop

911

asked Nov 06 '14 02:11

DilithiumMatrix

1 Answers

np.power is a universal function (ufunc). These functions can be used on scalars and arrays which have a variety of different datatypes, but must first check the type of input values so that they can determine which internal loop to use to generate suitable output values.

If the input types do not map to any of the ufunc's predefined loops, the ufunc will try to cast the input values to suitable types (unless it is told otherwise). This checking and conversion of input values has a performance cost associated with it, explaining the timings observed in the question.

The types attribute of a ufunc shows how input datatypes will map to an output datatype. Below is the list of mappings for np.power:

>>> np.power.types # 'input input -> output'
['bb->b', 'BB->B', 'hh->h', 'HH->H', 'ii->i', 'II->I', 'll->l', 'LL->L', 'qq->q', 
 'QQ->Q', 'ee->e', 'ff->f', 'dd->d', 'gg->g', 'FF->F', 'DD->D', 'GG->G', 'OO->O']

Floating-point numbers belong to character code 'g', Python integers belong to 'l'. A full list of these character codes can be found here.

Note that for this ufunc, the datatypes of the two input values must be the same. There is no mapping for a mix of float and int input datatypes, for example.

But we can still give np.power different datatypes and let it cast the values to appropriate datatypes. For a float and an int, a float64 number is returned:

>>> np.power(3.71242, 7).dtype
dtype('float64')

Above you can see that the only input which maps to the float64 character code g is two other g values: 'gg->g'.

So, behind the scenes, np.power(3.71242, 7) took a Python float and a Python int and had to decide which it could safely recast and to what type. The int value was safely promoted to a float type g. The ufunc then knew which loop to run and returned another g value.

For this reason, not mixing input datatypes results in better performance for np.power.

148

answered Oct 09 '22 22:10

Alex Riley

Related questions
                            
                                Pandas drop duplicates if reverse is present between two columns
                            
                                How to verify an element contains ANY text?
                            
                                Pandas: Find the maximum range in all the columns of dataframe
                            
                                Django Rest Framework won't let me have more than one permission
                            
                                How to colorize the output of Python errors in the Gnome terminal?
                            
                                handle all exception in scrapy with sentry
                            
                                Converting a dictionary with lists for values into a dataframe
                            
                                python logging: is it possible to add module name to formatter
                            
                                How to avoid race condition with unique checks in Django
                            
                                Why won't this django-rest-swagger API documentation display/work properly?
                            
                                Python Pandas custom time format in Excel output
                            
                                Where to get sphinxcontrib.autohttp.flask?
                            
                                Slice pandas dataframe in groups of consecutive values
                            
                                lxml - get a flat list of elements
                            
                                Alembic - sqlalchemy initial migration
                            
                                Flask, cannot assign requested address [duplicate]
                            
                                Adding a column of zeroes to a csr_matrix
                            
                                Decrease array size by averaging adjacent values with numpy
                            
                                PuLP very slow when adding many constraints
                            
                                Convert XML to dictionary in Python using lxml

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With