As the title says, suppose I want to write a sign function (let's forget sign(0) for now), obviously we expect sign(2) = 1 and sign(array([-2,-2,2])) = array([-1,-1,1]). The following function won't work however, because it can't handle numpy arrays. <pre class="prettyprint"><code>def sign(x): if x>0: return 1 else: return -1 </code></pre> The next function won't work either since x doesn't have a shape member if it's just a single number. Even if some trick like y = x*0 + 1 is used, y won't have a [] method. <pre class="prettyprint"><code>def sign(x): y = ones(x.shape) y[x<0] = -1 return y </code></pre> Even with the idea from another question(how can I make a numpy function that accepts a numpy array, an iterable, or a scalar?), the next function won't work when x is a single number because in this case x.shape and y.shape are just () and indexing y is illegal. <pre class="prettyprint"><code>def sign(x): x = asarray(x) y = ones(x.shape) y[x<0] = -1 return y </code></pre> The only solution seems to be that first decide if x is an array or a number, but I want to know if there is something better. Writing branchy code would be cumbersome if you have lots of small functions like this.

<code>np.vectorize</code> can be used to achieve that, but would be slow because all it does, when your decorated function is called with an array, is looping through the array elements and apply the scalar function to each, i.e. not leveraging numpy's speed. A method I find useful for vectorizing functions involving if-else is using <code>np.choose</code>: <pre class="prettyprint"><code>def sign_non_zero(x): return np.choose( x > 0, # bool values, used as indices to the array [ -1, # index=0=False, i.e. x<=0 1, # index=1=True, i.e. x>0 ]) </code></pre> This works when <code>x</code> is either scalar or an array, and is faster than looping in python-space. The only disadvantage of using <code>np.choose</code> is that it is not intuitive to write if-else logic in that manner, and the code is less readable. Whenver I use it, I include comments like the ones above, to make it easier on the reader to understand what is going on.

A python function that accepts as an argument either a scalar or a numpy array

Tags:

python

arrays

function

numpy

As the title says, suppose I want to write a sign function (let's forget sign(0) for now), obviously we expect sign(2) = 1 and sign(array([-2,-2,2])) = array([-1,-1,1]). The following function won't work however, because it can't handle numpy arrays.

def sign(x):
    if x>0: return 1
    else: return -1

The next function won't work either since x doesn't have a shape member if it's just a single number. Even if some trick like y = x*0 + 1 is used, y won't have a [] method.

def sign(x):
    y = ones(x.shape)
    y[x<0] = -1
    return y

Even with the idea from another question(how can I make a numpy function that accepts a numpy array, an iterable, or a scalar?), the next function won't work when x is a single number because in this case x.shape and y.shape are just () and indexing y is illegal.

def sign(x):
    x = asarray(x)
    y = ones(x.shape)
    y[x<0] = -1
    return y

The only solution seems to be that first decide if x is an array or a number, but I want to know if there is something better. Writing branchy code would be cumbersome if you have lots of small functions like this.

490

asked Oct 24 '14 06:10

Taozi

1 Answers

np.vectorize can be used to achieve that, but would be slow because all it does, when your decorated function is called with an array, is looping through the array elements and apply the scalar function to each, i.e. not leveraging numpy's speed.

A method I find useful for vectorizing functions involving if-else is using np.choose:

def sign_non_zero(x):
    return np.choose(
        x > 0,  # bool values, used as indices to the array
        [
            -1, # index=0=False, i.e. x<=0
            1,  # index=1=True, i.e. x>0
        ])

This works when x is either scalar or an array, and is faster than looping in python-space.

The only disadvantage of using np.choose is that it is not intuitive to write if-else logic in that manner, and the code is less readable. Whenver I use it, I include comments like the ones above, to make it easier on the reader to understand what is going on.

138

answered Sep 20 '22 16:09

shx2

Related questions
                            
                                GeoDjango distance filter with distance value stored within model - query
                            
                                Reading/writing to a Popen() subprocess
                            
                                Shared memory between python processes
                            
                                concat pandas DataFrame along timeseries indexes
                            
                                How do I test a module that depends on boto and an Amazon AWS service?
                            
                                How do I compute the variance of a column of a sparse matrix in Scipy?
                            
                                file name vs file object as a function argument
                            
                                python pip: no distributions at all found for an existing package
                            
                                Python error when importing image_to_string from tesseract
                            
                                Matplotlib: make final figure dimensions match figsize with savefig() and bbox_extra_artists
                            
                                Faster alternative to Python's zipfile module?
                            
                                Permission denied doing os.mkdir(d) after running shutil.rmtree(d) in Python
                            
                                WTForms RadioField default values
                            
                                Does something like CanCan (authorization library) exist for flask and python
                            
                                tab complete dictionary keys in ipython
                            
                                Why is copying a list using a slice[:] faster than using the obvious way?
                            
                                Django: ValueError: Lookup failed for model referenced by field account.UserProfile.user: auth.User
                            
                                gdb pretty printing with python a recursive structure
                            
                                How to prevent Exception ignored in: <module 'threading' from ... > while setting signal handler?
                            
                                How to detect if python script is being run as a background process

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With