I have <code>numpy.array</code>s where the columns contain different data types, and the columns should also to have different functions applied to them. I have the functions in an array as well. Let's say: <pre class="prettyprint"><code>a = array([[ 1, 2.0, "three"], [ 4, 5.0, "six" ]], dtype=object) functions_arr = array([act_on_int, act_on_float, act_on_str]) </code></pre> I can certainly think of ways to do this by dividing the thing, but the one thing that seems most natural to me is to think of it as an elementwise multiplication with broadcasting, and the functions as operators. So I'd like to do something like <pre class="prettyprint"><code>functions_arr*a </code></pre> and get the effect of <pre class="prettyprint"><code>array([[act_on_int(1), act_on_float(2.0), act_on_str("three")], [act_on_int(4), act_on_float(5.0), act_on_str("six") ]]) </code></pre> Do you know of a way to achieve something along those lines? Edit: I changed the definition of the array in the question to include <code>dtype=[object]</code> as people pointed out this is important for the array to store types the way I intended. Thank you for your answers and comments! I have accepted senderles answer and feel this is very close to what I had in mind. Since there seems to have been some confusion about how I consider the operation to be like multiplication, let me clarify that with another example: As you're well aware, an operation like: <pre class="prettyprint"><code>v = array([1,2,3]) u = array([[5,7,11], [13,17,19]]) v*u </code></pre> will broadcast <code>v</code> over the rows of <code>u</code> and yields <pre class="prettyprint"><code>array([[ 1*5, 2*7, 3*11], [1*13, 2*17, 3*19]]) </code></pre> i.e. <pre class="prettyprint"><code>array([[ 5, 14, 33], [13, 34, 57]]) </code></pre> If we now were to replace <code>v</code> with for instance the del operator we would have (the following is not actually working python code:) <pre class="prettyprint"><code>V = array([(d/dx),(d/dy),(d/dz)]) u = array([[5,7,11], [13,17,19]]) V*u </code></pre> yielding (in spirit) <pre class="prettyprint"><code>array([[(d/dx)5, (d/dy)7, (d/dz)11]], [(d/dx)13,(d/dy)17,(d/dz)19]]) </code></pre> I admit taking the derivative of a bunch of constants would not be the most interesting of operations, so feel free to replace <code>u</code> with some symbolic mathematical expression in <code>x</code> ,<code>y</code> and <code>z</code>. At any rate I hope this at least makes more clear both my reasoning and the bit about "(using a python function as an operator?)" in the title.

As Sven Marnach reminded me, the array you've created is probably an array of Python objects. Any operation on them will likely be much slower than pure <code>numpy</code> operations. However, you can do what you've asked pretty easily, as long as you don't actually expect this to be very fast! It's not too different from what AFoglia suggested, but it's closer to being exactly what you asked for: <pre class="prettyprint"><code>>>> a = numpy.array([[ 1, 2.0, "three"], ... [ 4, 5.0, "six" ]], dtype=object) >>> funcs = [lambda x: x + 10, lambda x: x / 2, lambda x: x + '!'] >>> apply_vectorized = numpy.vectorize(lambda f, x: f(x), otypes=[object]) >>> apply_vectorized(funcs, a) array([[11, 1.0, three!], [14, 2.5, six!]], dtype=object) </code></pre> Also echoing AFoglia here, there's a good chance you'd be better off using a record array -- this allows you to divide the array up as you like, and work with it in a more natural way using numpy ufuncs -- which are much faster than Python functions, generally: <pre class="prettyprint"><code>rec.array([(1, 2.0, 'three'), (4, 5.0, 'six')], dtype=[('int', '<i8'), ('float', '<f8'), ('str', '|S10')]) >>> a['int'] array([1, 4]) >>> a['float'] array([ 2., 5.]) >>> a['str'] rec.array(['three', 'six'], dtype='|S10') >>> a['int'] += 10 >>> a['int'] array([11, 14]) </code></pre>

you're looking for built-in function zip() A simple example using <code>lists</code>: <pre class="prettyprint"><code>>>> a=[[ 1, 2.0, "three"],[ 4, 5.0, "six" ]] >>> funcs=[lambda x:x**2,lambda y:y*2,lambda z:z.upper()] >>> [[f(v) for v,f in zip(x,funcs)]for x in a] [[1, 4.0, 'THREE'], [16, 10.0, 'SIX']] </code></pre>

Numpy: Apply an array of functions to a same length 2d-array of value as if multiplying elementwise? (using a python function as an operator?)

Tags:

python

numpy

I have numpy.arrays where the columns contain different data types, and the columns should also to have different functions applied to them. I have the functions in an array as well.

Let's say:

a = array([[ 1, 2.0, "three"],
           [ 4, 5.0, "six"  ]], dtype=object)

functions_arr = array([act_on_int, act_on_float, act_on_str])

I can certainly think of ways to do this by dividing the thing, but the one thing that seems most natural to me is to think of it as an elementwise multiplication with broadcasting, and the functions as operators. So I'd like to do something like

functions_arr*a

and get the effect of

array([[act_on_int(1), act_on_float(2.0), act_on_str("three")],
       [act_on_int(4), act_on_float(5.0), act_on_str("six")  ]])

Do you know of a way to achieve something along those lines?

Edit: I changed the definition of the array in the question to include dtype=[object] as people pointed out this is important for the array to store types the way I intended.

Thank you for your answers and comments! I have accepted senderles answer and feel this is very close to what I had in mind.

Since there seems to have been some confusion about how I consider the operation to be like multiplication, let me clarify that with another example:

As you're well aware, an operation like:

v = array([1,2,3])
u = array([[5,7,11],
           [13,17,19]])
v*u

will broadcast v over the rows of u and yields

array([[ 1*5, 2*7,  3*11],
       [1*13, 2*17, 3*19]])

i.e.

array([[ 5, 14, 33],
       [13, 34, 57]])

If we now were to replace v with for instance the del operator we would have (the following is not actually working python code:)

V = array([(d/dx),(d/dy),(d/dz)])
u = array([[5,7,11],
           [13,17,19]])
V*u

yielding (in spirit)

array([[(d/dx)5, (d/dy)7, (d/dz)11]],
       [(d/dx)13,(d/dy)17,(d/dz)19]])

I admit taking the derivative of a bunch of constants would not be the most interesting of operations, so feel free to replace u with some symbolic mathematical expression in x ,y and z. At any rate I hope this at least makes more clear both my reasoning and the bit about "(using a python function as an operator?)" in the title.

795

asked Jul 05 '12 13:07

mirari

2 Answers

As Sven Marnach reminded me, the array you've created is probably an array of Python objects. Any operation on them will likely be much slower than pure numpy operations. However, you can do what you've asked pretty easily, as long as you don't actually expect this to be very fast! It's not too different from what AFoglia suggested, but it's closer to being exactly what you asked for:

>>> a = numpy.array([[ 1, 2.0, "three"],
...                  [ 4, 5.0, "six"  ]], dtype=object)
>>> funcs = [lambda x: x + 10, lambda x: x / 2, lambda x: x + '!']
>>> apply_vectorized = numpy.vectorize(lambda f, x: f(x), otypes=[object])
>>> apply_vectorized(funcs, a)
array([[11, 1.0, three!],
       [14, 2.5, six!]], dtype=object)

Also echoing AFoglia here, there's a good chance you'd be better off using a record array -- this allows you to divide the array up as you like, and work with it in a more natural way using numpy ufuncs -- which are much faster than Python functions, generally:

rec.array([(1, 2.0, 'three'), (4, 5.0, 'six')], 
      dtype=[('int', '<i8'), ('float', '<f8'), ('str', '|S10')])
>>> a['int']
array([1, 4])
>>> a['float']
array([ 2.,  5.])
>>> a['str']
rec.array(['three', 'six'], 
      dtype='|S10')
>>> a['int'] += 10
>>> a['int']
array([11, 14])

112

answered Oct 03 '22 07:10

senderle

you're looking for built-in function zip()

A simple example using lists:

>>> a=[[ 1, 2.0, "three"],[ 4, 5.0, "six"  ]]

>>> funcs=[lambda x:x**2,lambda y:y*2,lambda z:z.upper()]

>>> [[f(v) for v,f in zip(x,funcs)]for x in a]
[[1, 4.0, 'THREE'], [16, 10.0, 'SIX']]

answered Oct 03 '22 08:10

Ashwini Chaudhary

Related questions
                            
                                Scraping javascript-generated data using Python
                            
                                Create Python EXE without MSVCP90.dll
                            
                                Should memory usage increase when using ElementTree.iterparse() when clear()ing trees?
                            
                                Class property using Python C-API
                            
                                Is a spawned subprocess considered a new dyno on Heroku?
                            
                                Determining what tkinter window is currently on top
                            
                                how to make jenkins run a python script that executes a build?
                            
                                How to get properties of picked object in mplot3d (matplotlib + python)?
                            
                                How to draw a line in Python Mayavi?
                            
                                mod_wsgi error - class.__dict__ not accessible in restricted mode
                            
                                ForeignKey vs OneToOne field django [duplicate]
                            
                                Pydoop on Amazon EMR
                            
                                In which py.test callout can I find both 'item' and 'report' data?
                            
                                When to split code into files/modules? [closed]
                            
                                Sending arguments from Batch file to Python script
                            
                                How do I return 404 when tastypie is interfacing non-ORM sources?
                            
                                Exiting the child process after os.fork()
                            
                                Documentation after members in python (with doxygen)
                            
                                Python multiprocessing: how to limit the number of waiting processes?
                            
                                is there any compiler that can convert regexp to fsm? or could convert to human words?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With