I am trying to make some piece of code more efficient by using the vectorized form in numpy. Let me show you an example so you know what I mean.
Given the following code:
a = np.zeros([4,4])
a[0] = [1., 2., 3., 4.]
for i in range(len(a)-1):
    a[i+1] = 2*a[i]
print a
It outputs
[[  1.   2.   3.   4.]
 [  2.   4.   6.   8.]
 [  4.   8.  12.  16.]
 [  8.  16.  24.  32.]]
When I now try to vectorize the code like this:
a = np.zeros([4,4])
a[0] = [1., 2., 3., 4.]
a[1:] = 2*a[0:-1]
print a
I just get the first iteration correct:
[[ 1.  2.  3.  4.]
 [ 2.  4.  6.  8.]
 [ 0.  0.  0.  0.]
 [ 0.  0.  0.  0.]]
Is it possible to write the code above efficiently in a vectorized form (where the next iteration always accesses the previous iteration) or do I have to keep the for loop?
A linear recurrence such as this can be computed using scipy.signal.lfilter:
In [19]: from scipy.signal import lfilter
In [20]: num = np.array([1.0])
In [21]: alpha = 2.0
In [22]: den = np.array([1.0, -alpha])
In [23]: a = np.zeros((4,4))
In [24]: a[0,:] = [1,2,3,4]
In [25]: lfilter(num, den, a, axis=0)
Out[25]: 
array([[  1.,   2.,   3.,   4.],
       [  2.,   4.,   6.,   8.],
       [  4.,   8.,  12.,  16.],
       [  8.,  16.,  24.,  32.]])
See the following for more details: python recursive vectorization with timeseries, Recursive definitions in Pandas
Note that using lfilter really only makes sense if you are solving a nonhomogeneous problem such as x[i+1] = alpha*x[i] + u[i], where u is a given input array.  For the simple recurrence a[i+1] = alpha*a[i], you can use the exact solution a[i] = a[0]*alpha**i.  The solution for multiple initial values can be vectorized using broadcasting.  For example,
In [271]: alpha = 2.0
In [272]: a0 = np.array([1, 2, 3, 4])
In [273]: n = 5
In [274]: a0 * (alpha**np.arange(n).reshape(-1, 1))
Out[274]: 
array([[  1.,   2.,   3.,   4.],
       [  2.,   4.,   6.,   8.],
       [  4.,   8.,  12.,  16.],
       [  8.,  16.,  24.,  32.],
       [ 16.,  32.,  48.,  64.]])
                        Numpy's vector calculations act on the vector, not as a sequence of steps, so you have to vectorize the entire expression. For example:
np.multiply(np.arange(1,5), 2**np.arange(0,4)[np.newaxis].T)
To address the "final" question, yes you have to keep the for loop if you want to do a sequential calculation.  You might make it more efficient with map or [... for ...] but optimizing that way takes a lot of trial and error.  The beauty of thinking in vectorial terms and using Numpy to implement is that you get a result efficiently without all the trial and error.
The cumsum and cumprod functions can do something similar to what you're asking for.  Instead of 2**np.arange(...), you can get the same thing from
np.multiply(np.arange(1,5), np.cumprod([1,2,2,2,])[np.newaxis].T)
                        looking at your result matrix
result = [[ 1, 2,   3, 4,]
          [ 2, 4,   6, 8,]
          [ 4, 8,  12, 16,]
          [ 8, 16, 24, 32,]]
it can be deconstructed into product(elem-wise) of two matrix as
a = [[1, 2, 3, 4],
    [1, 2, 3, 4],
    [1, 2, 3, 4],
    [1, 2, 3, 4]]
b = [[1, 1, 1, 1],
     [2, 2, 2, 2],
     [4, 4, 4, 4]
     [8, 8, 8, 8]]
result = a * b
you can calculate this type of operation using the meshgrid function
aa, bb = np.meshgrid(np.array([1.0, 2.0, 3.0, 4.0]),
                     np.array([1.0, 2.0, 4.0, 8.0]))
result = aa * bb
                        If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With