I have this function to calculate squared Mahalanobis distance of vector x to mean: <pre class="prettyprint"><code>def mahalanobis_sqdist(x, mean, Sigma): ''' Calculates squared Mahalanobis Distance of vector x to distibutions' mean ''' Sigma_inv = np.linalg.inv(Sigma) xdiff = x - mean sqmdist = np.dot(np.dot(xdiff, Sigma_inv), xdiff) return sqmdist </code></pre> I have an numpy array that has a shape of <code>(25, 4)</code>. So, I want to apply that function to all 25 rows of my array without a for loop. So, basically, how can I write the vectorized form of this loop: <pre class="prettyprint"><code>for r in d1: mahalanobis_sqdist(r[0:4], mean1, Sig1) </code></pre> where <code>mean1</code> and <code>Sig1</code> are : <pre class="prettyprint"><code>>>> mean1 array([ 5.028, 3.48 , 1.46 , 0.248]) >>> Sig1 = np.cov(d1[0:25, 0:4].T) >>> Sig1 array([[ 0.16043333, 0.11808333, 0.02408333, 0.01943333], [ 0.11808333, 0.13583333, 0.00625 , 0.02225 ], [ 0.02408333, 0.00625 , 0.03916667, 0.00658333], [ 0.01943333, 0.02225 , 0.00658333, 0.01093333]]) </code></pre> I have tried the following but it didn't work: <pre class="prettyprint"><code>>>> vecdist = np.vectorize(mahalanobis_sqdist) >>> vecdist(d1, mean1, Sig1) Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/usr/lib/python2.7/dist-packages/numpy/lib/function_base.py", line 1862, in __call__ theout = self.thefunc(*newargs) File "<stdin>", line 6, in mahalanobis_sqdist File "/usr/lib/python2.7/dist-packages/numpy/linalg/linalg.py", line 445, in inv return wrap(solve(a, identity(a.shape[0], dtype=a.dtype))) IndexError: tuple index out of range </code></pre>

The answer by @unutbu works very nicely for applying any function to the rows of an array. In this particular case, there are some mathematical symmetries you can use that will speed things up considerably if you are working with large arrays. Here is a modified version of your function: <pre class="prettyprint"><code>def mahalanobis_sqdist3(x, mean, Sigma): Sigma_inv = np.linalg.inv(Sigma) xdiff = x - mean return (xdiff.dot(Sigma_inv)*xdiff).sum(axis=-1) </code></pre> If you end up using any sort of large <code>Sigma</code>, I would recommend that you cache <code>Sigma_inv</code> and pass that in as an argument to your function instead. Since it is 4x4 in this example, this doesn't matter. I'll show how to deal with large <code>Sigma</code> anyway for anyone else who comes across this. If you aren't going to be using the same <code>Sigma</code> repeatedly, you won't be able to cache it, so, instead of inverting the matrix, you could use a different method to solve the linear system. Here I'll use the LU decomposition built in to SciPy. This only improves the time if the number of columns of <code>x</code> is large relative to its number of rows. Here is a function that shows that approach: <pre class="prettyprint"><code>from scipy.linalg import lu_factor, lu_solve def mahalanobis_sqdist4(x, mean, Sigma): xdiff = x - mean Sigma_inv = lu_factor(Sigma) return (xdiff.T*lu_solve(Sigma_inv, xdiff.T)).sum(axis=0) </code></pre> Here are some timings. I'll include the version with <code>einsum</code> as mentioned in the other answer. <pre class="prettyprint"><code>import numpy as np Sig1 = np.array([[ 0.16043333, 0.11808333, 0.02408333, 0.01943333], [ 0.11808333, 0.13583333, 0.00625 , 0.02225 ], [ 0.02408333, 0.00625 , 0.03916667, 0.00658333], [ 0.01943333, 0.02225 , 0.00658333, 0.01093333]]) mean1 = np.array([ 5.028, 3.48 , 1.46 , 0.248]) x = np.random.rand(25, 4) %timeit np.apply_along_axis(mahalanobis_sqdist, 1, x, mean1, Sig1) %timeit mahalanobis_sqdist2(x, mean1, Sig1) %timeit mahalanobis_sqdist3(x, mean1, Sig1) %timeit mahalanobis_sqdist4(x, mean1, Sig1) </code></pre> giving: <pre class="prettyprint"><code>1000 loops, best of 3: 973 µs per loop 10000 loops, best of 3: 36.2 µs per loop 10000 loops, best of 3: 40.8 µs per loop 10000 loops, best of 3: 83.2 µs per loop </code></pre> However, changing the sizes of the arrays involved changes the timing results. For example, letting <code>x = np.random.rand(2500, 4)</code>, the timings are: <pre class="prettyprint"><code>10 loops, best of 3: 95 ms per loop 1000 loops, best of 3: 355 µs per loop 10000 loops, best of 3: 131 µs per loop 1000 loops, best of 3: 337 µs per loop </code></pre> And letting <code>x = np.random.rand(1000, 1000)</code>, <code>Sigma1 = np.random.rand(1000, 1000)</code>, and <code>mean1 = np.random.rand(1000)</code>, the timings are: <pre class="prettyprint"><code>1 loops, best of 3: 1min 24s per loop 1 loops, best of 3: 2.39 s per loop 10 loops, best of 3: 155 ms per loop 10 loops, best of 3: 99.9 ms per loop </code></pre> Edit: I noticed that one of the other answers used the Cholesky decomposition. Given that <code>Sigma</code> is symmetric and positive definite, we can actually do better than my above results. There are some good routines from BLAS and LAPACK available through SciPy that can work with symmetric positive-definite matrices. Here are two faster versions. <pre class="prettyprint"><code>from scipy.linalg.fblas import dsymm def mahalanobis_sqdist5(x, mean, Sigma_inv): xdiff = x - mean Sigma_inv = la.inv(Sigma) return np.einsum('...i,...i->...',dsymm(1., Sigma_inv, xdiff.T).T, xdiff) from scipy.linalg.flapack import dposv def mahalanobis_sqdist6(x, mean, Sigma): xdiff = x - mean return np.einsum('...i,...i->...', xdiff, dposv(Sigma, xdiff.T)[1].T) </code></pre> The first one still inverts Sigma. If you pre-compute the inverse and reuse it, it is much faster (the 1000x1000 case takes 35.6ms on my machine with the pre-computed inverse). I also used einsum to take the product then sum along the last axis. This ended up being marginally faster than doing something like <code>(A * B).sum(axis=-1)</code>. These two functions give the following timings: First test case: <pre class="prettyprint"><code>10000 loops, best of 3: 55.3 µs per loop 100000 loops, best of 3: 14.2 µs per loop </code></pre> Second test case: <pre class="prettyprint"><code>10000 loops, best of 3: 121 µs per loop 10000 loops, best of 3: 79 µs per loop </code></pre> Third test case: <pre class="prettyprint"><code>10 loops, best of 3: 92.5 ms per loop 10 loops, best of 3: 48.2 ms per loop </code></pre>

Apply a function to each row of a ndarray

Tags:

python

arrays

vectorization

numpy

I have this function to calculate squared Mahalanobis distance of vector x to mean:

Apply a function to each row of a ndarray

Tags:

python

arrays

vectorization

numpy

Vahid Mirjalili

People also ask

2 Answers

IanH

unutbu

Related questions

Recent Activity

Donate For Us