Sum the squared difference between 2 Numpy arrays [duplicate]

Tags: python, numpy

Suppose I have the following 2 arrays:

import numpy as np
a=np.asarray([[1,2,4],
       [3,1,2]])
b=np.asarray([[2,1,1],
       [3,2,3],
       [4,1,2],
       [2,2,1],])

For every row a_row in a, I would like to get the sum of squared differences between a_row and every row in b. The resulting array would be 2 by 4. The expected result is the following:

array([[ 11.,   5.,  14.,  10.],
       [  2.,   2.,   1.,   3.]])

I've already implemented a solution using a loop:

c=np.zeros((2,4))
for e in range(a.shape[0]):
    c[e,:] = np.sum(np.square(b-a[e,:]),axis=1)
print(c)

What I need is a fully vectorized solution, i.e. one that requires no loop.

asked Jun 07 '16 19:06

Allen

2 Answers

Here is a Numpythonic approach: insert a new axis into b so that a can be subtracted from it directly through broadcasting:

>>> np.square(b[:,None] - a).sum(axis=2).T
array([[11,  5, 14, 10],
       [ 2,  2,  1,  3]])
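
To see why the reshape works, it can help to trace the shapes step by step. The following is a minimal sketch using the arrays from the question; the intermediate names (diff, via_b, via_a) are only for illustration and are not part of the answer above. It also shows that inserting the new axis into a instead of b should give the 2 by 4 result directly, without the final transpose.

```python
import numpy as np

a = np.asarray([[1, 2, 4],
                [3, 1, 2]])
b = np.asarray([[2, 1, 1],
                [3, 2, 3],
                [4, 1, 2],
                [2, 2, 1]])

# b[:, None] has shape (4, 1, 3); broadcasting against a (2, 3) gives (4, 2, 3)
diff = b[:, None] - a
via_b = np.square(diff).sum(axis=2).T   # (4, 2), transposed to (2, 4)

# Adding the axis to a instead: (2, 1, 3) - (4, 3) -> (2, 4, 3), no transpose
via_a = np.square(a[:, None] - b).sum(axis=2)

print(diff.shape)   # (4, 2, 3)
print(via_a)
```

Note that both variants build an intermediate array with one entry per (row of a, row of b, column) triple, so memory use grows with the product of the three dimensions.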

answered Nov 15 '22 05:11

Mazdak

If you have access to scipy, then you could do:

import scipy
from scipy.spatial.distance import cdist

import numpy as np

a=np.asarray([[1,2,4],
       [3,1,2]])
b=np.asarray([[2,1,1],
       [3,2,3],
       [4,1,2],
       [2,2,1],])

x = cdist(a,b)**2
# print x
# array([[ 11.,   5.,  14.,  10.],
#        [  2.,   2.,   1.,   3.]])

This uses the cdist function, which is vectorized and fast. You might get a bit more speed with numba or cython, but whether that pays off depends on the size of your arrays in practice.
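
As a possible variation, cdist also accepts a metric argument, and the "sqeuclidean" metric returns squared Euclidean distances directly, so the square root followed by squaring can be skipped. The sketch below assumes SciPy is installed and reuses the arrays from the question; the names sq and sq_from_euclid are only illustrative.

```python
import numpy as np
from scipy.spatial.distance import cdist

a = np.asarray([[1, 2, 4],
                [3, 1, 2]])
b = np.asarray([[2, 1, 1],
                [3, 2, 3],
                [4, 1, 2],
                [2, 2, 1]])

# "sqeuclidean" computes the sum of squared differences for each pair of rows
sq = cdist(a, b, metric="sqeuclidean")

# Squaring the default Euclidean distance gives the same values up to rounding
sq_from_euclid = cdist(a, b) ** 2

print(sq.shape)                         # (2, 4)
print(np.allclose(sq, sq_from_euclid))  # True
```

Either way the result is a floating-point array, which matches the float output shown in the question.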

answered Nov 15 '22 04:11

JoshAdel
