I have two numpy arrays of the same length that contain binary values <pre class="prettyprint"><code>import numpy as np a=np.array([1, 1, 1, 1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 0]) b=np.array([1, 1, 1, 1, 0, 1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0, 1]) </code></pre> I want to compute the hamming distance between them as fast as possible since I have millions of such distance computations to make. A simple but slow option is this (taken from wikipedia): <pre class="prettyprint"><code>%timeit sum(ch1 != ch2 for ch1, ch2 in zip(a, b)) 10000 loops, best of 3: 79 us per loop </code></pre> I have come up with faster options, inspired by some answers here on stack overflow. <pre class="prettyprint"><code>%timeit np.sum(np.bitwise_xor(a,b)) 100000 loops, best of 3: 6.94 us per loop %timeit len(np.bitwise_xor(a,b).nonzero()[0]) 100000 loops, best of 3: 2.43 us per loop </code></pre> I'm wondering if there are even faster ways to compute this, possibly using cython?

Fast hamming distance computation between binary numpy arrays

Tags:

I have two numpy arrays of the same length that contain binary values

import numpy as np
a=np.array([1, 1, 1, 1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 0])
b=np.array([1, 1, 1, 1, 0, 1, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 0, 1, 1, 0, 1])

I want to compute the hamming distance between them as fast as possible since I have millions of such distance computations to make.

A simple but slow option is this (taken from wikipedia):

%timeit sum(ch1 != ch2 for ch1, ch2 in zip(a, b))
10000 loops, best of 3: 79 us per loop

I have come up with faster options, inspired by some answers here on stack overflow.

%timeit np.sum(np.bitwise_xor(a,b))
100000 loops, best of 3: 6.94 us per loop

%timeit len(np.bitwise_xor(a,b).nonzero()[0])
100000 loops, best of 3: 2.43 us per loop

I'm wondering if there are even faster ways to compute this, possibly using cython?

Related questions
                            
                                Improving model training speed in caret (R)
                            
                                How to show a notification everyday at a certain time even when the app is closed?
                            
                                Android application crashes when I start an IntentService
                            
                                AWS - EC2 instances not showing up in console
                            
                                How to initialize a variable of date type in Java?
                            
                                Python: Line that does not start with #
                            
                                spinner dropdown start from top of spinner
                            
                                updating state every x seconds
                            
                                serving gzipped files on Firebase Hosting
                            
                                What is the difference between object of an abstract class and list of objects of abstract class?
                            
                                Undefined variable: errors -- Laravel 5.2
                            
                                How to center an image in navigationBar across all UIViewControllers? Swift / Obj-C

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fast hamming distance computation between binary numpy arrays

Tags:

Related questions

Recent Activity

Donate For Us