Finding the most common subarray within a numpy array

Question

Example data:

array(
  [[ 1.,  1.],
   [ 2.,  1.],
   [ 0.,  1.],
   [ 0.,  0.],
   [ 0.,  0.]])

with a desired result of

>>> [0.,0.]

ie) The most common pair.

Approaches that don't seem to work:

Using statistics as numpy arrays are unhashable.

Using scipy.stats.mode as this returns the mode over each axis, eg) for our example it gives

mode=array([[ 0.,  1.]])

piman314 · Accepted Answer

You can do this efficiently with numpy using the unique function:

pairs, counts = np.unique(a, axis=0, return_counts=True)
print(pairs[counts.argmax()])

Returns: [ 0. 0.]

Donate For Us