Finding the row with the highest average in a numpy array

Question

Given the following array:

complete_matrix = numpy.array([
    [0, 1, 2, 4],
    [1, 0, 3, 5],
    [2, 3, 0, 6],
    [4, 5, 6, 0]])

I would like to identify the row with the highest average, excluding the diagonal zeros. So, in this case, I would be able to identify complete_matrix[:,3] as being the row with the highest average.

kjoppy · Accepted Answer

Note that the presence of the zeros doesn't affect which row has the highest mean because all rows have the same number of elements. Therefore, we just take the mean of each row, and then ask for the index of the largest element.

#Take the mean along the 1st index, ie collapse into a Nx1 array of means
means = np.mean(complete_matrix, 1)
#Now just get the index of the largest mean
idx = np.argmax(means)

idx is now the index of the row with the highest mean!

arshajii · Answer

You don't need to worry about the 0s, they shouldn't effect how the averages compare since there will presumably be one in each row. Hence, you can do something like this to get the index of the row with the highest average:

>>> import numpy as np 
>>> complete_matrix = np.array([
...     [0, 1, 2, 4],
...     [1, 0, 3, 5],
...     [2, 3, 0, 6],
...     [4, 5, 6, 0]])
>>> np.argmax(np.mean(complete_matrix, axis=1))
3

Reference:

numpy.mean
numpy.argmax

John Doe · Answer

As pointed out by a lot of people, presence of zeros isn't an issue as long as you have the same number of zeros in each column. Just in case your intention was to ignore all the zeros, preventing them from participating in the average computation, you could use weights to suppress the contribution of the zeros. The following solution assigns 0 weight to zero entries, 1 otherwise:

numpy.argmax(numpy.average(complete_matrix,axis=0, weights=complete_matrix!=0))

You can always create a weight matrix where the weight is 0 for diagonal entries, and 1 otherwise.

numpy.argmax(numpy.average(complete_matrix,axis=0, weights=complete_matrix!=0))

You can always create a weight matrix where the weight is 0 for diagonal entries, and 1 otherwise.

Finding the row with the highest average in a numpy array

Tags:

python

arrays

numpy

Dana Gray

3 Answers

kjoppy

arshajii

John Doe

Recent Activity

Donate For Us

Finding the row with the highest average in a numpy array

Tags:

python

arrays

numpy

Dana Gray

3 Answers

kjoppy

arshajii

John Doe

Related questions

Recent Activity

Donate For Us