Iterate over numpy with index (numpy equivalent of python enumerate)

Tags:

I'm trying to create a function that will calculate the lattice distance (number of horizontal and vertical steps) between elements in a multi-dimensional numpy array. For this I need to retrieve the actual numbers from the indexes of each element as I iterate through the array. I want to store those values as numbers that I can run through a distance formula.

For the example array A

 A=np.array([[1,2,3],[4,5,6],[7,8,9]])

I'd like to create a loop that iterates through each element and for the first element 1 it would retrieve a=0, b=0 since 1 is at A[0,0], then a=0, b=1 for element 2 as it is located at A[0,1], and so on...

My envisioned output is two numbers (corresponding to the two index values for that element) for each element in the array. So in the example above, it would be the two values that I am assigning to be a and b. I only will need to retrieve these two numbers within the loop (rather than save separately as another data object).

Any thoughts on how to do this would be greatly appreciated!

655

asked Feb 07 '17 05:02

yogz123

3 Answers

You can do it using np.ndenumerate but generally you don't need to iterate over an array.

You can simply create a meshgrid (or open grid) to get all indices at once and you can then process them (vectorized) much faster.

For example

>>> x, y = np.mgrid[slice(A.shape[0]), slice(A.shape[1])]
>>> x
array([[0, 0, 0],
       [1, 1, 1],
       [2, 2, 2]])
>>> y
array([[0, 1, 2],
       [0, 1, 2],
       [0, 1, 2]])

and these can be processed like any other array. So if your function that needs the indices can be vectorized you shouldn't do the manual loop!

For example to calculate the lattice distance for each point to a point say (2, 3):

>>> abs(x - 2) + abs(y - 3)
array([[5, 4, 3],
       [4, 3, 2],
       [3, 2, 1]])

For distances an ogrid would be faster. Just replace np.mgrid with np.ogrid:

>>> x, y = np.ogrid[slice(A.shape[0]), slice(A.shape[1])]
>>> np.hypot(x - 2, y - 3)  # cartesian distance this time! :-)
array([[ 3.60555128,  2.82842712,  2.23606798],
       [ 3.16227766,  2.23606798,  1.41421356],
       [ 3.        ,  2.        ,  1.        ]])

104

answered Oct 31 '22 19:10

MSeifert

As I've become more familiar with the numpy and pandas ecosystem, it's become clearer to me that iteration is usually outright wrong due to how slow it is in comparison, and writing to use a vectorized operation is best whenever possible. Though the style is not as obvious/Pythonic at first, I've (anecdotally) gained ridiculous speedups with vectorized operations; more than 1000x in a case of swapping out a form like some row iteration .apply(lambda)

@MSeifert's answer much better provides this and will be significantly more performant on a dataset of any real size

Original Answer

You can iterate through the values in your array with numpy.ndenumerate to get the indices of the values in your array.

Using the documentation above:

A = np.array([[1,2,3],[4,5,6],[7,8,9]])
for index, values in np.ndenumerate(A):
    print(index, values)  # operate here

answered Oct 31 '22 17:10

ti7

Another possible solution:

import numpy as np

A=np.array([[1,2,3],[4,5,6],[7,8,9]])
for _, val in np.ndenumerate(A):
    ind = np.argwhere(A==val)
    print val, ind

In this case you will obtain the array of indexes if value appears in array not once.

answered Oct 31 '22 17:10

Roman Fursenko

Related questions
                            
                                find the start position of the longest sequence of 1's
                            
                                Why does a Python script to read files cause my computer to emit beeping sounds?
                            
                                Multiple consecutive join with pyspark
                            
                                How can I remove a widget in kivy?
                            
                                NumPy boolean array warning?
                            
                                portable way to write csv file in python 2 or python 3
                            
                                Difference between Python 2 and 3 for shuffle with a given seed
                            
                                Multiple stacked bar plot with pandas
                            
                                How to check if character exists in DataFrame cell
                            
                                pandas convert text feature to numeric value
                            
                                type conversion in python from float to int
                            
                                Problems with updating anaconda and installing new packages
                            
                                Write Python OrderedDict to CSV
                            
                                Python- Why is my Paho Mqtt Message Different Than When I Sent It?
                            
                                Add text next to vertical line in matplotlib
                            
                                Generate random numbers from lognormal distribution in python
                            
                                How to make action logging in Django with Django Rest Framework
                            
                                appending values to dictionary in for loop
                            
                                matplotlib scatterplot with legend
                            
                                numpy savetxt is not adding comma delimiter

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Iterate over numpy with index (numpy equivalent of python enumerate)

Tags:

python

numpy