<p>I need a good explanation (reference) to explain NumPy slicing within (for) loops. I have three cases. </p> <pre class="prettyprint"><code>def example1(array): for row in array: row = row + 1 return array def example2(array): for row in array: row += 1 return array def example3(array): for row in array: row[:] = row + 1 return array </code></pre> <p>A simple case:</p> <pre class="prettyprint"><code>ex1 = np.arange(9).reshape(3, 3) ex2 = ex1.copy() ex3 = ex1.copy() </code></pre> <p>returns:</p> <pre class="prettyprint"><code>>>> example1(ex1) array([[0, 1, 2], [3, 4, 5], [6, 7, 8]]) >>> example2(ex2) array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]) >>> example3(ex3) array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]) </code></pre> <p>It can be seen that the first result differs from the second and third.</p>

<h3>First example:</h3> <p>You extract a row and add 1 to it. Then you <em>redefine</em> the <strong>pointer</strong> <code>row</code> but not what the <code>array</code> contains! So it will not affect the original array.</p> <h3>Second example:</h3> <p>You make an in-place operation - obviously this will affect the original array - as long as it is an array.</p> <p>If you were doing a double loop it wouldn't work anymore:</p> <pre class="prettyprint"><code>def example4(array): for row in array: for column in row: column += 1 return array example4(np.arange(9).reshape(3,3)) array([[0, 1, 2], [3, 4, 5], [6, 7, 8]]) </code></pre> <p>this doesn't work because you don't call <code>np.ndarray</code>'s <code>__iadd__</code> (to modify the data the array points to) but the python <code>int</code>'s <code>__iadd__</code>. So this example only works because your rows are numpy arrays.</p> <h3>Third example:</h3> <p><code>row[:] = row + 1</code> this is interpreted as something like <code>row[0] = row[0]+1, row[1] = row[1]+1, ...</code> again this works in place so this affects the original array.</p> <h3>Bottom Line</h3> <p>If you are operating on mutable objects, like <code>list</code>s or <code>np.ndarray</code> you need to be careful what you change. Such an object only <strong>points</strong> to where the actual data is stored in memory - so changing this <strong>pointer</strong> (<code>example1</code>) doesn't affect the saved data. You need to follow the pointer (either directly by <code>[:]</code> (<code>example3</code>) or indirectly with <code>array.__iadd__</code> (<code>example2</code>)) to change the saved data.</p>

<p>In the first code, you don't do anything with the new computed row; you rebind the name <code>row</code>, and there is no connection to the array anymore. </p> <p>In the second and the third, you dont rebind, but assign values to the old variable. With <code>+=</code> some internal function is called, which varies depending on the type of the object you let it act upon. See links below.</p> <p>If you write <code>row + 1</code> on the right hand side, a new array is computed. In the first case, you tell python to give it the <em>name</em> <code>row</code> (and forget the original object which was called <code>row</code> before). And in the third, the new array is written to the slice of the old <code>row</code>.</p> <p>For further reading follow the link of the comment to the question by @Thiru above. Or read about assignment and rebinding in general... </p>

Slicing a NumPy array within a loop [duplicate]

I need a good explanation (reference) to explain NumPy slicing within (for) loops. I have three cases.

def example1(array):
    for row in array:
        row = row + 1
    return array

def example2(array):
    for row in array:
        row += 1
    return array

def example3(array):
    for row in array:
        row[:] = row + 1
    return array

A simple case:

ex1 = np.arange(9).reshape(3, 3)
ex2 = ex1.copy()
ex3 = ex1.copy()

returns:

>>> example1(ex1)
array([[0, 1, 2],
       [3, 4, 5],
       [6, 7, 8]])

>>> example2(ex2)
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

>>> example3(ex3)
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

It can be seen that the first result differs from the second and third.

How do you repeat an array in NumPy?

NumPy: repeat() function The repeat() function is used to repeat elements of an array. Input array. The number of repetitions for each element. repeats is broadcasted to fit the shape of the given axis.

Can you slice a NumPy array?

You can slice a range of elements from one-dimensional numpy arrays such as the third, fourth and fifth elements, by specifying an index range: [starting_value, ending_value] . Note that the index structure is inclusive of the first index value, but not the second index value.

Is a NumPy array an iterable?

Numpy with PythonIt is an efficient multidimensional iterator object using which it is possible to iterate over an array. Each element of an array is visited using Python's standard Iterator interface.

Is it possible to create an array from a tuple in NumPy?

The asarray() method in NumPy can be used to create an array from data that already exists in the form of lists or tuples.

First example:

You extract a row and add 1 to it. Then you redefine the pointer row but not what the array contains! So it will not affect the original array.

Second example:

You make an in-place operation - obviously this will affect the original array - as long as it is an array.

If you were doing a double loop it wouldn't work anymore:

def example4(array):
    for row in array:
        for column in row:
            column += 1
    return array

example4(np.arange(9).reshape(3,3))
array([[0, 1, 2],
       [3, 4, 5],
       [6, 7, 8]])

this doesn't work because you don't call np.ndarray's __iadd__ (to modify the data the array points to) but the python int's __iadd__. So this example only works because your rows are numpy arrays.

Third example:

row[:] = row + 1 this is interpreted as something like row[0] = row[0]+1, row[1] = row[1]+1, ... again this works in place so this affects the original array.

Bottom Line

If you are operating on mutable objects, like lists or np.ndarray you need to be careful what you change. Such an object only points to where the actual data is stored in memory - so changing this pointer (example1) doesn't affect the saved data. You need to follow the pointer (either directly by [:] (example3) or indirectly with array.__iadd__ (example2)) to change the saved data.

In the first code, you don't do anything with the new computed row; you rebind the name row, and there is no connection to the array anymore.

In the second and the third, you dont rebind, but assign values to the old variable. With += some internal function is called, which varies depending on the type of the object you let it act upon. See links below.

If you write row + 1 on the right hand side, a new array is computed. In the first case, you tell python to give it the name row (and forget the original object which was called row before). And in the third, the new array is written to the slice of the old row.

For further reading follow the link of the comment to the question by @Thiru above. Or read about assignment and rebinding in general...

Slicing a NumPy array within a loop [duplicate]

Tags:

python

arrays

numpy

blaz

People also ask

2 Answers

First example:

Second example:

Third example:

Bottom Line

MSeifert

Ilja

Recent Activity

Donate For Us

Slicing a NumPy array within a loop [duplicate]

Tags:

python

arrays

numpy

blaz

People also ask

2 Answers

First example:

Second example:

Third example:

Bottom Line

MSeifert

Ilja

Related questions

Recent Activity

Donate For Us