Difference between array[i][:] and array[i,:]

Tags:

python

arrays

indexing

numpy

I'm new to Python, so I'm used to writing array[i][j] instead of array[i,j]. Today a script I wrote while following a tutorial did not work until I found out that I was using

numpy.dot(P[0][:], Q[:][0])

instead of

numpy.dot(P[0,:], Q[:,0])

For some reason the second one works, while the first one gives me a shape error. The matrices have dimensions MxK and KxN.

I tried printing both P[0][:] and P[0,:], and checking id(), type() and P[0][:].shape, but couldn't find a reason for it. Why are these two expressions different?

I'm running it on Jupyter Notebook 4.3.0 and Python 2.7.13.

asked May 17 '17 01:05

Lauro

2 Answers

You should almost always use [i, j] instead of [i][j] when dealing with numpy arrays. In many cases there's no real difference, but in your case there is.

Suppose you have an array like this:

>>> import numpy as np
>>> arr = np.arange(16).reshape(4, 4)
>>> arr 
array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15]])

Using [:] on its own just gives you a new view of the whole array. With [1, :] or [:, 1] you get the second row or the second column, respectively. Roughly speaking: the dimension where you put the number is indexed, and the dimension where you put the : is left alone:

>>> arr[:]
array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15]])

>>> arr[:, 1]  #  get the second column
array([ 1,  5,  9, 13])
>>> arr[:][1]  # get a new view of the array, then get the second row
array([4, 5, 6, 7])

arr[:][1] returns a row rather than a column because [1] is interpreted as [1, ...] (... is the Ellipsis object), and for a 2D array that is equivalent to [1, :].

That's also the reason why the row indexing still works (because it's the first dimension):

>>> arr[1, :]  # get the second row
array([4, 5, 6, 7])
>>> arr[1][:]  # get the second row, then get a new view of that row
array([4, 5, 6, 7])
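
To put the cases above side by side, here is a small sketch using the same arr. The variable names (row_a, chained, column and so on) are only for illustration.

```python
import numpy as np

arr = np.arange(16).reshape(4, 4)

# All three spellings select the second row of a 2D array.
row_a = arr[1]
row_b = arr[1, ...]
row_c = arr[1, :]
rows_match = np.array_equal(row_a, row_b) and np.array_equal(row_a, row_c)

# arr[:] is the whole array again, so the following [1] still picks a row.
chained = arr[:][1]
# A single index expression with a comma addresses both dimensions at once.
column = arr[:, 1]

print(rows_match)        # True
print(chained.tolist())  # [4, 5, 6, 7]
print(column.tolist())   # [1, 5, 9, 13]
```

So the only way to reach a column here is the comma form; chaining brackets always starts again from the first dimension of whatever the previous bracket returned.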

answered Nov 08 '22 14:11

MSeifert

x[:] makes a shallow copy of a list, but is virtually useless when x is an array. It makes a new view: same data and shape, but a different array object. If that's confusing, you need to review some basic numpy docs about views and copies.
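
A minimal sketch of that list vs. array difference, in case it helps. The names items, x and x_view are made up for the example.

```python
import numpy as np

# A list slice is a new list: changing it leaves the original untouched.
items = [1, 2, 3]
items_copy = items[:]
items_copy[0] = 99

# An array slice is a view: a different object that shares the same data.
x = np.array([1, 2, 3])
x_view = x[:]
x_view[0] = 99

print(items)                        # [1, 2, 3]
print(x.tolist())                   # [99, 2, 3]
print(x_view is x)                  # False
print(np.shares_memory(x, x_view))  # True
```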

In a 2d indexing expression such as A[0,:] or A[:, 1:5], the : is a kind of placeholder, identifying a dimension that will be used as a whole. The Python interpreter converts : to slice(None,None,None), while start:stop:step produces slice(start, stop, step).
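
One way to see that conversion for yourself is a tiny class that just returns whatever key it receives. ShowKey is a hypothetical helper written for this sketch, not part of numpy.

```python
import numpy as np

class ShowKey(object):
    """Hypothetical helper: returns whatever key Python passes to []."""
    def __getitem__(self, key):
        return key

show = ShowKey()
print(show[:])       # slice(None, None, None)
print(show[0, :])    # (0, slice(None, None, None))
print(show[:, 1:5])  # (slice(None, None, None), slice(1, 5, None))

# The comma form is a single tuple key, so it can also be spelled out by hand.
A = np.arange(16).reshape(4, 4)
same = np.array_equal(A[0, :], A[(0, slice(None))])
print(same)          # True
```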

A[0,:], which can be shortened to A[0], means pick the 1st 'row' of A, and all of its 'columns'. The action generalizes to higher dimensions, where names like row and column have less intuitive meanings.

A[:,0] means pick the 1st column, and all of its rows.

A[0][:] expands to A[0,:][:], and means apply [:] to the result of A[0,:], in effect, just take a view of the 1st row (which is a 1d array).

A[:][0] is not the same as A[:,0]; it's the same as A[0,:]. A[:] is the same as A[:,:], a view of the whole 2d array.

If it helps, I could expand the indexing expressions into calls to A.__getitem__(...). Each set of [] is a separate expansion.
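
A rough sketch of what that expansion looks like for the two expressions that differ. Calling __getitem__ directly is only for illustration; nobody would write it this way in real code.

```python
import numpy as np

A = np.arange(16).reshape(4, 4)

# A[:][0] is two calls: the whole-array slice first, then index 0 of that result.
chained = A.__getitem__(slice(None)).__getitem__(0)

# A[:, 0] is one call that receives a tuple, so both dimensions are indexed together.
single = A.__getitem__((slice(None), 0))

print(chained.tolist())  # [0, 1, 2, 3]
print(single.tolist())   # [0, 4, 8, 12]
```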

In the expression A[:] = ... the [:] is significant, but that's another topic.

These 2 expressions are equivalent, which explains the shape error: P[0,:] has length K, while Q[0,:] has length N.

numpy.dot(P[0][:], Q[:][0])
numpy.dot(P[0,:], Q[0,:])
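
Here is a small reproduction of the error from the question, as a sketch. The sizes M, K, N and the all-ones matrices are made-up values chosen so that K and N differ.

```python
import numpy as np

M, K, N = 3, 2, 4    # hypothetical sizes, chosen so that K != N
P = np.ones((M, K))
Q = np.ones((K, N))

print(P[0][:].shape)  # (2,)  first row of P, length K
print(Q[:][0].shape)  # (4,)  first row of Q, length N
print(Q[:, 0].shape)  # (2,)  first column of Q, length K

# The chained form multiplies a length-K vector with a length-N vector.
try:
    np.dot(P[0][:], Q[:][0])
    chained_ok = True
except ValueError:
    chained_ok = False

# The comma form pairs a row of P with a column of Q, both of length K.
result = np.dot(P[0, :], Q[:, 0])

print(chained_ok)     # False
print(result)         # 2.0
```

Note that if K happened to equal N, the chained version would not raise at all. It would silently compute the dot product of two rows, which is arguably worse than an error.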

answered Nov 08 '22 12:11

hpaulj
