In numpy, what does indexing an array with the empty tuple vs. ellipsis do?

Tags: python, arrays, indexing, numpy

I just discovered — by chance — that an array in numpy may be indexed by an empty tuple:

In [62]: a = arange(5)

In [63]: a[()]
Out[63]: array([0, 1, 2, 3, 4])

I found some documentation on the numpy wiki ZeroRankArray:

(Sasha) First, whatever choice is made for x[...] and x[()] they should be the same because ... is just syntactic sugar for "as many : as necessary", which in the case of zero rank leads to ... = (:,)*0 = (). Second, rank zero arrays and numpy scalar types are interchangeable within numpy, but numpy scalars can be use in some python constructs where ndarrays can't.

So, for 0-d arrays a[()] and a[...] are supposed to be equivalent. Are they for higher-dimensional arrays, too? They strongly appear to be:

In [65]: a = arange(25).reshape(5, 5)

In [66]: a[()] is a[...]
Out[66]: False

In [67]: (a[()] == a[...]).all()
Out[67]: True

In [68]: a = arange(3**7).reshape((3,)*7)

In [69]: (a[()] == a[...]).all()
Out[69]: True

But it is not just syntactic sugar, neither for a high-dimensional array nor even for a 0-d array:

In [76]: a[()] is a
Out[76]: False

In [77]: a[...] is a
Out[77]: True

In [79]: b = array(0)

In [80]: b[()] is b
Out[80]: False

In [81]: b[...] is b
Out[81]: True

And then there is the case of indexing by an empty list, which does something else altogether, but appears equivalent to indexing with an empty ndarray:

In [78]: a[[]]
Out[78]: array([], shape=(0, 3, 3, 3, 3, 3, 3), dtype=int64)

In [86]: a[arange(0)]
Out[86]: array([], shape=(0, 3, 3, 3, 3, 3, 3), dtype=int64)

In [82]: b[[]]
---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)

IndexError: 0-d arrays can't be indexed.
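
The empty-list case can be reproduced with a short script. This is only a sketch, assuming a reasonably current NumPy imported as np; the variable names are illustrative. An empty list or empty integer array acts as an advanced index that selects zero entries along the first axis, so it needs at least one axis to work on:

```python
import numpy as np

a = np.arange(3**7).reshape((3,) * 7)
b = np.array(0)  # 0-d array

# An empty list (or empty integer array) selects zero entries along axis 0;
# the remaining six axes are kept unchanged.
empty_list_result = a[[]]
empty_array_result = a[np.arange(0)]
print(empty_list_result.shape)   # (0, 3, 3, 3, 3, 3, 3)
print(empty_array_result.shape)  # (0, 3, 3, 3, 3, 3, 3)

# A 0-d array has no axis to select along, so the same index raises IndexError.
try:
    b[[]]
    zero_d_failed = False
except IndexError as exc:
    zero_d_failed = True
    print("IndexError:", exc)
```

The exact wording of the error message differs between NumPy versions, so the sketch only relies on the exception type.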

So it appears that () and ... are similar but not quite identical, and that indexing with [] means something else altogether. Meanwhile, a[] and b[] are SyntaxErrors. Indexing with lists is documented under index arrays, and there is a short note about indexing with tuples at the end of the same document.

That leaves the question:

Is the difference between a[()] and a[...] by design? What is the design, then?

(This question is somewhat reminiscent of: What does the empty `()` do on a Matlab matrix?)

Edit:

In fact, even scalars may be indexed by an empty tuple:

In [36]: numpy.int64(10)[()]
Out[36]: 10
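
A minimal sketch of the scalar case, assuming NumPy imported as np. As far as I can tell, on current NumPy versions the empty tuple applied to a 0-d array also yields a NumPy scalar rather than a 0-d array, which would be consistent with b[()] is b being False above:

```python
import numpy as np

s = np.int64(10)   # NumPy scalar
b = np.array(10)   # 0-d array holding the same value

scalar_result = s[()]
zero_d_result = b[()]

# Print the value and the type that each empty-tuple index returns.
print(scalar_result, type(scalar_result))
print(zero_d_result, type(zero_d_result))
```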

643

asked Feb 04 '13 14:02

gerrit

1 Answer

The treatment of A[...] is a special case, optimised to always return A itself:

if (op == Py_Ellipsis) {
    Py_INCREF(self);
    return (PyObject *)self;
}

Anything else that should be equivalent, e.g. A[:], A[(Ellipsis,)], A[()], or A[(slice(None),) * A.ndim], will instead return a view of the entirety of A, whose base is A:

>>> A[()] is A
False
>>> A[()].base is A
True
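
The view relationship can be checked directly. A small sketch, assuming NumPy imported as np and an array A that owns its data (if A were itself a view, for example the result of reshape, base would point at the array that owns the memory instead):

```python
import numpy as np

A = np.arange(5)   # owns its data
v = A[()]          # empty-tuple index on a 1-d array

print(v is A)                   # False: a new array object
print(v.base is A)              # True: it is a view onto A
print(np.shares_memory(v, A))   # True: same underlying buffer

# Writing through the view is visible in A.
v[0] = 99
print(A[0])                     # 99
```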

This seems an unnecessary and premature optimisation, as A[(Ellipsis,)] and A[()] will always give the same result (an entire view on A). Judging from https://github.com/numpy/numpy/commit/fa547b80f7035da85f66f9cbabc4ff75969d23cd, it was originally required because indexing with ... didn't work properly on 0d arrays (prior to https://github.com/numpy/numpy/commit/4156b241aa3670f923428d4e72577a9962cdf042 it would return the element as a scalar), and was then extended to all arrays for consistency. Since then, indexing has been fixed on 0d arrays, so the optimisation isn't required, but it has managed to stick around vestigially (and there's probably some code that depends on A[...] is A being true).
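
Since the identity of A[...] is an implementation detail that may differ between NumPy versions, code is probably safer testing for shared memory or equal contents than for object identity. A hedged sketch, where describe_index is a hypothetical helper written for this illustration and not part of NumPy:

```python
import numpy as np

def describe_index(arr, index):
    """Report how arr[index] relates to arr (hypothetical helper, not a NumPy API)."""
    result = arr[index]
    return {
        # Object identity: the version-dependent detail discussed above.
        "same_object": result is arr,
        # Whether the result is an array backed by the same buffer as arr.
        "shares_memory": isinstance(result, np.ndarray)
        and np.shares_memory(result, arr),
        # Whether the contents are equal element-wise.
        "equal": bool(np.array_equal(result, arr)),
    }

A = np.arange(6.0)
report_ellipsis = describe_index(A, ...)
report_empty = describe_index(A, ())

print("A[...]:", report_ellipsis)
print("A[()]: ", report_empty)
```

For a 1-d array, both indices should report shared memory and equal contents; only the same_object entry for A[...] depends on the NumPy version in use.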

143

answered Oct 25 '22 11:10

ecatmur
