
Why are slices in Python 3 still copies and not views?

As I only now noticed after commenting on this answer, slices in Python 3 return shallow copies of whatever they're slicing rather than views. Why is this still the case? Even leaving aside numpy's usage of views rather than copies for slicing, the fact that dict.keys, dict.values, and dict.items all return views in Python 3, and that there are many other aspects of Python 3 geared towards greater use of iterators, makes it seem that there would have been a movement towards slices becoming similar. itertools does have an islice function that makes iterative slices, but that's more limited than normal slicing and does not provide view functionality along the lines of dict.keys or dict.values.
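To make the contrast concrete, here is a minimal sketch (the variable names are illustrative only) of how a dictionary view tracks later changes while a list slice does not:

```python
# Dictionary views are live: they reflect later changes to the dict.
d = {"a": 1, "b": 2}
keys = d.keys()            # a view object, not a list
d["c"] = 3                 # mutate the dict after taking the view
keys_after = list(keys)    # the view sees the new key

# A list slice is an independent shallow copy.
lst = [1, 2, 3, 4, 5]
part = lst[1:4]            # new list holding the same element references
lst[1] = 99                # mutate the original after slicing

print(keys_after)          # ['a', 'b', 'c']
print(part)                # [2, 3, 4]
```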

As well, the fact that you can use assignment to slices to modify the original list, but slices are themselves copies and not views, is a contradictory aspect of the language and seems like it violates several of the principles illustrated in the Zen of Python.

That is, the fact you can do

    >>> a = [1, 2, 3, 4, 5]
    >>> a[::2] = [0, 0, 0]
    >>> a
    [0, 2, 0, 4, 0]

But not

    >>> a = [1, 2, 3, 4, 5]
    >>> a[::2][0] = 0
    >>> a
    [0, 2, 3, 4, 5]   # desired result; real Python leaves a as [1, 2, 3, 4, 5]

or something like

    >>> a = [1, 2, 3, 4, 5]
    >>> b = a[::2]
    >>> b
    view(a[::2] -> [1, 3, 5])
    # numpy doesn't explicitly state that its slices are views, but it would
    # probably be a good idea to do it in some way for regular Python
    >>> b[0] = 0
    >>> b
    view(a[::2] -> [0, 3, 5])
    >>> a
    [0, 2, 3, 4, 5]

Seems somewhat arbitrary/undesirable.
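For reference, this is what current Python 3 actually does in the cases above, as a small runnable sketch (the names `after_assignment`, `after_chained`, and `nested` are just for illustration):

```python
a = [1, 2, 3, 4, 5]
a[::2] = [0, 0, 0]            # slice assignment mutates a in place
after_assignment = list(a)

a = [1, 2, 3, 4, 5]
a[::2][0] = 0                 # modifies a temporary copy, which is then discarded
after_chained = list(a)

# The copy is shallow: the new list shares its element objects with the original.
nested = [[1], [2], [3]]
copy = nested[:]
copy[0].append(10)            # mutates the inner list shared by both
copy[1] = [99]                # rebinds a slot in the copy only

print(after_assignment)       # [0, 2, 0, 4, 0]
print(after_chained)          # [1, 2, 3, 4, 5]
print(nested)                 # [[1, 10], [2], [3]]
```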

I'm aware of http://www.python.org/dev/peps/pep-3099/ and the part where it says "Slices and extended slices won't go away (even if the __getslice__ and __setslice__ APIs may be replaced) nor will they return views for the standard object types.", but the linked discussion provides no mention of why the decision about slicing with views was made; in fact, the majority of the comments on that specific suggestion out of the suggestions listed in the original post seemed to be positive.

What prevented something like this from being implemented in Python 3.0, which was specifically designed to not be strictly backwards-compatible with Python 2.x and thus would have been the best time to implement such a change in design, and is there anything that may prevent it in future versions of Python?

JAB asked Aug 01 '11 17:08




2 Answers

As well, the fact that you can use assignment to slices to modify the original list, but slices are themselves copies and not views.

Hmm, that's not quite right, although I can see how you might think that. In other languages, a slice assignment, something like:

a[b:c] = d 

is equivalent to

    tmp = a.operator[](slice(b, c))  # which returns some sort of reference
    tmp.operator=(d)                 # which has a special meaning for the reference type.

But in python, the first statement is actually converted to this:

a.__setitem__(slice(b, c), d) 

Which is to say that item assignment is specially recognized in Python: it has its own meaning, separate from item lookup followed by assignment, and the two operations may be unrelated. This is consistent with Python as a whole, because Python has no concept like the "lvalues" found in C/C++. There's no way to overload the assignment operator itself, only the specific cases where the left side of the assignment is not a plain identifier.
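One way to see this dispatch for yourself is a small recording class. `Recorder` is a hypothetical helper written for this illustration, not anything from the standard library:

```python
class Recorder:
    """Hypothetical container that records which special method each syntax triggers."""

    def __init__(self):
        self.calls = []

    def __getitem__(self, key):
        self.calls.append(("__getitem__", key))
        return [0, 0]          # an ordinary list, unrelated to the Recorder

    def __setitem__(self, key, value):
        self.calls.append(("__setitem__", key, value))


r = Recorder()
r[1:3] = "d"       # a single call: r.__setitem__(slice(1, 3), "d")
r[1:3][0] = "d"    # r.__getitem__(slice(1, 3)), then __setitem__ on the returned list

for call in r.calls:
    print(call)
```

The second statement never reaches `Recorder.__setitem__`; the assignment lands on whatever object `__getitem__` returned.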

Suppose lists did have views, and you tried to use one:

    myView = myList[1:10]
    yourList = [1, 2, 3, 4]
    myView = yourList

In languages besides Python, there might be a way to shove yourList into myList, but in Python, since the name myView appears as a bare identifier, it can only mean a variable assignment; the view is lost.
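A rough sketch of that scenario follows. `ListView` is a hypothetical class invented for this example, not something Python provides, and it handles integer indices only. It also snapshots the index range when it is created, so it would go stale if the underlying list were resized.

```python
class ListView:
    """Hypothetical write-through view over a slice of a list (integer indices only)."""

    def __init__(self, target, sl):
        self._target = target
        # Snapshot of the positions the slice covers at creation time.
        self._indices = range(*sl.indices(len(target)))

    def __len__(self):
        return len(self._indices)

    def __getitem__(self, i):
        return self._target[self._indices[i]]

    def __setitem__(self, i, value):
        self._target[self._indices[i]] = value


my_list = [1, 2, 3, 4, 5]
my_view = ListView(my_list, slice(None, None, 2))

my_view[0] = 0             # item assignment goes through __setitem__ and writes to my_list
after_item_write = list(my_list)

my_view = [9, 9, 9]        # plain name rebinding: the view object is simply dropped
after_rebinding = list(my_list)

print(after_item_write)    # [0, 2, 3, 4, 5]
print(after_rebinding)     # [0, 2, 3, 4, 5]
```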

SingleNegationElimination answered Sep 28 '22 02:09


Well it seems I found a lot of the reasoning behind the views decision, going by the thread starting with http://mail.python.org/pipermail/python-3000/2006-August/003224.html (it's primarily about slicing strings, but at least one e-mail in the thread mentions mutable objects like lists), and also some things from:

http://mail.python.org/pipermail/python-3000/2007-February/005739.html
http://mail.python.org/pipermail/python-dev/2008-May/079692.html and following e-mails in the thread

Looks like the advantages of switching to this style for base Python would be vastly outweighed by the induced complexity and various undesirable edge cases. Oh well.

...And as I then started wondering about simply replacing the way slices currently work with an iterable form a la itertools.islice, just as zip, map, etc. all return iterables instead of lists in Python 3, I started realizing all the unexpected behavior and possible problems that could come out of that. Looks like this might be a dead end for now.
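A few of the surprises are easy to demonstrate with itertools.islice as it exists today. This is only a sketch of the limitations I mean, with made-up variable names:

```python
from itertools import islice

a = [1, 2, 3, 4, 5]

it = islice(a, 0, None, 2)
first_pass = list(it)       # consumes the iterator
second_pass = list(it)      # nothing left the second time around

try:
    len(islice(a, 0, None, 2))
    has_len = True
except TypeError:           # islice objects have no length
    has_len = False

try:
    islice(a, 0, None, 2)[0]
    indexable = True
except TypeError:           # and cannot be indexed
    indexable = False

try:
    islice(a, -2, None)
    negative_ok = True
except ValueError:          # negative positions are rejected outright
    negative_ok = False

print(first_pass, second_pass)          # [1, 3, 5] []
print(has_len, indexable, negative_ok)  # False False False
```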

On the plus side, numpy's arrays are fairly flexible, so in situations where this sort of thing might be necessary, it wouldn't be too hard to use one-dimensional ndarrays instead of lists. However, it seems ndarrays don't support using slicing to insert additional items within arrays, as happens with Python lists:

    >>> a = [0, 0]
    >>> a[:1] = [2, 3]
    >>> a
    [2, 3, 0]

I think the numpy equivalent would instead be something like this:

    >>> a = np.array([0, 0])  # or a = np.zeros([2]), but that's not important here
    >>> a = np.hstack(([2, 3], a[1:]))
    >>> a
    array([2, 3, 0])

A slightly more complicated case:

    >>> a = [1, 2, 3, 4]
    >>> a[1:3] = [0, 0, 0]
    >>> a
    [1, 0, 0, 0, 4]

versus

    >>> a = np.array([1, 2, 3, 4])
    >>> a = np.hstack((a[:1], [0, 0, 0], a[3:]))
    >>> a
    array([1, 0, 0, 0, 4])

And, of course, the above numpy examples don't store the result in the original array as happens with the regular Python list expansion.
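As an aside that needs no numpy, the built-in memoryview shows the same trade-off on a bytearray. This is just a sketch I'm adding for comparison, not something from the linked threads: slices of a memoryview are real views, but neither the view nor the underlying buffer can change length while the view exists.

```python
buf = bytearray([1, 2, 3, 4, 5])
view = memoryview(buf)[1:4]     # zero-copy view of elements 1..3

view[0] = 0                     # writes through to buf
written = list(buf)

try:
    view[0:1] = bytes([7, 8])   # would change the length of the viewed region
    resized_via_view = True
except ValueError:
    resized_via_view = False

try:
    buf.append(6)               # the buffer is locked against resizing while viewed
    resized_buffer = True
except BufferError:
    resized_buffer = False

print(written)                           # [1, 0, 3, 4, 5]
print(resized_via_view, resized_buffer)  # False False
```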

JAB answered Sep 28 '22 01:09