So I just came across what seems to me like a strange Python feature and wanted some clarification about it. The following array manipulation somewhat makes sense: <pre class="prettyprint"><code>p = [1,2,3] p[3:] = [4] p = [1,2,3,4] </code></pre> I imagine it is actually just appending this value to the end, correct? Why can I do this, however? <pre class="prettyprint"><code>p[20:22] = [5,6] p = [1,2,3,4,5,6] </code></pre> And even more so this: <pre class="prettyprint"><code>p[20:100] = [7,8] p = [1,2,3,4,5,6,7,8] </code></pre> This just seems like wrong logic. It seems like this should throw an error! Any explanation? -Is it just a weird thing Python does? -Is there a purpose to it? -Or am I thinking about this the wrong way?

<h3>Part of question regarding out-of-range indices</h3> Slice logic automatically clips the indices to the length of the sequence. Allowing slice indices to extend past end points was done for convenience. It would be a pain to have to range check every expression and then adjust the limits manually, so Python does it for you. Consider the use case of wanting to display no more than the first 50 characters of a text message. The easy way (what Python does now): <pre class="prettyprint"><code>preview = msg[:50] </code></pre> Or the hard way (do the limit checks yourself): <pre class="prettyprint"><code>n = len(msg) preview = msg[:50] if n > 50 else msg </code></pre> Manually implementing that logic for adjustment of end points would be easy to forget, would be easy to get wrong (updating the 50 in two places), would be wordy, and would be slow. Python moves that logic to its internals where it is succint, automatic, fast, and correct. This is one of the reasons I love Python :-) <h3>Part of question regarding assignments length mismatch from input length</h3> The OP also wanted to know the rationale for allowing assignments such as <code>p[20:100] = [7,8]</code> where the assignment target has a different length (80) than the replacement data length (2). It's easiest to see the motivation by an analogy with strings. Consider, <code>"five little monkeys".replace("little", "humongous")</code>. Note that the target "little" has only six letters and "humongous" has nine. We can do the same with lists: <pre class="prettyprint"><code>>>> s = list("five little monkeys") >>> i = s.index('l') >>> n = len('little') >>> s[i : i+n ] = list("humongous") >>> ''.join(s) 'five humongous monkeys' </code></pre> This all comes down to convenience. Prior to the introduction of the copy() and clear() methods, these used to be popular idioms: <pre class="prettyprint"><code>s[:] = [] # clear a list t = u[:] # copy a list </code></pre> Even now, we use this to update lists when filtering: <pre class="prettyprint"><code>s[:] = [x for x in s if not math.isnan(x)] # filter-out NaN values </code></pre> Hope these practical examples give a good perspective on why slicing works as it does.

The documentation has your answer: <blockquote> <code>s[i:j]</code>: slice of <code>s</code> from <code>i</code> to <code>j</code> (note (4)) (4) The slice of <code>s</code> from <code>i</code> to <code>j</code> is defined as the sequence of items with index <code>k</code> such that <code>i <= k < j</code>. If <code>i</code> or <code>j</code> is greater than <code>len(s)</code>, use <code>len(s)</code>. If <code>i</code> is omitted or <code>None</code>, use <code>0</code>. If <code>j</code> is omitted or <code>None</code>, use <code>len(s)</code>. If <code>i</code> is greater than or equal to <code>j</code>, the slice is empty. </blockquote> The documentation of <code>IndexError</code> confirms this behavior: <blockquote> exception <code>IndexError</code> Raised when a sequence subscript is out of range. (Slice indices are silently truncated to fall in the allowed range; if an index is not an integer, <code>TypeError</code> is raised.) </blockquote> Essentially, stuff like <code>p[20:100]</code> is being reduced to <code>p[len(p):len(p]</code>. <code>p[len(p):len(p]</code> is an empty slice at the end of the list, and assigning a list to it will modify the end of the list to contain said list. Thus, it works like appending/extending the original list. This behavior is the same as what happens when you assign a list to an empty slice anywhere in the original list. For example: <pre class="prettyprint"><code>In [1]: p = [1, 2, 3, 4] In [2]: p[2:2] = [42, 42, 42] In [3]: p Out[3]: [1, 2, 42, 42, 42, 3, 4] </code></pre>

Why does Python allow out-of-range slice indexes for sequences?

Tags:

python

slice

python-3.x

sequence

range-checking

So I just came across what seems to me like a strange Python feature and wanted some clarification about it.

The following array manipulation somewhat makes sense:

Click to copy

p = [1,2,3] p[3:] = [4]  p = [1,2,3,4]

I imagine it is actually just appending this value to the end, correct?
Why can I do this, however?

Click to copy

p[20:22] = [5,6] p = [1,2,3,4,5,6]

And even more so this:

Click to copy

p[20:100] = [7,8] p = [1,2,3,4,5,6,7,8]

This just seems like wrong logic. It seems like this should throw an error!

Any explanation?
-Is it just a weird thing Python does?
-Is there a purpose to it?
-Or am I thinking about this the wrong way?

409

asked Feb 10 '19 05:02

Akaisteph7

2 Answers

Part of question regarding out-of-range indices

Slice logic automatically clips the indices to the length of the sequence.

Allowing slice indices to extend past end points was done for convenience. It would be a pain to have to range check every expression and then adjust the limits manually, so Python does it for you.

Consider the use case of wanting to display no more than the first 50 characters of a text message.

The easy way (what Python does now):

Click to copy

preview = msg[:50]

Or the hard way (do the limit checks yourself):

Click to copy

n = len(msg) preview = msg[:50] if n > 50 else msg

Manually implementing that logic for adjustment of end points would be easy to forget, would be easy to get wrong (updating the 50 in two places), would be wordy, and would be slow. Python moves that logic to its internals where it is succint, automatic, fast, and correct. This is one of the reasons I love Python :-)

Part of question regarding assignments length mismatch from input length

The OP also wanted to know the rationale for allowing assignments such as p[20:100] = [7,8] where the assignment target has a different length (80) than the replacement data length (2).

It's easiest to see the motivation by an analogy with strings. Consider, "five little monkeys".replace("little", "humongous"). Note that the target "little" has only six letters and "humongous" has nine. We can do the same with lists:

Click to copy

>>> s = list("five little monkeys") >>> i = s.index('l') >>> n = len('little') >>> s[i : i+n ] = list("humongous") >>> ''.join(s) 'five humongous monkeys'

This all comes down to convenience.

Prior to the introduction of the copy() and clear() methods, these used to be popular idioms:

Click to copy

s[:] = []           # clear a list t = u[:]            # copy a list

Even now, we use this to update lists when filtering:

Click to copy

s[:] = [x for x in s if not math.isnan(x)]   # filter-out NaN values

Hope these practical examples give a good perspective on why slicing works as it does.

121

answered Sep 24 '22 00:09

Raymond Hettinger

The documentation has your answer:

s[i:j]: slice of s from i to j (note (4))

(4) The slice of s from i to j is defined as the sequence of items with index k such that i <= k < j. If i or j is greater than len(s), use len(s). If i is omitted or None, use 0. If j is omitted or None, use len(s). If i is greater than or equal to j, the slice is empty.

The documentation of IndexError confirms this behavior:

exception IndexError

Raised when a sequence subscript is out of range. (Slice indices are silently truncated to fall in the allowed range; if an index is not an integer, TypeError is raised.)

Essentially, stuff like p[20:100] is being reduced to p[len(p):len(p]. p[len(p):len(p] is an empty slice at the end of the list, and assigning a list to it will modify the end of the list to contain said list. Thus, it works like appending/extending the original list.

This behavior is the same as what happens when you assign a list to an empty slice anywhere in the original list. For example:

Click to copy

In [1]: p = [1, 2, 3, 4]  In [2]: p[2:2] = [42, 42, 42]  In [3]: p Out[3]: [1, 2, 42, 42, 42, 3, 4]

answered Sep 23 '22 00:09

iz_

Related questions
                            
                                Django : Testing if the page has redirected to the desired url
                            
                                How to make heapq evaluate the heap off of a specific attribute?
                            
                                Multi-level defaultdict with variable depth?
                            
                                Python: The _imagingft C module is not installed
                            
                                "The headers or library files could not be found for jpeg" installing Pillow on Alpine Linux
                            
                                Installed Python 3 on Mac OS X but its still Python 2.7
                            
                                Threading in Python [closed]
                            
                                Visual Studio Code pylint: Unable to import 'protorpc'
                            
                                Playing mp3 song on python
                            
                                finding first day of the month in python
                            
                                Pairwise circular Python 'for' loop
                            
                                Is there any way to use pythonappend with SWIG's new builtin feature?
                            
                                Infinite integer in Python
                            
                                How to replicate tee behavior in Python when using subprocess?
                            
                                Python: self.__class__ vs. type(self) [duplicate]
                            
                                Vim automatically removes indentation on Python comments [duplicate]
                            
                                TypeError: 'tuple' object does not support item assignment when swapping values
                            
                                dict.fromkeys all point to same list
                            
                                Log output of multiprocessing.Process
                            
                                What is python-dev package used for

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does Python allow out-of-range slice indexes for sequences?

Tags:

python

slice

python-3.x

sequence

range-checking

Akaisteph7

People also ask

2 Answers

Part of question regarding out-of-range indices

Part of question regarding assignments length mismatch from input length

Raymond Hettinger

iz_

Recent Activity

Donate For Us