Why can I start a slice past the end of a vector in Rust?

Tags:

rust

Given v = vec![1,2,3,4], why does v[4..] return an empty vector, but v[5..] panics, while both v[4] and v[5] panic? I suspect this has to do with the implementation of slicing without specifying either the start- or endpoint, but I couldn't find any information on this online.


asked Feb 06 '21 02:02

Armadillan

2 Answers

This is simply because std::ops::RangeFrom is defined to be "bounded inclusively below".

A quick recap of all the plumbing: v[4..] desugars to a call to std::ops::Index::index with 4.. (which parses as a std::ops::RangeFrom) as the argument. std::ops::RangeFrom<usize> implements std::slice::SliceIndex, and Vec implements std::ops::Index for any index type that implements std::slice::SliceIndex. So what you are looking at is a RangeFrom being used to index the Vec through std::ops::Index.
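To make the plumbing concrete, here is a minimal sketch that spells out the call by hand. The helper name `tail_via_index` is made up for illustration, and the `&Vec<i32>` parameter is only there so that the `Vec` implementation of `Index` is the one being used.

```rust
use std::ops::{Index, RangeFrom};

/// Hypothetical helper: slices `v` from `start` by calling `Index::index`
/// explicitly, which is roughly what `&v[start..]` desugars to.
fn tail_via_index(v: &Vec<i32>, start: usize) -> &[i32] {
    // `start..` is just a `RangeFrom<usize>` value.
    let range: RangeFrom<usize> = RangeFrom { start };
    Index::index(v, range)
}

fn main() {
    let v = vec![1, 2, 3, 4];
    // The explicit call and the indexing syntax give the same slice.
    assert_eq!(tail_via_index(&v, 1), &v[1..]);
    // Starting exactly at the length yields an empty slice.
    assert!(tail_via_index(&v, 4).is_empty());
}
```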

std::ops::RangeFrom is defined to always be inclusive on the lower bound. For example, [0..] will include the first element of the thing being indexed. If the Vec is empty, then [0..] is the empty slice. Notice: if the lower bound weren't inclusive (that is, if a start equal to the length were rejected), there would be no way to slice an empty Vec at all without causing a panic, which would be cumbersome.

A simple way to think about it is "where the fence-post is put".

A v[0..] in a vec![0, 1, 2, 3] is

|  0    1    2    3   |
  ^
  |- You are slicing from here. This includes the
     entire `Vec` (even if it was empty)

In v[4..] it is

|  0    1    2    3   |
                    ^
                    |- You are slicing from here to the end of the Vector.
                       Which results in, well, nothing.

while a v[5..] would be

|  0    1    2    3   |
                        ^
                        |- Slicing from here to infinity is definitely
                           outside the `Vec` and, also, the
                           caller's fault, so panic!

and a v[3..] is

|  0    1    2    3   |
                ^
                |- slicing from here to the end results in `&[3]`
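The four fence-post cases can be checked without triggering a panic by using the slice method `get`, which returns `None` where indexing would panic. This is only a sketch. The wrapper name `tail` is made up for illustration.

```rust
/// Hypothetical wrapper: non-panicking counterpart of `&v[start..]`.
fn tail(v: &[i32], start: usize) -> Option<&[i32]> {
    v.get(start..)
}

fn main() {
    let v = vec![0, 1, 2, 3];
    let empty: &[i32] = &[];
    let last: &[i32] = &[3];

    assert_eq!(tail(&v, 0), Some(&v[..])); // the whole Vec
    assert_eq!(tail(&v, 3), Some(last));   // just the last element
    assert_eq!(tail(&v, 4), Some(empty));  // fence-post at the very end
    assert_eq!(tail(&v, 5), None);         // past the end: `v[5..]` would panic
    assert_eq!(v.get(4), None);            // element access: `v[4]` would panic

    // An empty Vec can still be sliced from 0.
    let nothing: Vec<i32> = Vec::new();
    assert_eq!(tail(&nothing, 0), Some(empty));
    assert_eq!(tail(&nothing, 1), None);
}
```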

answered Dec 13 '22 10:12

user2722968

While the other answer explains how to understand and remember the indexing behavior implemented in the Rust standard library, the real reason why it is the way it is has nothing to do with technical limitations. It comes down to a design decision made by the authors of the Rust standard library.

Given v = vec![1,2,3,4], why does v[4..] return an empty vector, but v[5..] panics [..] ?

Because it was decided so. The code below, which handles slice indexing (full source), panics if the start index is larger than the slice's length. A start index equal to the length passes the check, which is why v[4..] succeeds.

fn index(self, slice: &[T]) -> &[T] {
    if self.start > slice.len() {
        slice_start_index_len_fail(self.start, slice.len());
    }
    // SAFETY: `self` is checked to be valid and in bounds above.
    unsafe { &*self.get_unchecked(slice) }
}

fn slice_start_index_len_fail(index: usize, len: usize) -> ! {
    panic!("range start index {} out of range for slice of length {}", index, len);
}
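The same boundary can be reproduced in safe code. The following is a sketch, not the standard library's implementation: `index_from` is a hypothetical function that copies the `start > len` check and then uses `split_at` instead of `get_unchecked`.

```rust
use std::panic;

/// Hypothetical safe re-statement of the bounds check quoted above.
fn index_from<T>(slice: &[T], start: usize) -> &[T] {
    if start > slice.len() {
        panic!(
            "range start index {} out of range for slice of length {}",
            start,
            slice.len()
        );
    }
    // Here `start <= slice.len()`, so `split_at` cannot panic.
    slice.split_at(start).1
}

fn main() {
    let v = vec![1, 2, 3, 4];
    // start == len passes the `>` check and yields an empty slice.
    assert!(index_from(&v, 4).is_empty());
    // start == len + 1 fails the check; catch the panic to observe it.
    let result = panic::catch_unwind(|| index_from(&[1, 2, 3, 4], 5).len());
    assert!(result.is_err());
}
```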

How could it be implemented differently? I personally like how Python does it.

v = [1, 2, 3, 4]

a = v[4]   # -> Raises an exception        - Similar to Rust's behavior (panic)
b = v[5]   # -> Same, raises an exception  - Also similar to Rust's

# (Equivalent to Rust's v[4..])
w = v[4:]  # -> Returns an empty list      - Similar to Rust's
x = v[5:]  # -> Also returns an empty list - Different from Rust's, which panics

Python's approach is not necessarily better than Rust's, because there's always a trade-off. Python's approach is more convenient (there's no need to check whether the start index exceeds the length), but if there's a bug, it's harder to find because it doesn't fail early.

Although Rust could technically follow Python's approach, its designers chose to fail early by panicking so that bugs are found sooner, at the cost of some inconvenience (programmers need to ensure that the start index is not greater than the length).
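If Python's behavior is what you want in a particular spot, it can be opted into explicitly. A minimal sketch, assuming a helper of your own (the name `tail_clamped` is made up, it is not a standard library function) that clamps the start index to the length before slicing:

```rust
/// Hypothetical helper: Python-style `v[start:]`.
/// Clamps `start` to the slice length, so it never panics.
fn tail_clamped<T>(slice: &[T], start: usize) -> &[T] {
    &slice[start.min(slice.len())..]
}

fn main() {
    let v = vec![1, 2, 3, 4];
    assert_eq!(tail_clamped(&v, 2), [3, 4]);
    assert!(tail_clamped(&v, 4).is_empty());
    // Unlike `&v[5..]`, this returns an empty slice instead of panicking.
    assert!(tail_clamped(&v, 5).is_empty());
}
```

The trade-off described above still applies: a start index that is too large because of a bug goes unnoticed here.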

answered Dec 13 '22 10:12

Daniel
