I think I understand, at a very high level what the difference between <code>&</code> and <code>*</code> in Rust is as it pertains to memory management. What is the difference between the following code snippets. Are there dangers to applying one approach versus the other? <pre class="prettyprint"><code>for (i, item) in bytes.iter().enumerate() { if *item == b' ' { return i; } } </code></pre> <pre class="prettyprint"><code>for (i, &item) in bytes.iter().enumerate() { if item == b' ' { return i; } } </code></pre> <pre class="prettyprint"><code>for (i, item) in bytes.iter().enumerate() { if item == &b' ' { return i; } } </code></pre> As I understand it, when I return a value from <code>iter()</code> I am returning a reference to the element found in <code>bytes</code>. If I want to make a comparison on the item, I need to either compare between two references <code>&u8</code> or I need to make <code>&item</code> a reference itself so that when I call <code>item</code> it is of type <code>u8</code>, or I need to dereference <code>item</code> when I compare it so that <code>item</code> = <code>&u8</code> -> <code>*item</code> = <code>u8</code>. <ol> <li>When I run the code using <code>(i, &item)</code>, when I call <code>item</code> later on, is this exactly the same thing as dereferencing in the second example, or is there a fundamental difference about how the compiler is interpreting the first code snippet and the second code snippet?</li> <li>Is there anything wrong with the third code snippet? I realize this is a bit of an opinion based question. I realize that if I were to assign a value to another variable using <code>item</code> (or <code>*item</code>, or assigning the value as a reference) I would have different datatypes being returned later on. Aside from managing your data types, is there anything else to keep in mind when considering if <code>item == &b' '</code> is the right tool for the job?</li> </ol>

There is no difference, whatsoever, between these snippets. They generate the exact same assembly: <pre class="prettyprint"><code>pub fn a(bytes: &[u8]) -> usize { for (i, item) in bytes.iter().enumerate() { if *item == b' ' { return i; } } 0 } pub fn b(bytes: &[u8]) -> usize { for (i, &item) in bytes.iter().enumerate() { if item == b' ' { return i; } } 0 } pub fn c(bytes: &[u8]) -> usize { for (i, item) in bytes.iter().enumerate() { if item == &b' ' { return i; } } 0 } </code></pre> <pre class="prettyprint lang-none prettyprint-override"><code>playground::a: negq %rsi movq $-1, %rax .LBB0_1: leaq (%rsi,%rax), %rcx cmpq $-1, %rcx je .LBB0_2 cmpb $32, 1(%rdi,%rax) leaq 1(%rax), %rax jne .LBB0_1 retq .LBB0_2: xorl %eax, %eax retq ; The code is identical so the functions are aliased .set playground::b, playground::a .set playground::c, playground::a </code></pre> For what it's worth, I'd write the function as <pre class="prettyprint"><code>pub fn a(bytes: &[u8]) -> Option<usize> { bytes.iter().position(|&b| b == b' ') } </code></pre> <blockquote> <code>iter()</code> [...] a reference to the element found in <code>bytes</code> </blockquote> Yes, <code>iter</code> is typically a function that returns an iterator of references. <blockquote> I need to either compare between </blockquote> Generally, you need to compare between two things with the same amount of references or sometimes one level of reference difference. How you achieve this is immaterial — referencing a value or dereferencing another, or dereferencing via <code>*</code> as an expression or via <code>&</code> in a pattern. See also: <ul> <li>Can't compare `&Thing` with `Thing`</li> </ul>

What are the differences between using * and & to compare values for equality?

Tags:

rust

I think I understand, at a very high level what the difference between & and * in Rust is as it pertains to memory management.

What is the difference between the following code snippets. Are there dangers to applying one approach versus the other?

for (i, item) in bytes.iter().enumerate() {
    if *item == b' ' {
        return i;
    }
}

for (i, &item) in bytes.iter().enumerate() {
    if item == b' ' {
        return i;
    }
}

for (i, item) in bytes.iter().enumerate() {
    if item == &b' ' {
        return i;
    }
}

As I understand it, when I return a value from iter() I am returning a reference to the element found in bytes. If I want to make a comparison on the item, I need to either compare between two references &u8 or I need to make &item a reference itself so that when I call item it is of type u8, or I need to dereference item when I compare it so that item = &u8 -> *item = u8.

When I run the code using (i, &item), when I call item later on, is this exactly the same thing as dereferencing in the second example, or is there a fundamental difference about how the compiler is interpreting the first code snippet and the second code snippet?
Is there anything wrong with the third code snippet? I realize this is a bit of an opinion based question. I realize that if I were to assign a value to another variable using item (or *item, or assigning the value as a reference) I would have different datatypes being returned later on. Aside from managing your data types, is there anything else to keep in mind when considering if item == &b' ' is the right tool for the job?

621

asked Oct 06 '18 18:10

Matt

1 Answers

There is no difference, whatsoever, between these snippets. They generate the exact same assembly:

pub fn a(bytes: &[u8]) -> usize {
    for (i, item) in bytes.iter().enumerate() {
        if *item == b' ' {
            return i;
        }
    }
    0
}

pub fn b(bytes: &[u8]) -> usize {
    for (i, &item) in bytes.iter().enumerate() {
        if item == b' ' {
            return i;
        }
    }
    0
}

pub fn c(bytes: &[u8]) -> usize {
    for (i, item) in bytes.iter().enumerate() {
        if item == &b' ' {
            return i;
        }
    }
    0
}

playground::a:
    negq    %rsi
    movq    $-1, %rax

.LBB0_1:
    leaq    (%rsi,%rax), %rcx
    cmpq    $-1, %rcx
    je  .LBB0_2
    cmpb    $32, 1(%rdi,%rax)
    leaq    1(%rax), %rax
    jne .LBB0_1
    retq

.LBB0_2:
    xorl    %eax, %eax
    retq

; The code is identical so the functions are aliased
.set playground::b, playground::a
.set playground::c, playground::a

For what it's worth, I'd write the function as

pub fn a(bytes: &[u8]) -> Option<usize> {
    bytes.iter().position(|&b| b == b' ')
}

iter() [...] a reference to the element found in bytes

Yes, iter is typically a function that returns an iterator of references.

I need to either compare between

Generally, you need to compare between two things with the same amount of references or sometimes one level of reference difference. How you achieve this is immaterial — referencing a value or dereferencing another, or dereferencing via * as an expression or via & in a pattern.

Shepmaster

Related questions
                            
                                Method not compatible with trait with confusing error message
                            
                                Rust cargo: how to use different features for a dep when a particular feature is enabled?
                            
                                Implement function for trait implementor with dynamic and static dispatch
                            
                                Why does Cargo create multiple directories for the same registry?
                            
                                Are reference values copied in Rust? [duplicate]
                            
                                How to use multiple variables in Rust's for loop?
                            
                                unconstrained type parameter error
                            
                                How do I write the lifetimes for references in a type constraint when one of them is a local reference?
                            
                                Why does iter borrow mutably when used in a pattern guard?
                            
                                Why do I get "conflicting implementations of trait" for f32 which does not implement Ord?
                            
                                How can I prevent functions from being aligned to 16 bytes boundary when compiling for X86?
                            
                                Collect iterators of length 2 into HashMap
                            
                                How do I translate x86 GCC-style C inline assembly to Rust inline assembly?
                            
                                Iterating through a recursive structure using mutable references and returning the last valid reference
                            
                                How do I specify the lifetime of an AsRef?
                            
                                "Can't assign requested address" when sending to a UdpSocket
                            
                                Can't get image::load_from_memory() to work when compiled to WebAssembly
                            
                                Why doesn't the comparison operation in my iterator filter over generic types work?
                            
                                Where does nom's "$i" macro argument come from?
                            
                                How to run multiple futures that call thread::sleep in parallel? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With