Experimenting with the programming language Rust, I found that the compiler is able to track a move of a field of some struct on the stack very accurately (it knows exactly what field has moved). However, when I put one part of the structure into a <code>Box</code> (i.e. putting it onto the heap), the compiler is no longer able to determine field-level moves for everything that happens after the dereference of the box. It will assume that the whole structure "inside the box" has moved. Let's first see an example where everything is on the stack: <pre class="prettyprint lang-rust prettyprint-override"><code>struct OuterContainer { inner: InnerContainer } struct InnerContainer { val_a: ValContainer, val_b: ValContainer } struct ValContainer { i: i32 } fn main() { // Note that the whole structure lives on the stack. let structure = OuterContainer { inner: InnerContainer { val_a: ValContainer { i: 42 }, val_b: ValContainer { i: 100 } } }; // Move just one field (val_a) of the inner container. let move_me = structure.inner.val_a; // We can still borrow the other field (val_b). let borrow_me = &structure.inner.val_b; } </code></pre> And now the same example but with one minor change: We put the <code>InnerContainer</code> into a box (<code>Box<InnerContainer></code>). <pre class="prettyprint lang-rust prettyprint-override"><code>struct OuterContainer { inner: Box<InnerContainer> } struct InnerContainer { val_a: ValContainer, val_b: ValContainer } struct ValContainer { i: i32 } fn main() { // Note that the whole structure lives on the stack. let structure = OuterContainer { inner: Box::new(InnerContainer { val_a: ValContainer { i: 42 }, val_b: ValContainer { i: 100 } }) }; // Move just one field (val_a) of the inner container. // Note that now, the inner container lives on the heap. let move_me = structure.inner.val_a; // We can no longer borrow the other field (val_b). let borrow_me = &structure.inner.val_b; // error: "value used after move" } </code></pre> I suspect that it has something to do with the nature of the stack vs. the nature of the heap, where the former is static (per stack frame at least), and the latter is dynamic. Maybe the compiler needs to play it safe because of some reason I cannot articulate/identify well enough.

In the abstract, a <code>struct</code> on the stack is kind of just a bunch of variables under a common name. The compiler knows this, and can break a structure into a set of otherwise independent stack variables. This lets it track the movement of each field independently. It can't do that with a <code>Box</code>, or any other kind of custom allocation, because the compiler doesn't control <code>Box</code>es. <code>Box</code> is just some code in the standard library, not an intrinsic part of the language. <code>Box</code> has no way of reasoning about different parts of itself suddenly becoming not valid. When it comes time to destroy a <code>Box</code>, it's <code>Drop</code> implementation only knows to destroy everything. To put it another way: on the stack, the compiler is in full control, and can thus do fancy things like breaking structures up and moving them piecemeal. As soon as custom allocation enters the picture, all bets are off, and the compiler has to back off and stop trying to be clever.

Ownership tracking in Rust: Difference between Box<T> (heap) and T (stack)

Tags:

heap-memory

dynamic

rust

stack-memory

memory-safety

Experimenting with the programming language Rust, I found that the compiler is able to track a move of a field of some struct on the stack very accurately (it knows exactly what field has moved). However, when I put one part of the structure into a Box (i.e. putting it onto the heap), the compiler is no longer able to determine field-level moves for everything that happens after the dereference of the box. It will assume that the whole structure "inside the box" has moved. Let's first see an example where everything is on the stack:

struct OuterContainer {
    inner: InnerContainer
}

struct InnerContainer {
    val_a: ValContainer,
    val_b: ValContainer
}

struct ValContainer {
    i: i32
}


fn main() {
    // Note that the whole structure lives on the stack.
    let structure = OuterContainer {
        inner: InnerContainer {
            val_a: ValContainer { i: 42 },
            val_b: ValContainer { i: 100 }
        }
    };

    // Move just one field (val_a) of the inner container.
    let move_me = structure.inner.val_a;

    // We can still borrow the other field (val_b).
    let borrow_me = &structure.inner.val_b;
}

And now the same example but with one minor change: We put the InnerContainer into a box (Box<InnerContainer>).

struct OuterContainer {
    inner: Box<InnerContainer>
}

struct InnerContainer {
    val_a: ValContainer,
    val_b: ValContainer
}

struct ValContainer {
    i: i32
}


fn main() {
    // Note that the whole structure lives on the stack.
    let structure = OuterContainer {
        inner: Box::new(InnerContainer {
            val_a: ValContainer { i: 42 },
            val_b: ValContainer { i: 100 }
        })
    };

    // Move just one field (val_a) of the inner container.
    // Note that now, the inner container lives on the heap.
    let move_me = structure.inner.val_a;

    // We can no longer borrow the other field (val_b).
    let borrow_me = &structure.inner.val_b; // error: "value used after move"
}

I suspect that it has something to do with the nature of the stack vs. the nature of the heap, where the former is static (per stack frame at least), and the latter is dynamic. Maybe the compiler needs to play it safe because of some reason I cannot articulate/identify well enough.

265

asked May 20 '17 04:05

domin

1 Answers

In the abstract, a struct on the stack is kind of just a bunch of variables under a common name. The compiler knows this, and can break a structure into a set of otherwise independent stack variables. This lets it track the movement of each field independently.

It can't do that with a Box, or any other kind of custom allocation, because the compiler doesn't control Boxes. Box is just some code in the standard library, not an intrinsic part of the language. Box has no way of reasoning about different parts of itself suddenly becoming not valid. When it comes time to destroy a Box, it's Drop implementation only knows to destroy everything.

To put it another way: on the stack, the compiler is in full control, and can thus do fancy things like breaking structures up and moving them piecemeal. As soon as custom allocation enters the picture, all bets are off, and the compiler has to back off and stop trying to be clever.

answered Oct 16 '22 19:10

DK.

Related questions
                            
                                Yii2 set db connection at runtime
                            
                                Dynamic objects in c++ [closed]
                            
                                dplyr mutate new dynamic variables with case_when
                            
                                Error 'A value of type 'dynamic' can't be assigned to a variable of type 'String'.' in Dart 2.2
                            
                                Attaching properties and methods at runtime in C# 4.0?
                            
                                C# dynamic types - heaven or hell?
                            
                                How to dynamically add and remove a tab in p:tabView component
                            
                                Why does this runtime dynamic binding fail?
                            
                                Copy array to dynamically allocated memory
                            
                                How can I dynamically populate a CheckedListBox?
                            
                                RuntimeBinderException when using dynamic object
                            
                                Using DynamicObject (IDynamicMetaObjectProvider) as a component of a static type leads to infinite loop
                            
                                How to dynamically call an operator in Elixir
                            
                                How to define a C# object at run time?
                            
                                Using AJAX to load WordPress pages
                            
                                Change for loop index variable inside the loop
                            
                                GridView Header Text in asp.net
                            
                                Javascript dynamically getter/setter for private properties
                            
                                Does awk support dynamic user-defined variables?
                            
                                How to set protractor(v1.4.0) baseUrl using protractor API instead of configuration file?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With