I'm having trouble understanding the rules about traits in algebraic data types. Here's a simplified example: <pre class="prettyprint lang-rust prettyprint-override"><code>use std::rc::Rc; use std::cell::RefCell; trait Quack { fn quack(&self); } struct Duck; impl Quack for Duck { fn quack(&self) { println!("Quack!"); } } fn main() { let mut pond: Vec<Box<Quack>> = Vec::new(); let duck: Box<Duck> = Box::new(Duck); pond.push(duck); // This is valid. let mut lake: Vec<Rc<RefCell<Box<Quack>>>> = Vec::new(); let mallard: Rc<RefCell<Box<Duck>>> = Rc::new(RefCell::new(Box::new(Duck))); lake.push(mallard); // This is a type mismatch. } </code></pre> The above fails to compile, yielding the following error message: <pre class="prettyprint lang-none prettyprint-override"><code> expected `alloc::rc::Rc<core::cell::RefCell<Box<Quack>>>`, found `alloc::rc::Rc<core::cell::RefCell<Box<Duck>>>` (expected trait Quack, found struct `Duck`) [E0308] src/main.rs:19 lake.push(mallard); </code></pre> Why is it that <code>pond.push(duck)</code> is valid, yet <code>lake.push(mallard)</code> isn't? In both cases, a <code>Duck</code> has been supplied where a <code>Quack</code> was expected. In the former, the compiler is happy, but in the latter, it's not. Is the reason for this difference related to <code>CoerceUnsized</code>?

Vladimir's answer explained what the compiler is doing. Based on that information, I developed a solution: Creating a wrapper struct around <code>Box<Quack></code>. The wrapper is called <code>QuackWrap</code>. It has a fixed size, and it can be used just like any other struct (I think). The <code>Box</code> inside <code>QuackWrap</code> allows me to build a <code>QuackWrap</code> around any trait that implements <code>Quack</code>. Thus, I can have a <code>Vec<Rc<RefCell<QuackWrap>>></code> where the inner values are a mixture of <code>Duck</code>s, <code>Goose</code>s, etc. <pre class="prettyprint lang-rust prettyprint-override"><code>use std::rc::Rc; use std::cell::RefCell; trait Quack { fn quack(&self); } struct Duck; impl Quack for Duck { fn quack(&self) { println!("Quack!"); } } struct QuackWrap(Box<Quack>); impl QuackWrap { pub fn new<T: Quack + 'static>(value: T) -> QuackWrap { QuackWrap(Box::new(value)) } } fn main() { let mut pond: Vec<Box<Quack>> = Vec::new(); let duck: Box<Duck> = Box::new(Duck); pond.push(duck); // This is valid. // This would be a type error: //let mut lake: Vec<Rc<RefCell<Box<Quack>>>> = Vec::new(); //let mallard: Rc<RefCell<Box<Duck>>> = Rc::new(RefCell::new(Box::new(Duck))); //lake.push(mallard); // This is a type mismatch. // Instead, we can do this: let mut lake: Vec<Rc<RefCell<QuackWrap>>> = Vec::new(); let mallard: Rc<RefCell<QuackWrap>> = Rc::new(RefCell::new(QuackWrap::new(Duck))); lake.push(mallard); // This is valid. } </code></pre> As an added convenience, I'll probably want to implement <code>Deref</code> and <code>DefrefMut</code> on <code>QuackWrap</code>. But that's not necessary for the above example.

Traits in algebraic data types

Tags:

rust

I'm having trouble understanding the rules about traits in algebraic data types. Here's a simplified example:

use std::rc::Rc;
use std::cell::RefCell;

trait Quack {
    fn quack(&self);
}

struct Duck;

impl Quack for Duck {
    fn quack(&self) { println!("Quack!"); }
}

fn main() {
    let mut pond: Vec<Box<Quack>> = Vec::new();
    let duck: Box<Duck> = Box::new(Duck);
    pond.push(duck); // This is valid.

    let mut lake: Vec<Rc<RefCell<Box<Quack>>>> = Vec::new();
    let mallard: Rc<RefCell<Box<Duck>>> = Rc::new(RefCell::new(Box::new(Duck)));
    lake.push(mallard); // This is a type mismatch.
}

The above fails to compile, yielding the following error message:

 expected `alloc::rc::Rc<core::cell::RefCell<Box<Quack>>>`,
    found `alloc::rc::Rc<core::cell::RefCell<Box<Duck>>>`
(expected trait Quack,
    found struct `Duck`) [E0308]
src/main.rs:19     lake.push(mallard);

Why is it that pond.push(duck) is valid, yet lake.push(mallard) isn't? In both cases, a Duck has been supplied where a Quack was expected. In the former, the compiler is happy, but in the latter, it's not.

Is the reason for this difference related to CoerceUnsized?

588

asked Jun 05 '15 02:06

rlkw1024

2 Answers

This is a correct behavior, even if it is somewhat unfortunate.

In the first case we have this:

let mut pond: Vec<Box<Quack>> = Vec::new();
let duck: Box<Duck> = Box::new(Duck);
pond.push(duck);

Note that push(), when called on Vec<Box<Quack>>, accepts Box<Quack>, and you're passing Box<Duck>. This is OK - rustc is able to understand that you want to convert a boxed value to a trait object, like here:

let duck: Box<Duck> = Box::new(Duck);
let quack: Box<Quack> = duck;  // automatic coercion to a trait object

In the second case we have this:

let mut lake: Vec<Rc<RefCell<Box<Quack>>>> = Vec::new();
let mallard: Rc<RefCell<Box<Duck>>> = Rc::new(RefCell::new(Box::new(Duck)));
lake.push(mallard);

Here push() accepts Rc<RefCell<Box<Quack>>> while you provide Rc<RefCell<Box<Duck>>>:

let mallard: Rc<RefCell<Box<Duck>>> = Rc::new(RefCell::new(Box::new(Duck)));
let quack: Rc<RefCell<Box<Quack>>> = mallard;

And now there is a trouble. Box<T> is a DST-compatible type, so it can be used as a container for a trait object. The same thing will soon be true for Rc and other smart pointers when this RFC is implemented. However, in this case there is no coercion from a concrete type to a trait object because Box<Duck> is inside of additional layers of types (Rc<RefCell<..>>).

Remember, trait object is a fat pointer, so Box<Duck> is different from Box<Quack> in size. Consequently, in principle, they are not directly compatible: you can't just take bytes of Box<Duck> and write them to where Box<Quack> is expected. Rust performs a special conversion, that is, it obtains a pointer to the virtual table for Duck, constructs a fat pointer and writes it to Box<Quack>-typed variable.

When you have Rc<RefCell<Box<Duck>>>, however, rustc would need to know how to construct and destructure both RefCell and Rc in order to apply the same fat pointer conversion to its internals. Naturally, because these are library types, it can't know how to do it. This is also true for any other wrapper type, e.g. Arc or Mutex or even Vec. You don't expect that it would be possible to use Vec<Box<Duck>> as Vec<Box<Quack>>, right?

Also there is a fact that in the example with Rc the Rcs created out of Box<Duck> and Box<Quack> wouldn't have been connected - they would have had different reference counters.

That is, a conversion from a concrete type to a trait object can only happen if you have direct access to a smart pointer which supports DST, not when it is hidden inside some other structure.

That said, I see how it may be possible to allow this for a few select types. For example, we could introduce some kind of Construct/Unwrap traits which are known to the compiler and which it could use to "reach" inside of a stack of wrappers and perform trait object conversion inside them. However, no one designed this thing and provided an RFC about it yet - probably because it is not a widely needed feature.

134

answered Oct 14 '22 23:10

Vladimir Matveev

Vladimir's answer explained what the compiler is doing. Based on that information, I developed a solution: Creating a wrapper struct around Box<Quack>.

The wrapper is called QuackWrap. It has a fixed size, and it can be used just like any other struct (I think). The Box inside QuackWrap allows me to build a QuackWrap around any trait that implements Quack. Thus, I can have a Vec<Rc<RefCell<QuackWrap>>> where the inner values are a mixture of Ducks, Gooses, etc.

use std::rc::Rc;
use std::cell::RefCell;

trait Quack {
    fn quack(&self);
}

struct Duck;

impl Quack for Duck {
    fn quack(&self) { println!("Quack!"); }
}

struct QuackWrap(Box<Quack>);

impl QuackWrap {
    pub fn new<T: Quack + 'static>(value: T) -> QuackWrap {
        QuackWrap(Box::new(value))
    }
}

fn main() {
    let mut pond: Vec<Box<Quack>> = Vec::new();
    let duck: Box<Duck> = Box::new(Duck);
    pond.push(duck); // This is valid.

    // This would be a type error:
    //let mut lake: Vec<Rc<RefCell<Box<Quack>>>> = Vec::new();
    //let mallard: Rc<RefCell<Box<Duck>>> = Rc::new(RefCell::new(Box::new(Duck)));
    //lake.push(mallard); // This is a type mismatch.

    // Instead, we can do this:
    let mut lake: Vec<Rc<RefCell<QuackWrap>>> = Vec::new();
    let mallard: Rc<RefCell<QuackWrap>> = Rc::new(RefCell::new(QuackWrap::new(Duck)));
    lake.push(mallard); // This is valid.
}

As an added convenience, I'll probably want to implement Deref and DefrefMut on QuackWrap. But that's not necessary for the above example.

answered Oct 15 '22 01:10

rlkw1024

Related questions
                            
                                Why Rust don't use default generic parameter type
                            
                                Casting a function reference producing an invalid pointer?
                            
                                Is it safe to have a value that may be changed by processor unexpectedly?
                            
                                Declaring array using a constant expression for its size
                            
                                Why disallow re-using the same value in a format! macro
                            
                                What is the proper way to go from a String to a *const i8?
                            
                                How to make VS Code build and run Rust programs?
                            
                                Is it possible to share data with threads without any cloning?
                            
                                Ruby string to rust and back again
                            
                                Unable to tackle optional fields in JSON with Rustc-serialize
                            
                                Why is the value moved into the closure here rather than borrowed?
                            
                                Program with a spawned thread panics when optimization enabled
                            
                                Is there a way to get a BufWriter's buffer length?
                            
                                Why do generic lifetimes not conform to the smaller lifetime of a nested scope?
                            
                                Is it possible to store a Rust struct containing a closure in a different struct?
                            
                                How to write a trait method taking an iterator of strings, avoiding monomorphization (static dispatch)?
                            
                                In Rust, can you own a string literal?
                            
                                Cannot infer type for type parameter `S` when using HashSet::from_iter
                            
                                Rust persistent TcpStream
                            
                                Creating a callback system using closures

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Traits in algebraic data types

Tags:

rust

rlkw1024

People also ask

2 Answers

Vladimir Matveev

rlkw1024

Recent Activity

Donate For Us