Came across a proposal called "rvalue reference for *this" in clang's C++11 status page. I've read quite a bit about rvalue references and understood them, but I don't think I know about this. I also couldn't find much resources on the web using the terms. There's a link to the proposal paper on the page: N2439 (Extending move semantics to *this), but I'm also not getting much examples from there. What is this feature about?

First, "ref-qualifiers for *this" is a just a "marketing statement". The type of <code>*this</code> never changes, see the bottom of this post. It's way easier to understand it with this wording though. Next, the following code chooses the function to be called based on the ref-qualifier of the "implicit object parameter" of the function&dagger;: <pre class="prettyprint"><code>// t.cpp #include <iostream> struct test{ void f() &{ std::cout << "lvalue object\n"; } void f() &&{ std::cout << "rvalue object\n"; } }; int main(){ test t; t.f(); // lvalue test().f(); // rvalue } </code></pre> Output: <pre class="prettyprint"><code>$ clang++ -std=c++0x -stdlib=libc++ -Wall -pedantic t.cpp $ ./a.out lvalue object rvalue object </code></pre> The whole thing is done to allow you to take advantage of the fact when the object the function is called on is an rvalue (unnamed temporary, for example). Take the following code as a further example: <pre class="prettyprint"><code>struct test2{ std::unique_ptr<int[]> heavy_resource; test2() : heavy_resource(new int[500]) {} operator std::unique_ptr<int[]>() const&{ // lvalue object, deep copy std::unique_ptr<int[]> p(new int[500]); for(int i=0; i < 500; ++i) p[i] = heavy_resource[i]; return p; } operator std::unique_ptr<int[]>() &&{ // rvalue object // we are garbage anyways, just move resource return std::move(heavy_resource); } }; </code></pre> This may be a bit contrived, but you should get the idea. Note that you can combine the cv-qualifiers (<code>const</code> and <code>volatile</code>) and ref-qualifiers (<code>&</code> and <code>&&</code>). <hr> Note: Many standard quotes and overload resolution explanation after here! &dagger; To understand how this works, and why @Nicol Bolas' answer is at least partly wrong, we have to dig in the C++ standard for a bit (the part explaining why @Nicol's answer is wrong is at the bottom, if you're only interested in that). Which function is going to be called is determined by a process called overload resolution. This process is fairly complicated, so we'll only touch the bit that is important to us. First, it's important to see how overload resolution for member functions works: <code>§13.3.1 [over.match.funcs]</code> <blockquote> p2 The set of candidate functions can contain both member and non-member functions to be resolved against the same argument list. So that argument and parameter lists are comparable within this heterogeneous set, a member function is considered to have an extra parameter, called the implicit object parameter, which represents the object for which the member function has been called. [...] p3 Similarly, when appropriate, the context can construct an argument list that contains an implied object argument to denote the object to be operated on. </blockquote> Why do we even need to compare member and non-member functions? Operator overloading, that's why. Consider this: <pre class="prettyprint"><code>struct foo{ foo& operator<<(void*); // implementation unimportant }; foo& operator<<(foo&, char const*); // implementation unimportant </code></pre> You'd certainly want the following to call the free function, don't you? <pre class="prettyprint"><code>char const* s = "free foo!\n"; foo f; f << s; </code></pre> That's why member and non-member functions are included in the so-called overload-set. To make the resolution less complicated, the bold part of the standard quote exists. Additionally, this is the important bit for us (same clause): <blockquote> p4 For non-static member functions, the type of the implicit object parameter is <ul> <li> “lvalue reference to cv <code>X</code>” for functions declared without a ref-qualifier or with the <code>&</code> ref-qualifier </li> <li> “rvalue reference to cv <code>X</code>” for functions declared with the <code>&&</code> ref-qualifier </li> </ul> where <code>X</code> is the class of which the function is a member and cv is the cv-qualification on the member function declaration. [...] p5 During overload resolution [...] [t]he implicit object parameter [...] retains its identity since conversions on the corresponding argument shall obey these additional rules: <ul> <li> no temporary object can be introduced to hold the argument for the implicit object parameter; and </li> <li> no user-defined conversions can be applied to achieve a type match with it </li> </ul> [...] </blockquote> (The last bit just means that you can't cheat overload resolution based on implicit conversions of the object a member function (or operator) is called on.) Let's take the first example at the top of this post. After the aforementioned transformation, the overload-set looks something like this: <pre class="prettyprint"><code>void f1(test&); // will only match lvalues, linked to 'void test::f() &' void f2(test&&); // will only match rvalues, linked to 'void test::f() &&' </code></pre> Then the argument list, containing an implied object argument, is matched against the parameter-list of every function contained in the overload-set. In our case, the argument list will only contain that object argument. Let's see how that looks like: <pre class="prettyprint"><code>// first call to 'f' in 'main' test t; f1(t); // 't' (lvalue) can match 'test&' (lvalue reference) // kept in overload-set f2(t); // 't' not an rvalue, can't match 'test&&' (rvalue reference) // taken out of overload-set </code></pre> If, after all overloads in the set are tested, only one remains, the overload resolution succeeded and the function linked to that transformed overload is called. The same goes for the second call to 'f': <pre class="prettyprint"><code>// second call to 'f' in 'main' f1(test()); // 'test()' not an lvalue, can't match 'test&' (lvalue reference) // taken out of overload-set f2(test()); // 'test()' (rvalue) can match 'test&&' (rvalue reference) // kept in overload-set </code></pre> Note however that, had we not provided any ref-qualifier (and as such not overloaded the function), that <code>f1</code> would match an rvalue (still <code>§13.3.1</code>): <blockquote> p5 [...] For non-static member functions declared without a ref-qualifier, an additional rule applies: <ul> <li>even if the implicit object parameter is not <code>const</code>-qualified, an rvalue can be bound to the parameter as long as in all other respects the argument can be converted to the type of the implicit object parameter.</li> </ul> </blockquote> <pre class="prettyprint"><code>struct test{ void f() { std::cout << "lvalue or rvalue object\n"; } }; int main(){ test t; t.f(); // OK test().f(); // OK too } </code></pre> <hr> Now, onto why @Nicol's answer is atleast partly wrong. He says: <blockquote> Note that this declaration changes the type of <code>*this</code>. </blockquote> That is wrong, <code>*this</code> is always an lvalue: <code>§5.3.1 [expr.unary.op] p1</code> <blockquote> The unary <code>*</code> operator performs indirection: the expression to which it is applied shall be a pointer to an object type, or a pointer to a function type and the result is an lvalue referring to the object or function to which the expression points. </blockquote> <code>§9.3.2 [class.this] p1</code> <blockquote> In the body of a non-static (9.3) member function, the keyword <code>this</code> is a prvalue expression whose value is the address of the object for which the function is called. The type of <code>this</code> in a member function of a class <code>X</code> is <code>X*</code>. [...] </blockquote>

Let's say you have two functions on a class, both with the same name and signature. But one of them is declared <code>const</code>: <pre class="prettyprint"><code>void SomeFunc() const; void SomeFunc(); </code></pre> If a class instance is not <code>const</code>, overload resolution will preferentially select the non-const version. If the instance is <code>const</code>, the user can only call the <code>const</code> version. And the <code>this</code> pointer is a <code>const</code> pointer, so the instance cannot be changed. What "r-value reference for this` does is allow you to add another alternative: <pre class="prettyprint"><code>void RValueFunc() &&; </code></pre> This allows you to have a function that can only be called if the user calls it through a proper r-value. So if this is in the type <code>Object</code>: <pre class="prettyprint"><code>Object foo; foo.RValueFunc(); //error: no `RValueFunc` version exists that takes `this` as l-value. Object().RValueFunc(); //calls the non-const, && version. </code></pre> This way, you can specialize behavior based on whether the object is being accessed via an r-value or not. Note that you are not allowed to overload between the r-value reference versions and the non-reference versions. That is, if you have a member function name, all of its versions either use the l/r-value qualifiers on <code>this</code>, or none of them do. You can't do this: <pre class="prettyprint"><code>void SomeFunc(); void SomeFunc() &&; </code></pre> You must do this: <pre class="prettyprint"><code>void SomeFunc() &; void SomeFunc() &&; </code></pre> Note that this declaration changes the type of <code>*this</code>. This means that the <code>&&</code> versions all access members as r-value references. So it becomes possible to easily move from within the object. The example given in the first version of the proposal is (note: the following may not be correct with the final version of C++11; it's straight from the initial "r-value from this" proposal): <pre class="prettyprint"><code>class X { std::vector<char> data_; public: // ... std::vector<char> const & data() const & { return data_; } std::vector<char> && data() && { return data_; } }; X f(); // ... X x; std::vector<char> a = x.data(); // copy std::vector<char> b = f().data(); // move </code></pre>

Ampersand after function declaration [duplicate]

2 Answers

First, "ref-qualifiers for *this" is a just a "marketing statement". The type of *this never changes, see the bottom of this post. It's way easier to understand it with this wording though.

Next, the following code chooses the function to be called based on the ref-qualifier of the "implicit object parameter" of the function^†:

// t.cpp
#include <iostream>

struct test{
  void f() &{ std::cout << "lvalue object\n"; }
  void f() &&{ std::cout << "rvalue object\n"; }
};

int main(){
  test t;
  t.f(); // lvalue
  test().f(); // rvalue
}

Output:

$ clang++ -std=c++0x -stdlib=libc++ -Wall -pedantic t.cpp
$ ./a.out
lvalue object
rvalue object

The whole thing is done to allow you to take advantage of the fact when the object the function is called on is an rvalue (unnamed temporary, for example). Take the following code as a further example:

struct test2{
  std::unique_ptr<int[]> heavy_resource;

  test2()
    : heavy_resource(new int[500]) {}

  operator std::unique_ptr<int[]>() const&{
    // lvalue object, deep copy
    std::unique_ptr<int[]> p(new int[500]);
    for(int i=0; i < 500; ++i)
      p[i] = heavy_resource[i];

    return p;
  }

  operator std::unique_ptr<int[]>() &&{
    // rvalue object
    // we are garbage anyways, just move resource
    return std::move(heavy_resource);
  }
};

This may be a bit contrived, but you should get the idea.

Note that you can combine the cv-qualifiers (const and volatile) and ref-qualifiers (& and &&).

^{Note: Many standard quotes and overload resolution explanation after here!}

† To understand how this works, and why @Nicol Bolas' answer is at least partly wrong, we have to dig in the C++ standard for a bit (the part explaining why @Nicol's answer is wrong is at the bottom, if you're only interested in that).

Which function is going to be called is determined by a process called overload resolution. This process is fairly complicated, so we'll only touch the bit that is important to us.

First, it's important to see how overload resolution for member functions works:

§13.3.1 [over.match.funcs]

p2 The set of candidate functions can contain both member and non-member functions to be resolved against the same argument list. So that argument and parameter lists are comparable within this heterogeneous set, a member function is considered to have an extra parameter, called the implicit object parameter, which represents the object for which the member function has been called. [...]

p3 Similarly, when appropriate, the context can construct an argument list that contains an implied object argument to denote the object to be operated on.

Why do we even need to compare member and non-member functions? Operator overloading, that's why. Consider this:

struct foo{
  foo& operator<<(void*); // implementation unimportant
};

foo& operator<<(foo&, char const*); // implementation unimportant

You'd certainly want the following to call the free function, don't you?

char const* s = "free foo!\n";
foo f;
f << s;

That's why member and non-member functions are included in the so-called overload-set. To make the resolution less complicated, the bold part of the standard quote exists. Additionally, this is the important bit for us (same clause):

p4 For non-static member functions, the type of the implicit object parameter is

“lvalue reference to cv X” for functions declared without a ref-qualifier or with the & ref-qualifier

“rvalue reference to cv X” for functions declared with the && ref-qualifier

where X is the class of which the function is a member and cv is the cv-qualification on the member function declaration. [...]

p5 During overload resolution [...] [t]he implicit object parameter [...] retains its identity since conversions on the corresponding argument shall obey these additional rules:

no temporary object can be introduced to hold the argument for the implicit object parameter; and

no user-defined conversions can be applied to achieve a type match with it

[...]

(The last bit just means that you can't cheat overload resolution based on implicit conversions of the object a member function (or operator) is called on.)

Let's take the first example at the top of this post. After the aforementioned transformation, the overload-set looks something like this:

void f1(test&); // will only match lvalues, linked to 'void test::f() &'
void f2(test&&); // will only match rvalues, linked to 'void test::f() &&'

Then the argument list, containing an implied object argument, is matched against the parameter-list of every function contained in the overload-set. In our case, the argument list will only contain that object argument. Let's see how that looks like:

// first call to 'f' in 'main'
test t;
f1(t); // 't' (lvalue) can match 'test&' (lvalue reference)
       // kept in overload-set
f2(t); // 't' not an rvalue, can't match 'test&&' (rvalue reference)
       // taken out of overload-set

If, after all overloads in the set are tested, only one remains, the overload resolution succeeded and the function linked to that transformed overload is called. The same goes for the second call to 'f':

// second call to 'f' in 'main'
f1(test()); // 'test()' not an lvalue, can't match 'test&' (lvalue reference)
            // taken out of overload-set
f2(test()); // 'test()' (rvalue) can match 'test&&' (rvalue reference)
            // kept in overload-set

Note however that, had we not provided any ref-qualifier (and as such not overloaded the function), that f1 would match an rvalue (still §13.3.1):

p5 [...] For non-static member functions declared without a ref-qualifier, an additional rule applies:

even if the implicit object parameter is not const-qualified, an rvalue can be bound to the parameter as long as in all other respects the argument can be converted to the type of the implicit object parameter.

struct test{
  void f() { std::cout << "lvalue or rvalue object\n"; }
};

int main(){
  test t;
  t.f(); // OK
  test().f(); // OK too
}

Now, onto why @Nicol's answer is atleast partly wrong. He says:

Note that this declaration changes the type of *this.

That is wrong, *this is always an lvalue:

§5.3.1 [expr.unary.op] p1

The unary * operator performs indirection: the expression to which it is applied shall be a pointer to an object type, or a pointer to a function type and the result is an lvalue referring to the object or function to which the expression points.

§9.3.2 [class.this] p1

In the body of a non-static (9.3) member function, the keyword this is a prvalue expression whose value is the address of the object for which the function is called. The type of this in a member function of a class X is X*. [...]

193

answered Sep 27 '22 20:09

Xeo

Let's say you have two functions on a class, both with the same name and signature. But one of them is declared const:

void SomeFunc() const;
void SomeFunc();

If a class instance is not const, overload resolution will preferentially select the non-const version. If the instance is const, the user can only call the const version. And the this pointer is a const pointer, so the instance cannot be changed.

What "r-value reference for this` does is allow you to add another alternative:

void RValueFunc() &&;

This allows you to have a function that can only be called if the user calls it through a proper r-value. So if this is in the type Object:

Object foo;
foo.RValueFunc(); //error: no `RValueFunc` version exists that takes `this` as l-value.
Object().RValueFunc(); //calls the non-const, && version.

This way, you can specialize behavior based on whether the object is being accessed via an r-value or not.

Note that you are not allowed to overload between the r-value reference versions and the non-reference versions. That is, if you have a member function name, all of its versions either use the l/r-value qualifiers on this, or none of them do. You can't do this:

void SomeFunc();
void SomeFunc() &&;

You must do this:

void SomeFunc() &;
void SomeFunc() &&;

Note that this declaration changes the type of *this. This means that the && versions all access members as r-value references. So it becomes possible to easily move from within the object. The example given in the first version of the proposal is (note: the following may not be correct with the final version of C++11; it's straight from the initial "r-value from this" proposal):

class X {
   std::vector<char> data_;
public:
   // ...
   std::vector<char> const & data() const & { return data_; }
   std::vector<char> && data() && { return data_; }
};

X f();

// ...
X x;
std::vector<char> a = x.data(); // copy
std::vector<char> b = f().data(); // move

answered Sep 27 '22 20:09

Nicol Bolas

Related questions
                            
                                is it possible in C or C++ to create a function inside another?
                            
                                Determining Whether Pointer is Valid
                            
                                skipped when looking for precompiled header
                            
                                Visual Studio 6 tips and tricks [closed]
                            
                                New to C++: should I use Visual Studio? [closed]
                            
                                What is the most violent way that an application can terminate itself (linux)
                            
                                Is it possible to define multiple classes in just one .cpp file?
                            
                                Is using NULL references OK?
                            
                                What are the limits of Python? [closed]
                            
                                Is Embarcadero C++ Builder a good choice as an IDE? [closed]
                            
                                why there is no find for vector in C++
                            
                                What are the disadvantages of using templates?
                            
                                C++ example of Coding Horror or Brilliant Idea?
                            
                                nanoseconds to milliseconds - fast division by 1000000
                            
                                Determining if a number is prime
                            
                                What's wrong with const?
                            
                                How to detect declared but undefined functions in C++?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Ampersand after function declaration [duplicate]

Tags:

c++

c++-faq

c++11

move-semantics

qualifiers

ryaner

People also ask

2 Answers

Xeo

Nicol Bolas

Recent Activity

Donate For Us