I was investigating the performance of moving <code>std::string</code>. For the longest time, I've regarded string moves as almost free, thinking the compiler will inline everything and it will only involve a few cheap assignments. In fact, my mental model for moving is literally <pre class="prettyprint"><code>string& operator=(string&& rhs) noexcept { swap(*this, rhs); return *this; } friend void swap(string& x, string& y) noexcept { // for disposition only unsigned char buf[sizeof(string)]; memcpy(buf, &x, sizeof(string)); memcpy(&x, &y, sizeof(string)); memcpy(&y, buf, sizeof(string)); } </code></pre> To the best of my understanding, this is a legal implementation if the <code>memcpy</code> is changed to assigning individual fields. It is to my great surprise to find gcc's implementation of moving involves creating a new string and might possibly throw due to the allocations despite being <code>noexcept</code>. Is this even conforming? Equally important, should I not think moving is almost free? <hr> Bewilderingly, <code>std::vector<char></code> compiles down to what I'd expect. clang's implementation is much different, although there is a suspicious <code>std::string::reserve</code>

I've only analyzed GCC's version. Here's what's going on: the code handles different kind of allocators. If the allocator has the trait of <code>_S_propagate_on_move_assign</code> or <code>_S_always_equal</code>, then the move is almost free, as you expect. This is the <code>if</code> in move <code>operator=</code>: <pre class="prettyprint"><code>if (!__str._M_is_local() && (_Alloc_traits::_S_propagate_on_move_assign() || _Alloc_traits::_S_always_equal())) // cheap move else assign(__str); </code></pre> If the condition is true (<code>_M_is_local()</code> means small string, description here), then the move is cheap. If it is false, then it calls normal <code>assign</code> (not the moving one). This is the case when either: <ul> <li>the string is small, so the <code>assign</code> will do a simple memcpy (cheap)</li> <li>or the allocator doesn't have the trait always-equal nor propagate-on-move-assign, so the assign will allocate (not cheap)</li> </ul> What does this mean? It means, that if you use the default allocator (or any allocator with traits mentioned earlier), then the move is still almost free. On the other hand, the generated code is unnecessarily huge, and can be improved I think. It should have a separate code for handling usual allocators, or have a better <code>assign</code> code (the problem is that <code>assign</code> doesn't check for <code>_M_is_local()</code>, but it does a capacity check, so the compiler cannot decide whether an allocation is needed or not, so it puts the allocation codepath into the executable unnecessarily - you can check out the exact details in the source code).

On the implementation of std::string moves

Tags:

c++

string

move-semantics

I was investigating the performance of moving std::string. For the longest time, I've regarded string moves as almost free, thinking the compiler will inline everything and it will only involve a few cheap assignments.

In fact, my mental model for moving is literally

string& operator=(string&& rhs) noexcept
{
    swap(*this, rhs);
    return *this;
}

friend void swap(string& x, string& y) noexcept
{
    // for disposition only
    unsigned char buf[sizeof(string)];
    memcpy(buf, &x, sizeof(string));
    memcpy(&x, &y, sizeof(string));
    memcpy(&y, buf, sizeof(string));
}

To the best of my understanding, this is a legal implementation if the memcpy is changed to assigning individual fields.

It is to my great surprise to find gcc's implementation of moving involves creating a new string and might possibly throw due to the allocations despite being noexcept.

Is this even conforming? Equally important, should I not think moving is almost free?

Bewilderingly, std::vector<char> compiles down to what I'd expect.

clang's implementation is much different, although there is a suspicious std::string::reserve

994

asked May 29 '18 14:05

Passer By

1 Answers

I've only analyzed GCC's version. Here's what's going on: the code handles different kind of allocators. If the allocator has the trait of _S_propagate_on_move_assign or _S_always_equal, then the move is almost free, as you expect. This is the if in move operator=:

if (!__str._M_is_local()
    && (_Alloc_traits::_S_propagate_on_move_assign()
      || _Alloc_traits::_S_always_equal()))
          // cheap move
else assign(__str);

If the condition is true (_M_is_local() means small string, description here), then the move is cheap.

If it is false, then it calls normal assign (not the moving one). This is the case when either:

the string is small, so the assign will do a simple memcpy (cheap)
or the allocator doesn't have the trait always-equal nor propagate-on-move-assign, so the assign will allocate (not cheap)

What does this mean?

It means, that if you use the default allocator (or any allocator with traits mentioned earlier), then the move is still almost free.

On the other hand, the generated code is unnecessarily huge, and can be improved I think. It should have a separate code for handling usual allocators, or have a better assign code (the problem is that assign doesn't check for _M_is_local(), but it does a capacity check, so the compiler cannot decide whether an allocation is needed or not, so it puts the allocation codepath into the executable unnecessarily - you can check out the exact details in the source code).

answered Oct 17 '22 07:10

geza

Related questions
                            
                                Substitution failure in an atomic constraint of template function requires-clause
                            
                                How to develop small software or application? [closed]
                            
                                Boost::asio, Shared Memory and Interprocess Communication
                            
                                How can I wrap a c++ class in php extension?
                            
                                alias template substitution and deduction failure with gcc
                            
                                How much does the C standard library extensibility affect C++ programs?
                            
                                Why c++ standard support function strftime but not strptime?
                            
                                packaging c++ program using boost libraries with cmake/cpack
                            
                                How to get AssImp to work properly?
                            
                                why does a conditional variable fix our power consumption?
                            
                                Is there any workaround to "reserve" a cache fraction?
                            
                                Fixed address is occupied in .NET
                            
                                Emscripten: emmake generating .js files
                            
                                static_assert on inline function gives error
                            
                                Accessing variable values within a macro
                            
                                Determining which overload was selected
                            
                                Qt5 QGeoPositionInfoSource::createDefaultSource() crashes on Android 5.0
                            
                                Finding out whether static initialization is over

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With