Vector move constructor slower than copy constructor

Tags:

c++

csv

c++11

move-semantics

vector

I'm working on my first C++ project, which is a CSV parser (full source code here). It's at the point where it's working, and now I want to do basic refactoring / improve performance.

Currently the way the parser works is by returning each row as a std::vector<std::string>, and I figured that instead of allocating a new vector and a new string every time I'd just have an internal vector and internal string with reserved memory that I'd clear again and again.

That worked, and I started looking at other places where I might be doing memory allocation, and I saw this function which copies the internal vector, and then clears it:

auto add_row() -> std::vector<std::string> {
  auto row(m_bufvec);
  m_bufvec.clear();
  return row;
}

I figured that if I instead changed this line

auto row(m_bufvec);

to

auto row(std::move(m_bufvec));

it'd result in some sort of speed boost because according to http://en.cppreference.com/w/cpp/container/vector/vector it would take constant time instead of linear. To my surprise, it made the parser significantly slower (according to my really rough benchmark of running time ./main.o over this file).

I'm completely new to optimization, benchmarking, and everything else that comes with tuning C++ code. Perhaps this optimization is useless even if it worked, but regardless, I'm curious as to why std::move causes a slowdown. Am I missing something?

576

asked Jan 26 '17 05:01

m0meni

2 Answers

When you copy m_bufvec, its capacity is unchanged, but when you move from it, its storage is handed over to row and m_bufvec is left with no capacity. Thus, later when you refill m_bufvec, a logarithmic number of reallocations are needed to grow its capacity again, and such allocations can easily be your performance bottleneck.

The move version makes that function faster. But it makes other code slower. Micro optimizations do not reliably make programs faster.
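To make the capacity difference visible, here is a minimal sketch. The helper names capacity_after_copy and capacity_after_move are made up for illustration, and the local buf stands in for m_bufvec. Strictly speaking the standard only promises that a moved-from vector is in a valid but unspecified state; the mainstream standard libraries leave it empty with zero capacity, which is what this sketch assumes.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper: capacity left in the buffer after copying a row out of it.
std::size_t capacity_after_copy() {
    std::vector<std::string> buf;   // stands in for m_bufvec
    buf.reserve(64);
    buf.push_back("a");
    auto row(buf);                  // copy: buf keeps its own storage
    (void)row;
    buf.clear();                    // clear() leaves capacity unchanged
    return buf.capacity();
}

// Hypothetical helper: capacity left in the buffer after moving a row out of it.
std::size_t capacity_after_move() {
    std::vector<std::string> buf;   // stands in for m_bufvec
    buf.reserve(64);
    buf.push_back("a");
    auto row(std::move(buf));       // move: row takes over buf's storage
    (void)row;
    buf.clear();                    // make the moved-from vector definitely empty
    return buf.capacity();          // assumed to be 0 on mainstream implementations
}
```

Calling both helpers should show the copy variant still holding at least the 64 reserved slots, while the move variant has to start growing from scratch.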

Edit by OP:

The solution proposed by Cheers and hth. - Alf in the comments, calling m_bufvec.reserve(row.size()) after the move, fixes the problem and confirms that the above reasoning was correct. Moreover, it is more efficient than the copying version (albeit only slightly) because, in his words:

"you avoid copying the items [in bufvec]. If the items are simple integer values, that doesn't matter so much. If the items are e.g. strings, with dynamic allocation, then it really does matter."

137

answered Sep 28 '22 15:09

Yakk - Adam Nevraumont

Indeed the first version is expected to be faster. The reason is:

auto row(m_bufvec);

invokes the copy constructor, which allocates the memory needed for row in one go. m_bufvec also keeps its allocated memory. As a result, the buffer never has to grow element by element, which matters because every growth step relocates the existing elements.

In the second version, auto row(std::move(m_bufvec)); m_bufvec's memory becomes owned by row, and this operation is faster than the copy constructor. But since m_bufvec has lost its allocated memory, when you later fill it element by element it will go through many reallocations and (expensive) relocations. The number of reallocations is usually logarithmic in the final size of the vector.
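As a rough illustration of the "logarithmic number of re-allocations" point, the sketch below counts how often the capacity changes while a vector is filled element by element. The function name count_reallocations and its parameters are assumptions made for this example, not part of the parser.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: counts capacity changes while appending n fields to a
// vector that starts with `reserved` slots of capacity.
std::size_t count_reallocations(std::size_t n, std::size_t reserved) {
    std::vector<std::string> buf;
    buf.reserve(reserved);
    std::size_t reallocations = 0;
    std::size_t last_capacity = buf.capacity();
    for (std::size_t i = 0; i < n; ++i) {
        buf.push_back("field");
        if (buf.capacity() != last_capacity) {   // the buffer had to grow
            ++reallocations;
            last_capacity = buf.capacity();
        }
    }
    return reallocations;
}
```

With enough capacity reserved up front, for example count_reallocations(1000, 1000), the count should be zero. Starting from zero capacity, the exact number depends on the library's growth factor, but it stays far below the number of elements.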

EDIT

The above explains the "unexpected" results in the main question. Finally, it turns out that the "ideal" for this operation is to move then reserve immediately:

auto row(std::move(m_bufvec));
m_bufvec.reserve(row.size());
return row;

This achieves the three goals:

no element-by-element allocation
no useless initialization for bufvec
no useless copying of elements from m_bufvec into row.
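Put together, the move-then-reserve idea could look roughly like the sketch below. Only m_bufvec and add_row come from the question; the RowBuffer class, add_field and buffered_capacity are hypothetical scaffolding so the snippet compiles on its own.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the parser; not the asker's actual class.
class RowBuffer {
public:
    // Appends one parsed field to the internal buffer.
    void add_field(std::string field) { m_bufvec.push_back(std::move(field)); }

    // Hands the buffered row to the caller without copying its strings.
    auto add_row() -> std::vector<std::string> {
        auto row(std::move(m_bufvec));   // constant time: row takes over the storage
        m_bufvec.clear();                // moved-from state is unspecified; make it empty
        m_bufvec.reserve(row.size());    // one allocation sized for the next row
        return row;
    }

    // Exposed only so the effect of reserve() can be observed.
    std::size_t buffered_capacity() const { return m_bufvec.capacity(); }

private:
    std::vector<std::string> m_bufvec;
};
```

Note that reserve still performs one allocation per row, so this does not make add_row allocation free. It only replaces the repeated growth steps with a single allocation and avoids copying the strings.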

answered Sep 28 '22 16:09

A.S.H
