Why does std::vector reserve not "double" its capacity, while resize does?

Tags: c++, vector, resize, capacity

I just found out that std::vector<T>::resize "doubles" its capacity even when resizing to one element above the current size:

    std::vector<int> v(50);
    v.resize(51);
    std::cout << v.capacity() << std::endl;

This program outputs 100 with GCC and Clang, and 75 with Visual C++. However, when I switch from resize to reserve:

    std::vector<int> v(50);
    v.reserve(51);
    std::cout << v.capacity() << std::endl;

The output is 51 with all three compilers.

I wonder why implementations use a different expansion strategy for resize and reserve. It seems inconsistent, and I would expect the same behavior here.

I am just adding a link to a motivation for my question, where the impact on performance is reported: Why are C++ STL vectors 1000x slower when doing many reserves?

Adding a quote from the C++11 Standard to clarify the requirements for reserve; §23.3.6.3(2):

After reserve(), capacity() is greater or equal to the argument of reserve if reallocation happens...
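The postcondition quoted above can be checked directly. The following is a minimal sketch, not part of the original question; the helper name reserve_postcondition_holds is made up for illustration. It relies on the rest of the same paragraph in the standard, which says that reallocation happens if and only if the current capacity is less than the argument, and that capacity() keeps its previous value otherwise.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: calls v.reserve(n) and reports whether the result
// matches the postcondition the standard gives for reserve().
bool reserve_postcondition_holds(std::vector<int>& v, std::size_t n) {
    const std::size_t size_before = v.size();
    const std::size_t capacity_before = v.capacity();

    v.reserve(n);

    // reserve() changes only the capacity, never the number of elements.
    assert(v.size() == size_before);

    if (n > capacity_before) {
        // Reallocation happened: only a lower bound on capacity() is promised.
        return v.capacity() >= n;
    }
    // No reallocation: capacity() must be unchanged.
    return v.capacity() == capacity_before;
}
```

For the vector from the question (50 elements, then reserve(51)), this helper should return true whether the implementation picks a capacity of exactly 51 or something larger.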

Some additional thoughts. From the C++11 Standard:

Complexity: The complexity is linear in the number of elements inserted plus the distance to the end of the vector.

This effectively implies amortized constant complexity for inserting a single element at the end. However, it applies only to vector modifiers, such as push_back or insert (§23.3.6.5).

resize is not listed among the modifiers. It is listed in the vector capacity section (§23.3.6.3), and there are no complexity requirements for resize.

However, the vector overview section (§23.3.6.1) states:

it (vector) supports (amortized) constant time insert and erase operations at the end

The question is whether resize(size()+1) is considered to be "insertion at the end".
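One way to probe this empirically is to count how often the capacity changes while a vector grows one element at a time. The sketch below is an illustration only; the function names are invented here, and the counts it returns depend on the standard library in use. On the implementations mentioned above, both variants would be expected to reallocate only a logarithmic number of times, which is consistent with treating resize(size()+1) like an insertion at the end, although the standard does not promise this for resize.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: grows an empty vector to n elements, one element per
// step, and counts how many times capacity() changes along the way.
// `grow` must add exactly one element to the vector it is given.
template <typename GrowOne>
std::size_t count_reallocations(std::size_t n, GrowOne grow) {
    std::vector<int> v;
    std::size_t reallocations = 0;
    std::size_t last_capacity = v.capacity();

    while (v.size() < n) {
        grow(v);
        if (v.capacity() != last_capacity) {
            // During growth, the capacity only changes when storage is reallocated.
            ++reallocations;
            last_capacity = v.capacity();
        }
    }
    assert(v.size() == n);
    return reallocations;
}

// Growth through a modifier, which has a complexity requirement.
std::size_t reallocations_with_push_back(std::size_t n) {
    return count_reallocations(n, [](std::vector<int>& v) { v.push_back(0); });
}

// Growth through resize(size() + 1), which has no stated complexity requirement.
std::size_t reallocations_with_resize(std::size_t n) {
    return count_reallocations(n, [](std::vector<int>& v) { v.resize(v.size() + 1); });
}
```

Comparing reallocations_with_push_back(1000) with reallocations_with_resize(1000) on a given toolchain shows whether that implementation grows geometrically in both cases.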

260

asked Jan 31 '18 08:01

Daniel Langr

2 Answers

As far as I can tell, neither resize nor reserve is required to have the demonstrated behaviour. Both are, however, allowed to behave this way: as far as the standard is concerned, either of them could allocate the exact amount, and either of them could multiply the previous allocation.

Each allocation strategy has its advantages. The advantage of allocating the exact amount is that it has no memory overhead when the maximum allocation is known beforehand. The advantage of multiplying is that it maintains the amortized constant property when mixed with end-insertion operations.
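The trade-off can be made concrete with a simplified cost model. The sketch below is not any library's actual growth policy; it assumes a vector that grows from empty by one element at a time, that every reallocation copies all existing elements, and that the multiplying strategy uses a factor of 2. The function names are made up for illustration.

```cpp
#include <cassert>
#include <cstddef>

// Model: capacity always equals size, so adding the element at index `size`
// reallocates and copies the `size` elements that already exist.
std::size_t copies_with_exact_fit(std::size_t n) {
    std::size_t copies = 0;
    for (std::size_t size = 0; size < n; ++size) {
        copies += size;
    }
    return copies;
}

// Model: when the storage is full, the capacity doubles (or becomes 1 when
// empty), and the existing elements are copied once into the new storage.
std::size_t copies_with_doubling(std::size_t n) {
    std::size_t capacity = 0;
    std::size_t copies = 0;
    for (std::size_t size = 0; size < n; ++size) {
        if (size == capacity) {
            copies += size;
            capacity = (capacity == 0) ? 1 : capacity * 2;
        }
        // There is now room for the element being added.
        assert(size < capacity);
    }
    return copies;
}
```

Under these assumptions, growing to 1000 elements costs 499500 element copies with exact-fit allocation and 1023 with doubling, which is the quadratic versus linear difference behind the amortized constant property.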

The approach chosen by the tested implementations has the advantage that it allows both strategies when resizing. To use one strategy, one can reserve and then resize. To use the other, just resize. Of course, one has to be aware of the unspecified behaviour to take advantage of this. This advantage may or may not be the reasoning behind the choice of these implementations.
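The two usage patterns look like this in code. This is only a sketch with invented function names, and the capacities it returns are not guaranteed by the standard. With the vector from the question (50 elements resized to 51), the tested implementations would be expected to give 51 for the first pattern, since reserve allocated exactly 51 there, and 100 or 75 for the second.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Pattern 1: reserve first, then resize. On the tested implementations this
// behaves like an exact-fit allocation.
std::size_t capacity_with_reserve_then_resize(std::size_t initial, std::size_t target) {
    std::vector<int> v(initial);
    v.reserve(target);  // capacity() >= target afterwards
    v.resize(target);   // fits into the storage reserved above
    assert(v.capacity() >= v.size());
    return v.capacity();
}

// Pattern 2: resize only. On the tested implementations this grows the
// capacity by a multiplicative factor when a reallocation is needed.
std::size_t capacity_with_resize_only(std::size_t initial, std::size_t target) {
    std::vector<int> v(initial);
    v.resize(target);
    assert(v.capacity() >= v.size());
    return v.capacity();
}
```

Printing capacity_with_reserve_then_resize(50, 51) next to capacity_with_resize_only(50, 51) shows which strategy a given standard library applies in each case.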

One might consider it a failure of the vector API, as specified in the standard, that expressing the intended reallocation behaviour is not possible (in a way that is guaranteed by the standard).

100

answered Sep 21 '22 15:09

eerorika

When you resize beyond the current capacity, you already "demonstrate" that you don't want to reserve just the right capacity. On the other hand, if you use reserve, you explicitly ask for the right capacity. If reserve used the same strategy as resize, there would be no way to reserve just the right amount.

In this sense, resize without reserve is for the lazy ones, or for cases where you don't know the exact amount to reserve. You call reserve if you know what capacity you need. Those are two different scenarios.
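The scenario where the needed capacity is known up front can be sketched as follows. The function name is invented for illustration. The check relies on the standard's guarantee that, after reserve(n), insertions do not reallocate until the size would exceed capacity().

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical check: reserves room for n elements once, appends n elements,
// and reports whether the storage stayed in place the whole time.
bool appends_without_reallocation(std::size_t n) {
    std::vector<int> v;
    v.reserve(n);
    assert(v.capacity() >= n);

    const int* const storage = v.data();        // address of the reserved storage
    const std::size_t reserved = v.capacity();  // may be larger than n

    for (std::size_t i = 0; i < n; ++i) {
        v.push_back(static_cast<int>(i));
    }

    // Same address and same capacity mean no reallocation took place.
    return v.data() == storage && v.capacity() == reserved;
}
```

A call such as appends_without_reallocation(1000) should return true on a conforming implementation, regardless of whether reserve allocated exactly 1000 elements or more.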

PS: As StoryTeller pointed out, reserve is also not required by the standard to reserve the exact amount that is asked for. Nevertheless, I think my main argument still holds: resize (without reserve) and reserve are meant for different scenarios, where you either give a hint of how much you want to reserve, or don't care about the actual capacity and just want the container sized to what you ask for.

answered Sep 18 '22 15:09

463035818_is_not_a_number
