Is it more efficient to preallocate a vector?

Tags:

stl

In C++ Primer fourth edition, by Stanley B.Lippman, Josee Lajoie and Barbara E. Moo it states:

Because vectors grow efficiently, it is usually best to let the vector grow by adding elements to it dynamically as the element values are known.

and

Readers accustomed to using c or java might expect that because vector elements are stored contiguously, it would be best to preallocate the vector at its expected size. In fact the contrary is the case...

and

Allthough we can preallocate a given number of elements in a vector, it is usually more efficient to define an empty vector and add elements to it.

Assuming this is correct (the authors are as reputable as they come, one is a co-author of C++ itself) then can anyone give me a case that proves this statement, and explain why?

201

asked Aug 09 '12 16:08

2 Answers

It depends.

If you don't know what the final size will be, then let the vector allocate using its allocation scheme (usually doubles each time, or somewhere around there). This way you avoid reallocating for every single element:

Click to copy

std::vector<int> v;  // good: for (/* populate v */) // unknown number of iterations {     v.push_back(i); // possible reallocation, but not often }  // bad: for (/* populate v */) // unknown number of iterations {     v.reserve(v.size() + 1); // definite reallocation, every time     v.push_back(i); // (no reallocation) }

But if you know ahead of time you won't be reallocating, then preallocate:

Click to copy

std::vector<int> v;  // good: v.reserve(10);  for (/* populate v */) // only 10 iterations (for example) {     v.push_back(i); // no reallocations }  // not bad, but not the best: for (/* populate v */) // only 10 iterations (for example) {     v.push_back(i); // possible reallocation, but not often (but more than needed!) }

110

answered Sep 20 '22 20:09

GManNickG

I timed this simple example:

Click to copy

#include<iostream> #include<vector>  int main() {      int limit = 100 * 1000 * 1000;     std::vector<long> my_vec;     my_vec.reserve(limit); // comment out this line to not preallocate      for (int i=0; i < limit; i++) {         my_vec.push_back(i);     }      long my_sum = 0;     for (int i=0; i < limit; i++) {         my_sum += my_vec[i];     }      std::cout << my_sum << std::endl;     return 0; }

Complied with:

Click to copy

g++ -std=c++11 -O2 my_file.cpp -o my_exec

And found the difference to be substantial:

Without preallocation:

Click to copy

real    0m3.366s user    0m1.656s sys     0m1.660s

With preallocation:

Click to copy

real    0m1.688s user    0m0.732s sys     0m0.936s

My conclusion here is: If building a vector is a big part of the program, then preallocating for efficiency makes sense. However, building a larger vector over and over is unlikely, and thus it is rarely a bottle neck. However, using reserve() has other advantages besides preallocating.

Bjarne Stroustrup in The C++ programming language (4th addition) has this to say:

I used to be careful about using reserve() when I was reading into a vector. I was surprised to find that for essentially all my uses, calling reserve() did not measurably affect performance. The default growth strategy worked just as well as my estimates, so I stopped trying to improve performance using reserve(). Instead I use it to increase predictability of reallocation delays and to prevent invalidation of pointers and iterators.

answered Sep 18 '22 20:09

Akavall

Related questions
                            
                                How is dynamic_cast implemented
                            
                                Robust Random Number Generation [closed]
                            
                                Programming slim C++ programs (like uTorrent) for Windows [closed]
                            
                                Do I really need to implement user-provided constructor for const objects?
                            
                                Why does same_as concept check type equality twice?
                            
                                When are static and global variables initialized?
                            
                                How to understand two pairs of parentheses in this code fragment?
                            
                                Sharing precompiled headers between projects in Visual Studio
                            
                                Learning to read GCC assembler output
                            
                                Why/when is __declspec( dllimport ) not needed?
                            
                                Eclipse multiple tab rows [duplicate]
                            
                                Overloading operator<<: cannot bind lvalue to ‘std::basic_ostream<char>&&’
                            
                                How to use lambda function as hash function in unordered_map?
                            
                                Constexpr if with a non-bool condition
                            
                                How to name a thread in Linux? [duplicate]
                            
                                What does C++ struct syntax "a : b" mean
                            
                                What library does ld option -lrt refer to (Bionic libc)?
                            
                                Is std::cout guaranteed to be initialized?
                            
                                Near and Far pointers
                            
                                Difference between unsigned and unsigned int in C++

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is it more efficient to preallocate a vector?

Tags:

c++

stl

dangerousdave

People also ask

2 Answers

GManNickG

Akavall

Recent Activity

Donate For Us