I'm currently making an application using vectors in C++.
I know that premature optimization is the root of all evil,
but I really can't help being curious.
I'm adding parts of other vectors into another vector.
We'll say the vector has a fixed size of 300 that never changes.
Since I always append to the end of the vector,
is it faster to do:

a.reserve(300);
a.insert(a.end(), b.begin(), b.end());

or would it be faster to loop through the vector I want to append and add each item individually (while still reserving beforehand) with push_back
or emplace_back? (I'm unsure which of those two is faster.)
Can anyone help me with this?
push_back: adds a new element at the end of the container, after its current last element; the content of val is copied (or moved) into the new element. emplace_back: constructs a new element in place at the end of the container, right after its current last element.
Both are used to add an element to the container. The advantage of emplace_back is that it does in-place construction and avoids an unnecessary copy (or move) of the object. For primitive data types it does not matter which one you use, but for class types emplace_back is preferred for efficiency reasons.
The primary difference is that insert takes an object whose type is the same as the container's element type and copies that argument into the container, whereas emplace takes a more or less arbitrary argument list and constructs an element in the container directly from those arguments.
In other words, emplace_back constructs the object immediately inside the vector, while push_back first constructs a temporary object and then copies (or moves) it into the vector.
Here's a general principle: when a library provides both do_x_once and do_x_in_batch, the latter should be at least as fast as calling do_x_once in a simple loop. If it isn't, the library is very badly implemented, since a simple loop would be enough to produce a faster version. Often such batch functions/methods can perform additional optimizations because they have knowledge of the data structure's internals.
So insert should be at least as fast as push_back in a loop. In this particular case, a smart implementation of insert can do a single reserve for all the elements you want to insert, whereas push_back has to check the vector's capacity every time. Don't try to outsmart the library :)
I guess it really depends on the compiler (library implementation), compilation options, and architecture. Doing a quick benchmark in VS2005 without optimization (/Od) on an Intel Xeon:

std::vector<int> a;
std::vector<int> b;
// fill 'a' with random values for giggles
timer.start();
// copy values from 'a' to 'b'
timer.stop();

I get these results for 10 000 000 items using these different methods of "copy values...":

b.push_back(a[i]); : 0.808 sec
b[i] = a[i]; : 0.264 sec
b.insert(b.end(), a.begin(), a.end()); : 0.021 sec (no significant difference with reserve first)
std::copy(a.begin(), a.end(), std::back_inserter(b)); : 0.944 sec (0.871 with reserve first)
memcpy(&(b[0]), &(a[0]), 10000000*sizeof(int)); : 0.061 sec

With optimizations turned on (/Ox), however, it's a different story. I had to increase the size to 100 000 000 to get more differentiation:
What's interesting to note is that with or without optimizations, the insert method scaled linearly. The other methods were clearly inefficient without optimizations and still couldn't get quite as fast with them. As James Kanze noted, it's different on g++. Run a test on your own platform to validate.