Performance issue for vector::size() in a loop in C++

People also ask

Is std::vector fast?

A std::vector can never be faster than an array, as it has (a pointer to the first element of) an array as one of its data members. But the difference in run-time speed is slim and absent in any non-trivial program. One reason for this myth to persist, are examples that compare raw arrays with mis-used std::vectors.

What is the procedure we call to increase the size of the vector?

C++ Vector Library - resize() Function The C++ function std::vector::resize() changes the size of vector. If n is smaller than current size then extra elements are destroyed. If n is greater than current container size then new elements are inserted at the end of vector.

Does vector double its size?

The only time it doubles is when there's one element in the vector and you add another one. otherwise the size increases by one when you push_back one element to the vector.

How do I run a vector loop?

Use a for loop and reference pointer In C++ , vectors can be indexed with []operator , similar to arrays. To iterate through the vector, run a for loop from i = 0 to i = vec. size() . Where i is the index.

In theory, it is called each time, since a for loop:

for(initialization; condition; increment)
    body;

is expanded to something like

{
    initialization;
    while(condition)
    {
        body;
        increment;
    }
}

(notice the curly braces, because initialization is already in an inner scope)

In practice, if the compiler understands that a piece of your condition is invariant through all the duration of the loop and it does not have side-effects, it can be smart enough to move it out. This is routinely done with strlen and things like that (that the compiler knows well) in loops where its argument isn't written.

However it must be noted that this last condition isn't always trivial to prove; in general, it's easy if the container is local to the function and is never passed to external functions; if the container is not local (e.g. it's passed by reference - even if it's const) and the loop body contains calls to other functions, the compiler often has to assume that such functions may alter it, thus blocking the hoisting of the length calculation.

Doing that optimization by hand is worthy if you know that a part of your condition is "expensive" to evaluate (and such condition usually isn't, since it usually boils down to a pointer subtraction, which is almost surely inlined).

Edit: as others said, in general with containers it's better to use iterators, but for vectors it's not so important, because random access to elements via operator[] is guaranteed to be O(1); actually with vectors it usually is a pointer sum (vector base+index) and dereference vs the pointer increment (preceding element+1) and dereference of iterators. Since the target address is still the same, I don't think that you can gain something from iterators in terms of cache locality (and even if so, if you're not walking big arrays in tight loops you shouldn't even notice such kind of improvements).

For lists and other containers, instead, using iterators instead of random access can be really important, since using random access may mean walk every time the list, while incrementing an iterator is just a pointer dereference.

It's 'called' each time, but I put called into quotes because it really probably is just an inline method call, so you don't have to worry about its performance.

Why not use vector<int>::iterator instead?

The size() member function is called each time, but it would be a really bad implementation that wouldn't inline it, and a strange one where it wouldn't be a simple access of a fixed datum or a subtraction of two pointers.
Anyway, you shouldn't worry yourself with such trivialities until you have profiled your application and found out that this is a bottleneck.

However, what you should pay attention to is:

The correct type for a vector's index is std::vector<T>::size_type.
There are types (some iterators, for example) where i++ might be slower than ++i.

Therefore, the loop should be:

for(vector<int>::size_type i=0; i<var.size(); ++i)
  ...

It must be called everytime because size() might return a different value everytime.

Therefore there's no big choice it simply must be.

Related questions
                            
                                Why is this C++11 code containing rand() slower with multiple threads than with one?
                            
                                How to profile memory usage?
                            
                                Access friend function defined in class
                            
                                Why does enable_if_t in template arguments complains about redefinitions?
                            
                                forcing use of cbegin()/cend() in range-based for
                            
                                What happens to global variables declared in a DLL?
                            
                                The use of enum without a variable name
                            
                                dependent scope; need typename in front;
                            
                                Why shared_from_this can't be used in constructor from technical standpoint?
                            
                                CreateProcess from memory buffer
                            
                                What is dynamic initialization of object in c++?
                            
                                Return a const reference or a copy in a getter function?
                            
                                Linking Fortran and C++ binaries using gcc
                            
                                Efficient 128-bit addition using carry flag
                            
                                connected components in OpenCV
                            
                                Why installing vcredist_x86.exe doesn't fix SideBySide error when I develop an EXE on one machine and run it on another one?
                            
                                Can you avoid locking by guaranteeing that multiple threads won't access the same memory?
                            
                                Returning an empty string : efficient way in c++
                            
                                Why are multiple pre-increments allowed in C++ but not in C? [duplicate]
                            
                                Why is __int128_t faster than long long on x86-64 GCC?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Performance issue for vector::size() in a loop in C++

Tags:

c++

performance

for-loop

vector

stdvector

People also ask

Recent Activity

Donate For Us