Vector storage in C++

Tags: c++, memory, stdvector

I wish to store a large vector of d-dimensional points (d fixed and small: <10).

If I define a Point as vector<int>, I think a vector<Point> would store in each position a pointer to a Point.

But if I define a Point as a fixed-size object such as std::tuple<int,int,...,int> or std::array<int, d>, will the program store all points in contiguous memory, or will the additional level of indirection remain?

If arrays do avoid the additional indirection, could this have a large impact on performance (by exploiting cache locality) while scanning the vector<Point>?

488

asked Oct 28 '16 10:10

Joseph Stack

2 Answers

If you define your Point as having contiguous data storage (e.g. struct Point { int a; int b; int c; }; or using std::array), then std::vector<Point> will store the Points in contiguous memory locations, so your memory layout will be:

p0.a, p0.b, p0.c, p1.a, p1.b, p1.c, ..., p(N-1).a, p(N-1).b, p(N-1).c

On the other hand, if you define Point as a vector<int>, then a vector<Point> has the layout of vector<vector<int>>, which is not contiguous as a whole: each inner vector object holds a pointer to its own dynamically allocated buffer. So the coordinates of a single Point are contiguous, but the coordinates of different Points are scattered across separate heap allocations.

The first solution is typically much more efficient than the second for sequential scans, as modern CPU caches and prefetchers favor accesses to contiguous memory locations.
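The two layouts described above can be checked directly by comparing addresses. The following is a minimal sketch, assuming d = 3 and a Point defined as std::array<int, 3>; the helper names stored_flat and inner_data_inside_outer are hypothetical and introduced only for illustration.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

// Illustrative fixed-size point; d = 3 is an assumption for this sketch.
using Point = std::array<int, 3>;

// Holds on mainstream implementations: std::array<int, 3> adds no padding.
static_assert(sizeof(Point) == 3 * sizeof(int), "Point has unexpected padding");

// True if coordinate j of point i lives exactly (3*i + j) ints past the
// start of the vector's buffer, i.e. the layout is p0.a, p0.b, p0.c, p1.a, ...
bool stored_flat(const std::vector<Point>& pts) {
    const auto base = reinterpret_cast<std::uintptr_t>(pts.data());
    for (std::size_t i = 0; i < pts.size(); ++i) {
        for (std::size_t j = 0; j < 3; ++j) {
            const auto addr = reinterpret_cast<std::uintptr_t>(&pts[i][j]);
            if (addr != base + (3 * i + j) * sizeof(int)) return false;
        }
    }
    return true;
}

// True if any inner vector keeps its ints inside the outer vector's buffer.
// Expected to be false: each inner vector owns a separate heap allocation.
bool inner_data_inside_outer(const std::vector<std::vector<int>>& pts) {
    const auto begin = reinterpret_cast<std::uintptr_t>(pts.data());
    const auto end = begin + pts.size() * sizeof(std::vector<int>);
    for (const auto& p : pts) {
        const auto addr = reinterpret_cast<std::uintptr_t>(p.data());
        if (!p.empty() && addr >= begin && addr < end) return true;
    }
    return false;
}
```

Calling stored_flat on a std::vector<Point> and inner_data_inside_outer on a std::vector<std::vector<int>> from main shows the difference: the first container holds the coordinates themselves, the second holds only the small vector objects that point to them.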

188

answered Sep 30 '22 21:09

Mr.C64

vector stores its elements themselves in contiguous memory. So yes, if the element type is an array or a tuple or, probably even better, a custom type, the extra indirection is avoided.

Performance-wise, as always, you have to measure it. Don't speculate. At least as far as scanning is concerned.
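One way to measure the scan is a small timing harness around the same loop for both layouts. This is only a sketch under stated assumptions: d = 3, and the names FlatPoint, NestedPoint, sum_all and time_ms are hypothetical. Results depend heavily on compiler flags, data size and allocator behavior, so no numbers are claimed here.

```cpp
#include <array>
#include <chrono>
#include <cstddef>
#include <vector>

constexpr std::size_t D = 3;              // assumed dimension for the sketch
using FlatPoint = std::array<int, D>;     // coordinates stored inline
using NestedPoint = std::vector<int>;     // coordinates stored on the heap

// Sums every coordinate of every point; works for both layouts because
// both point types are iterable ranges of int.
template <class Points>
long long sum_all(const Points& pts) {
    long long total = 0;
    for (const auto& p : pts)
        for (int x : p) total += x;
    return total;
}

// Runs f once and returns the elapsed wall-clock time in milliseconds.
template <class F>
double time_ms(F&& f) {
    const auto t0 = std::chrono::steady_clock::now();
    f();
    const auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}
```

A typical use is to fill a std::vector<FlatPoint> and a std::vector<NestedPoint> with the same data, call time_ms with a lambda that stores the result of sum_all in a variable, and print that variable so the compiler cannot discard the loop. Repeating the measurement several times and building with optimizations enabled gives more trustworthy numbers.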

However, there will almost certainly be a significant performance gain when you create those points in the first place, because you'll avoid a separate memory allocation for every vector that stores a point. Dynamic memory allocations are usually expensive compared with simply writing a few ints.
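The allocation difference can be made visible with a counting allocator. The sketch below is illustrative only: CountingAllocator, g_allocations, count_flat and count_nested are hypothetical names, d = 3 is assumed, and the exact counts can vary between standard library implementations (some debug modes allocate extra bookkeeping).

```cpp
#include <array>
#include <cstddef>
#include <memory>
#include <vector>

std::size_t g_allocations = 0;  // number of allocate() calls seen so far

// Minimal allocator that counts allocations and forwards to std::allocator.
template <class T>
struct CountingAllocator {
    using value_type = T;
    CountingAllocator() = default;
    template <class U>
    CountingAllocator(const CountingAllocator<U>&) noexcept {}
    T* allocate(std::size_t n) {
        ++g_allocations;
        return std::allocator<T>{}.allocate(n);
    }
    void deallocate(T* p, std::size_t n) noexcept {
        std::allocator<T>{}.deallocate(p, n);
    }
};
template <class T, class U>
bool operator==(const CountingAllocator<T>&, const CountingAllocator<U>&) noexcept { return true; }
template <class T, class U>
bool operator!=(const CountingAllocator<T>&, const CountingAllocator<U>&) noexcept { return false; }

// Builds n fixed-size points; the only allocation expected is the reserve.
std::size_t count_flat(std::size_t n) {
    using P = std::array<int, 3>;
    g_allocations = 0;
    std::vector<P, CountingAllocator<P>> pts;
    pts.reserve(n);
    for (std::size_t i = 0; i < n; ++i) pts.push_back(P{{1, 2, 3}});
    return g_allocations;
}

// Builds n vector-based points; each point needs its own heap buffer.
std::size_t count_nested(std::size_t n) {
    using P = std::vector<int, CountingAllocator<int>>;
    g_allocations = 0;
    std::vector<P, CountingAllocator<P>> pts;
    pts.reserve(n);
    for (std::size_t i = 0; i < n; ++i) pts.push_back(P{1, 2, 3});
    return g_allocations;
}
```

Comparing count_flat(n) with count_nested(n) for the same n shows the gap: the flat version needs a single buffer for all points, while the nested version needs at least one additional allocation per point.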

answered Sep 30 '22 22:09

Sergei Tachenov
