Just wondering if someone would confirm a few aliasing rules for me. I know that aliasing (i.e load-store issues) could cause the following type of code to be suboptimal, because we can't assume that <code>x, y, z</code> don't overlap: <pre class="prettyprint"><code>// case 1: void plus(size_t n, double *x, double *y, double *z) { for (size_t i = 0; i != n; ++i) z[i] = x[i] + y[i]; } </code></pre> I know that there's a C keyword <code>__restrict</code> that hints to the compiler that it shouldn't consider the overlapping case, and hence potentially generate better code: <pre class="prettyprint"><code>// case 2: void plus(size_t n, double *__restrict x, double *__restrict y, double *__restrict z) { // as above... } </code></pre> But how does aliasing work with C++ style code, where we would be dealing with container objects passed by reference, rather than the C-like examples above with raw pointers?? For instance, I'm assuming that there would be aliasing issues if we did the following: <pre class="prettyprint"><code>// case 3: void plus(std::vector<double> &x, std::vector<double> &y, std::vector<double> &z) { // similar to above... } </code></pre> And to move to a less trivial example, does it make any difference if the underlying data types in the containers are different?? At the implementation level most containers dynamically manage storage with pointers, so it's not clear to me how the compiler could ensure that the following doesn't alias: <pre class="prettyprint"><code>// case 4: void foo(std::vector<mytype1> &x, std::vector<mytype2> &y) { // interwoven operations on x, y... } </code></pre> I'm not trying to micro-optimise, but I'm wondering if it's currently better to pass restricted pointers to containers around, rather than references. EDIT: To clear up some terminology, as pointed out: <code>restrict</code> is the C99 keyword. There's also <code>__restrict</code> and <code>__restrict__</code> in various compilers, but they all do the same thing.

According to the strict-aliasing rule, you are not allowed to alias the same memory with pointers to different types (except <code>char*</code> and friends), so case 4 could only apply if one of the types was a <code>char*</code>. Case 3 though isn't all that different from case 1, as references are implemented as pointers on all compilers I know, though the standard doesn't demand that and an implementation is free to come up with something else.

It's not specific at all to C++. Consider this C99 bit: <pre class="prettyprint"><code>struct vector { double* data; size_t n; }; void plus(struct vector* restrict x, struct vector* restrict y, struct vector* restrict z) { // same deal as ever } </code></pre> Here, <code>restrict</code> buys us very little: <code>x->data</code>, <code>y->data</code> and <code>z->data</code> are all <code>double*</code> and are allowed to alias. This is exactly like case 1, even when using restrict. If there were a <code>restrict</code> keyword in C++ (or when using an extension), the best bet would probably to do <code>plus(vecA.size(), &vecA[0], &vecB[0], &vecB[0])</code>, using the same <code>plus</code> as in case 2. And in fact it's possible to do this right now, using a C89-style interface without <code>restrict</code> but that uses the keyword under the covers.

C++ aliasing rules

Tags:

Just wondering if someone would confirm a few aliasing rules for me.

I know that aliasing (i.e load-store issues) could cause the following type of code to be suboptimal, because we can't assume that x, y, z don't overlap:

// case 1: void plus(size_t n, double *x, double *y, double *z) {     for (size_t i = 0; i != n; ++i)         z[i] = x[i] + y[i]; }

I know that there's a C keyword __restrict that hints to the compiler that it shouldn't consider the overlapping case, and hence potentially generate better code:

// case 2: void plus(size_t n, double *__restrict x, double *__restrict y, double *__restrict z) { // as above... }

But how does aliasing work with C++ style code, where we would be dealing with container objects passed by reference, rather than the C-like examples above with raw pointers??

For instance, I'm assuming that there would be aliasing issues if we did the following:

// case 3: void plus(std::vector<double> &x, std::vector<double> &y, std::vector<double> &z) { // similar to above... }

And to move to a less trivial example, does it make any difference if the underlying data types in the containers are different?? At the implementation level most containers dynamically manage storage with pointers, so it's not clear to me how the compiler could ensure that the following doesn't alias:

// case 4: void foo(std::vector<mytype1> &x, std::vector<mytype2> &y) { // interwoven operations on x, y... }

I'm not trying to micro-optimise, but I'm wondering if it's currently better to pass restricted pointers to containers around, rather than references.

EDIT: To clear up some terminology, as pointed out: restrict is the C99 keyword. There's also __restrict and __restrict__ in various compilers, but they all do the same thing.

800

asked Jun 12 '11 07:06

Darren Engwirda

2 Answers

According to the strict-aliasing rule, you are not allowed to alias the same memory with pointers to different types (except char* and friends), so case 4 could only apply if one of the types was a char*.

Case 3 though isn't all that different from case 1, as references are implemented as pointers on all compilers I know, though the standard doesn't demand that and an implementation is free to come up with something else.

117

answered Oct 04 '22 02:10

Xeo

It's not specific at all to C++. Consider this C99 bit:

struct vector {     double* data;     size_t n; };  void plus(struct vector* restrict x, struct vector* restrict y, struct vector* restrict z) {     // same deal as ever }

Here, restrict buys us very little: x->data, y->data and z->data are all double* and are allowed to alias. This is exactly like case 1, even when using restrict.

If there were a restrict keyword in C++ (or when using an extension), the best bet would probably to do plus(vecA.size(), &vecA[0], &vecB[0], &vecB[0]), using the same plus as in case 2. And in fact it's possible to do this right now, using a C89-style interface without restrict but that uses the keyword under the covers.

answered Oct 04 '22 00:10

Luc Danton

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

C++ aliasing rules

Tags:

Darren Engwirda

People also ask

2 Answers

Xeo

Luc Danton

Recent Activity

Donate For Us

C++ aliasing rules

Tags:

Darren Engwirda

People also ask

2 Answers

Xeo

Luc Danton

Related questions

Recent Activity

Donate For Us