Taking into consideration the entire C++11 standard, is it possible for any conforming implementation to succeed the first assertion below but fail the latter? <pre class="prettyprint"><code>#include <cassert> int main(int, char**) { const int I = 5, J = 4, K = 3; const int N = I * J * K; int arr1d[N] = {0}; int (&arr3d)[I][J][K] = reinterpret_cast<int (&)[I][J][K]>(arr1d); assert(static_cast<void*>(arr1d) == static_cast<void*>(arr3d)); // is this necessary? arr3d[3][2][1] = 1; assert(arr1d[3 * (J * K) + 2 * K + 1] == 1); // UB? } </code></pre> If not, is this technically UB or not, and does that answer change if the first assertion is removed (is <code>reinterpret_cast</code> guaranteed to preserve addresses here?)? Also, what if the reshaping is done in the opposite direction (3d to 1d) or from a 6x35 array to a 10x21 array? EDIT: If the answer is that this is UB because of the <code>reinterpret_cast</code>, is there some other strictly compliant way of reshaping (e.g., via <code>static_cast</code> to/from an intermediate <code>void *</code>)?

Update 2021-03-20: This same question was asked on Reddit recently and it was pointed out that my original answer is flawed because it does not take into account this aliasing rule: <blockquote> If a program attempts to access the stored value of an object through a glvalue whose type is not similar to one of the following types the behavior is undefined: <ul> <li>the dynamic type of the object,</li> <li>a type that is the signed or unsigned type corresponding to the dynamic type of the object, or</li> <li>a char, unsigned char, or std::byte type.</li> </ul> </blockquote> Under the rules for similarity, these two array types are not similar for any of the above cases and therefore it is technically undefined behaviour to access the 1D array through the 3D array. (This is definitely one of those situations where, in practice, it will almost certainly work with most compilers/targets) Note that the references in the original answer refer to an older C++11 draft standard <h3>Original answer:</h3> <h3> <code>reinterpret_cast</code> of references</h3> The standard states that an lvalue of type <code>T1</code> can be <code>reinterpret_cast</code> to a reference to <code>T2</code> if a pointer to <code>T1</code> can be <code>reinterpret_cast</code> to a pointer to <code>T2</code> (§5.2.10/11): <blockquote> An lvalue expression of type <code>T1</code> can be cast to the type “reference to <code>T2</code>” if an expression of type “pointer to <code>T1</code>” can be explicitly converted to the type “pointer to <code>T2</code>” using a reinterpret_cast. </blockquote> So we need to determine if a <code>int(*)[N]</code> can be converted to an <code>int(*)[I][J][K]</code>. <h3> <code>reinterpret_cast</code> of pointers</h3> A pointer to <code>T1</code> can be <code>reinterpret_cast</code> to a pointer to <code>T2</code> if both <code>T1</code> and <code>T2</code> are standard-layout types and <code>T2</code> has no stricter alignment requirements than <code>T1</code> (§5.2.10/7): <blockquote> When a prvalue v of type “pointer to T1” is converted to the type “pointer to cv T2”, the result is <code>static_cast<cv T2*>(static_cast<cv void*>(v))</code> if both <code>T1</code> and <code>T2</code> are standard-layout types (3.9) and the alignment requirements of <code>T2</code> are no stricter than those of <code>T1</code>, or if either type is void. </blockquote> <ol> <li> Are <code>int[N]</code> and <code>int[I][J][K]</code> standard-layout types? <code>int</code> is a scalar type and arrays of scalar types are considered to be standard-layout types (§3.9/9). <blockquote> Scalar types, standard-layout class types (Clause 9), arrays of such types and cv-qualified versions of these types (3.9.3) are collectively called standard-layout types. </blockquote> </li> <li> Does <code>int[I][J][K]</code> have no stricter alignment requirements than <code>int[N]</code>. The result of the <code>alignof</code> operator gives the alignment requirement of a complete object type (§3.11/2). <blockquote> The result of the <code>alignof</code> operator reflects the alignment requirement of the type in the complete-object case. </blockquote> Since the two arrays here are not subobjects of any other object, they are complete objects. Applying <code>alignof</code> to an array gives the alignment requirement of the element type (§5.3.6/3): <blockquote> When <code>alignof</code> is applied to an array type, the result shall be the alignment of the element type. </blockquote> So both array types have the same alignment requirement. </li> </ol> That makes the <code>reinterpret_cast</code> valid and equivalent to: <pre class="prettyprint"><code>int (&arr3d)[I][J][K] = *reinterpret_cast<int (*)[I][J][K]>(&arr1d); </code></pre> where <code>*</code> and <code>&</code> are the built-in operators, which is then equivalent to: <pre class="prettyprint"><code>int (&arr3d)[I][J][K] = *static_cast<int (*)[I][J][K]>(static_cast<void*>(&arr1d)); </code></pre> <h3> <code>static_cast</code> through <code>void*</code> </h3> The <code>static_cast</code> to <code>void*</code> is allowed by the standard conversions (§4.10/2): <blockquote> A prvalue of type “pointer to cv <code>T</code>,” where <code>T</code> is an object type, can be converted to a prvalue of type “pointer to cv void”. The result of converting a “pointer to cv <code>T</code>” to a “pointer to cv void” points to the start of the storage location where the object of type <code>T</code> resides, as if the object is a most derived object (1.8) of type <code>T</code> (that is, not a base class subobject). </blockquote> The <code>static_cast</code> to <code>int(*)[I][J][K]</code> is then allowed (§5.2.9/13): <blockquote> A prvalue of type “pointer to cv1 <code>void</code>” can be converted to a prvalue of type “pointer to cv2 <code>T</code>,” where <code>T</code> is an object type and cv2 is the same cv-qualification as, or greater cv-qualification than, cv1. </blockquote> So the cast is fine! But are we okay to access objects through the new array reference? <h3>Accessing array elements</h3> Performing array subscripting on an array like <code>arr3d[E2]</code> is equivalent to <code>*((E1)+(E2))</code> (§5.2.1/1). Let's consider the following array subscripting: <pre class="prettyprint"><code>arr3d[3][2][1] </code></pre> Firstly, <code>arr3d[3]</code> is equivalent to <code>*((arr3d)+(3))</code>. The lvalue <code>arr3d</code> undergoes array-to-pointer conversion to give a <code>int(*)[2][1]</code>. There is no requirement that the underlying array must be of the correct type to do this conversion. The pointers value is then accessed (which is fine by §3.10) and then the value 3 is added to it. This pointer arithmetic is also fine (§5.7/5): <blockquote> If both the pointer operand and the result point to elements of the same array object, or one past the last element of the array object, the evaluation shall not produce an overflow; otherwise, the behavior is undefined. </blockquote> This this pointer is dereferenced to give an <code>int[2][1]</code>. This undergoes the same process for the next two subscripts, resulting in the final <code>int</code> lvalue at the appropriate array index. It is an lvalue due to the result of <code>*</code> (§5.3.1/1): <blockquote> The unary * operator performs indirection: the expression to which it is applied shall be a pointer to an object type, or a pointer to a function type and the result is an lvalue referring to the object or function to which the expression points. </blockquote> It is then perfectly fine to access the actual <code>int</code> object through this lvalue because the lvalue is of type <code>int</code> too (§3.10/10): <blockquote> If a program attempts to access the stored value of an object through a glvalue of other than one of the following types the behavior is undefined: <ul> <li>the dynamic type of the object</li> <li>[...]</li> </ul> </blockquote> So unless I've missed something. I'd say this program is well-defined.

Reshaping a 1-d array to a multidimensional array

Tags:

c++

multidimensional-array

c++11

language-lawyer

Taking into consideration the entire C++11 standard, is it possible for any conforming implementation to succeed the first assertion below but fail the latter?

#include <cassert>

int main(int, char**)
{  
    const int I = 5, J = 4, K = 3;
    const int N = I * J * K;

    int arr1d[N] = {0};
    int (&arr3d)[I][J][K] = reinterpret_cast<int (&)[I][J][K]>(arr1d);
    assert(static_cast<void*>(arr1d) ==
           static_cast<void*>(arr3d)); // is this necessary?

    arr3d[3][2][1] = 1;
    assert(arr1d[3 * (J * K) + 2 * K + 1] == 1); // UB?
}

If not, is this technically UB or not, and does that answer change if the first assertion is removed (is reinterpret_cast guaranteed to preserve addresses here?)? Also, what if the reshaping is done in the opposite direction (3d to 1d) or from a 6x35 array to a 10x21 array?

EDIT: If the answer is that this is UB because of the reinterpret_cast, is there some other strictly compliant way of reshaping (e.g., via static_cast to/from an intermediate void *)?

682

asked Mar 07 '13 23:03

Stephen Lin

2 Answers

Update 2021-03-20:

This same question was asked on Reddit recently and it was pointed out that my original answer is flawed because it does not take into account this aliasing rule:

If a program attempts to access the stored value of an object through a glvalue whose type is not similar to one of the following types the behavior is undefined:

the dynamic type of the object,

a type that is the signed or unsigned type corresponding to the dynamic type of the object, or

a char, unsigned char, or std::byte type.

Under the rules for similarity, these two array types are not similar for any of the above cases and therefore it is technically undefined behaviour to access the 1D array through the 3D array. (This is definitely one of those situations where, in practice, it will almost certainly work with most compilers/targets)

Note that the references in the original answer refer to an older C++11 draft standard

Original answer:

`reinterpret_cast` of references

The standard states that an lvalue of type T1 can be reinterpret_cast to a reference to T2 if a pointer to T1 can be reinterpret_cast to a pointer to T2 (§5.2.10/11):

An lvalue expression of type T1 can be cast to the type “reference to T2” if an expression of type “pointer to T1” can be explicitly converted to the type “pointer to T2” using a reinterpret_cast.

So we need to determine if a int(*)[N] can be converted to an int(*)[I][J][K].

`reinterpret_cast` of pointers

A pointer to T1 can be reinterpret_cast to a pointer to T2 if both T1 and T2 are standard-layout types and T2 has no stricter alignment requirements than T1 (§5.2.10/7):

When a prvalue v of type “pointer to T1” is converted to the type “pointer to cv T2”, the result is static_cast<cv T2*>(static_cast<cv void*>(v)) if both T1 and T2 are standard-layout types (3.9) and the alignment requirements of T2 are no stricter than those of T1, or if either type is void.

Are int[N] and int[I][J][K] standard-layout types?

int is a scalar type and arrays of scalar types are considered to be standard-layout types (§3.9/9).

Scalar types, standard-layout class types (Clause 9), arrays of such types and cv-qualified versions of these types (3.9.3) are collectively called standard-layout types.
Does int[I][J][K] have no stricter alignment requirements than int[N].

The result of the alignof operator gives the alignment requirement of a complete object type (§3.11/2).

The result of the alignof operator reflects the alignment requirement of the type in the complete-object case.

Since the two arrays here are not subobjects of any other object, they are complete objects. Applying alignof to an array gives the alignment requirement of the element type (§5.3.6/3):

When alignof is applied to an array type, the result shall be the alignment of the element type.

So both array types have the same alignment requirement.

That makes the reinterpret_cast valid and equivalent to:

int (&arr3d)[I][J][K] = *reinterpret_cast<int (*)[I][J][K]>(&arr1d);

where * and & are the built-in operators, which is then equivalent to:

int (&arr3d)[I][J][K] = *static_cast<int (*)[I][J][K]>(static_cast<void*>(&arr1d));

`static_cast` through `void*`

The static_cast to void* is allowed by the standard conversions (§4.10/2):

A prvalue of type “pointer to cv T,” where T is an object type, can be converted to a prvalue of type “pointer to cv void”. The result of converting a “pointer to cv T” to a “pointer to cv void” points to the start of the storage location where the object of type T resides, as if the object is a most derived object (1.8) of type T (that is, not a base class subobject).

The static_cast to int(*)[I][J][K] is then allowed (§5.2.9/13):

A prvalue of type “pointer to cv1 void” can be converted to a prvalue of type “pointer to cv2 T,” where T is an object type and cv2 is the same cv-qualification as, or greater cv-qualification than, cv1.

So the cast is fine! But are we okay to access objects through the new array reference?

Accessing array elements

Performing array subscripting on an array like arr3d[E2] is equivalent to *((E1)+(E2)) (§5.2.1/1). Let's consider the following array subscripting:

arr3d[3][2][1]

Firstly, arr3d[3] is equivalent to *((arr3d)+(3)). The lvalue arr3d undergoes array-to-pointer conversion to give a int(*)[2][1]. There is no requirement that the underlying array must be of the correct type to do this conversion. The pointers value is then accessed (which is fine by §3.10) and then the value 3 is added to it. This pointer arithmetic is also fine (§5.7/5):

If both the pointer operand and the result point to elements of the same array object, or one past the last element of the array object, the evaluation shall not produce an overflow; otherwise, the behavior is undefined.

This this pointer is dereferenced to give an int[2][1]. This undergoes the same process for the next two subscripts, resulting in the final int lvalue at the appropriate array index. It is an lvalue due to the result of * (§5.3.1/1):

The unary * operator performs indirection: the expression to which it is applied shall be a pointer to an object type, or a pointer to a function type and the result is an lvalue referring to the object or function to which the expression points.

It is then perfectly fine to access the actual int object through this lvalue because the lvalue is of type int too (§3.10/10):

If a program attempts to access the stored value of an object through a glvalue of other than one of the following types the behavior is undefined:

the dynamic type of the object

[...]

So unless I've missed something. I'd say this program is well-defined.

150

answered Sep 28 '22 09:09

Joseph Mansfield

I am under the impression that it will work. You allocate the same piece of contiguous memory. I know the C-standard guarantees it will be contiguous at least. I don't know what is said in the C++11 standard.

However the first assert should always be true. The address of the first element of the array will always be the same. All memory address will be the same since the same piece of memory is allocated.

I would therefore also say that the second assert will always hold true. At least as long as the ordering of the elements are always in row major order. This is also guaranteed by the C-standard and I would be surprised if the C++11 standard says anything differently.

answered Sep 28 '22 09:09

AxelOmega

Related questions
                            
                                range-based for on multi-dimensional array
                            
                                Implementing variadic type traits
                            
                                How to measure Memory Usage of std::unordered_map
                            
                                Where to get MD5 hashes from a GitHub release?
                            
                                Is the term "method" defined by the C++ Standard?
                            
                                Why does a 2-stage command-line build with clang not generate a dSYM directory?
                            
                                How to detect machine word size in C/C++?
                            
                                Is there a reason to use C++11's std::int_fast32_t or std::int_fast16_t over int in cross-platform code?
                            
                                SFINAE on functions with default parameters - free function vs operator()
                            
                                Is it possible to share an enum declaration between C# and unmanaged C++?
                            
                                Why does wide file-stream in C++ narrow written data by default?
                            
                                Avoid linking to libstdc++
                            
                                &&= and ||= operators [duplicate]
                            
                                C++: overriding public\private inheritance
                            
                                fwrite chokes on "<?xml version"
                            
                                Is NULL defined as nullptr in C++11?
                            
                                C++ abstract base class constructors/destructors - general correctness
                            
                                Using Quaternions for OpenGL Rotations [duplicate]
                            
                                How to get a vector containing only the last n elements of another vector?
                            
                                asio::read with timeout

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Reshaping a 1-d array to a multidimensional array

Tags:

c++

multidimensional-array

c++11

language-lawyer

Stephen Lin

People also ask

2 Answers

Original answer:

`reinterpret_cast` of references

`reinterpret_cast` of pointers

`static_cast` through `void*`

Accessing array elements

Joseph Mansfield

AxelOmega

Recent Activity

Donate For Us

Reshaping a 1-d array to a multidimensional array

Tags:

c++

multidimensional-array

c++11

language-lawyer

Stephen Lin

People also ask

2 Answers

Original answer:

reinterpret_cast of references

reinterpret_cast of pointers

static_cast through void*

Accessing array elements

Joseph Mansfield

AxelOmega

Related questions

Recent Activity

Donate For Us

`reinterpret_cast` of references

`reinterpret_cast` of pointers

`static_cast` through `void*`