Is there any guarantee of alignment of the address returned by C++'s new operation?

Tags:

c++

performance

alignment

new-operator

Most experienced programmers know that data alignment is important for a program's performance. I have seen programmers allocate a buffer larger than they need and use an aligned pointer inside it as the start. I am wondering whether I should do that in my program, because I have no idea whether there is any guarantee about the alignment of the address returned by C++'s new operation. So I wrote a little program to test it:

for(size_t i = 0; i < 100; ++i) {
    char *p = new char[123];
    if(reinterpret_cast<size_t>(p) % 4) {
        cout << "*";
        system("pause");
    }
    cout << reinterpret_cast<void *>(p) << endl;
}
for(size_t i = 0; i < 100; ++i) {
    short *p = new short[123];
    if(reinterpret_cast<size_t>(p) % 4) {
        cout << "*";
        system("pause");
    }
    cout << reinterpret_cast<void *>(p) << endl;
}
for(size_t i = 0; i < 100; ++i) {
    float *p = new float[123];
    if(reinterpret_cast<size_t>(p) % 4) {
        cout << "*";
        system("pause");
    }
    cout << reinterpret_cast<void *>(p) << endl;
}
system("pause");

The compiler I am using is Visual C++ Express 2008. All addresses returned by new appear to be aligned, but I am not sure. So my question is: is there any guarantee? If there is, I don't have to align the memory myself; if not, I have to.
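For reference, the over-allocation trick mentioned in the question can be sketched roughly as follows. This is only an illustration, assuming a C++11 or later compiler; `AlignedBuffer` and `is_aligned` are hypothetical names that do not appear in the original post, and `std::align` from `<memory>` is used in place of hand-written pointer arithmetic.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>

// Hypothetical helper (not from the original post): owns a raw buffer that is
// slightly larger than requested and exposes an aligned pointer into it.
class AlignedBuffer {
public:
    // `alignment` must be a power of two.
    AlignedBuffer(std::size_t size, std::size_t alignment)
        : raw_(new unsigned char[size + alignment - 1]) {
        assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
        void *p = raw_.get();
        std::size_t space = size + alignment - 1;
        // std::align moves p forward to the next multiple of `alignment`
        // that still leaves `size` bytes, or returns nullptr if it cannot.
        aligned_ = std::align(alignment, size, p, space);
        assert(aligned_ != nullptr);
    }

    // Start of the aligned region; valid for `size` bytes.
    void *data() const { return aligned_; }

private:
    std::unique_ptr<unsigned char[]> raw_;  // owns the over-sized allocation
    void *aligned_ = nullptr;               // points inside raw_
};

// Hypothetical helper: true if p is a multiple of `alignment`.
inline bool is_aligned(const void *p, std::size_t alignment) {
    return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}
```

Constructing `AlignedBuffer buf(100, 64)` and passing `buf.data()` to `is_aligned` with 64 should then report an aligned address. The extra `alignment - 1` bytes are the price of not relying on whatever alignment new happens to provide.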

768

asked Feb 03 '09 09:02

Fang-Pen Lin

2 Answers

The standard provides the following alignment guarantee (3.7.3.1/2):

The pointer returned shall be suitably aligned so that it can be converted to a pointer of any complete object type and then used to access the object or array in the storage allocated (until the storage is explicitly deallocated by a call to a corresponding deallocation function).
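As a small illustration of what this guarantee implies in practice, the sketch below checks that the address returned by `new T[n]` is a multiple of `alignof(T)`. It assumes a C++11 or later compiler (for `alignof`) and the default, non-replaced allocation functions; `is_multiple_of` and `new_array_is_aligned` are hypothetical helper names, not part of any library.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper: true if p is a multiple of `alignment`.
inline bool is_multiple_of(const void *p, std::size_t alignment) {
    assert(alignment != 0);
    return reinterpret_cast<std::uintptr_t>(p) % alignment == 0;
}

// Allocates `count` objects of T with new[] and reports whether the
// returned address satisfies alignof(T). The array is freed before returning.
template <typename T>
bool new_array_is_aligned(std::size_t count) {
    T *p = new T[count];
    bool ok = is_multiple_of(p, alignof(T));
    delete[] p;
    return ok;
}
```

For example, `new_array_is_aligned<float>(123)` mirrors the third loop in the question, but tests against the alignment the type actually requires instead of a hard-coded 4. A passing check only shows that one run behaved as promised; the guarantee itself comes from the standard text.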

EDIT: Thanks to timday for highlighting a bug in gcc/glibc where the guarantee does not hold.

EDIT 2: Ben's comment highlights an interesting edge case. The requirements on the allocation routines apply only to those provided by the standard. If the application has its own version, then there's no such guarantee on the result.

128

answered Sep 27 '22 17:09

Richard Corden

This is a late answer, but just to clarify the situation on Linux: on 64-bit systems, memory is always 16-byte aligned. From the glibc manual:

http://www.gnu.org/software/libc/manual/html_node/Aligned-Memory-Blocks.html

The address of a block returned by malloc or realloc in the GNU system is always a multiple of eight (or sixteen on 64-bit systems).

The new operator calls malloc internally (see ./gcc/libstdc++-v3/libsupc++/new_op.cc) so this applies to new as well.

The implementation of malloc that is part of glibc basically defines MALLOC_ALIGNMENT to be 2*sizeof(size_t), and size_t is 32 bits (4 bytes) on an x86-32 system and 64 bits (8 bytes) on an x86-64 system.

$ cat ./glibc-2.14/malloc/malloc.c:
...
#ifndef INTERNAL_SIZE_T
#define INTERNAL_SIZE_T size_t
#endif
...
#define SIZE_SZ                (sizeof(INTERNAL_SIZE_T))
...
#ifndef MALLOC_ALIGNMENT
#define MALLOC_ALIGNMENT       (2 * SIZE_SZ)
#endif
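A quick way to observe this on a given machine is sketched below. It assumes the allocator in use behaves like the glibc source quoted above, that is, it aligns blocks to `2 * sizeof(size_t)`; other C libraries or replacement allocators may use a different value. `kAssumedMallocAlignment` and `count_misaligned_mallocs` are hypothetical names introduced only for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <vector>

// Assumption taken from the glibc source quoted above: blocks are aligned to
// 2 * sizeof(size_t). Other allocators may behave differently.
constexpr std::size_t kAssumedMallocAlignment = 2 * sizeof(std::size_t);

// Allocates `rounds` blocks of `size` bytes, keeps them alive so that the
// addresses differ, and returns how many are NOT aligned as assumed.
inline std::size_t count_misaligned_mallocs(std::size_t size, std::size_t rounds) {
    std::vector<void *> blocks;
    blocks.reserve(rounds);
    std::size_t misaligned = 0;
    for (std::size_t i = 0; i < rounds; ++i) {
        void *p = std::malloc(size);
        assert(p != nullptr);
        if (reinterpret_cast<std::uintptr_t>(p) % kAssumedMallocAlignment != 0) {
            ++misaligned;
        }
        blocks.push_back(p);
    }
    // Release everything only after all addresses have been inspected.
    for (void *p : blocks) {
        std::free(p);
    }
    return misaligned;
}
```

Calling `count_misaligned_mallocs(123, 100)` on a glibc system should return zero if the quoted definition of MALLOC_ALIGNMENT applies. A non-zero result would suggest a different allocator is in use, not that the check is wrong.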

answered Sep 27 '22 17:09

user1059432
