I couldn't find a decent document that explains how the alignment system works and why some types are more strictly aligned than the others.

I'll try to explain in short. <h3>What is data alignment?</h3> The architecture in you computer is composed of processor and memory. Memory is organized in cells, so: <pre class="prettyprint"><code> 0x00 | data | 0x01 | ... | 0x02 | ... | </code></pre> Each memory cell has a specified size, amount of bits it can store. This is architecture dependent. When you define a variable in your C/C++ program, one or more different cells are occupied by your program. For example <pre class="prettyprint"><code>int variable = 12; </code></pre> Suppose each cell contains 32 bits and the <code>int</code> type size is 32 bits, then in somewhere in your memory: <pre class="prettyprint"><code>variable: | 0 0 0 c | // c is hexadecimal of 12. </code></pre> When your CPU has to operate on that variable it needs to bring it inside its register. A CPU can take in "1 clock" a small amount of bit from the memory, that size is usually called WORD. This dimension is architecture dependent as well. Now suppose you have a variable which is stored, because of some offset, in two cells. For example I have two different pieces data to store (I'm going to use a "string representation to make more clear"): <pre class="prettyprint"><code>data1: "ab" data2: "cdef" </code></pre> So the memory will be composed in that way (2 different cells): <pre class="prettyprint"><code>|a b c d| |e f 0 0| </code></pre> That is, <code>data1</code> occupies just half of the cell, so <code>data2</code> occupies the remaining part and a part of a second cell. Now suppose you CPU wants to read <code>data2</code>. The CPU needs 2 clocks in order to access the data, because within one clock it reads the first cell and within the other clock it reads the remaining part in the second cell. If we align <code>data2</code> in accordance with this memory-example, we can introduce a sort of padding and shift <code>data2</code> all in the second cell. <pre class="prettyprint"><code>|a b 0 0| |c d e f| --- padding </code></pre> In that way the CPU will lose only "1 clock" in order to access to <code>data2</code>. <h3>What an align system does</h3> An align system just introduces that padding in order to align the data with the memory of the system, in accordance with the architecture. <h3>Why should I care about alignment?</h3> I will not go deep in this answer. However, broadly speaking, memory alignment comes from the requirements of the context. In the example above, having padding (so the data is memory-aligned) can save CPU cycles in order to retrieve the data. This might have an impact on the execution performance of the program because of minor number of memory access. However, beyond the above example (made only for sake of the explanation), there are many other scenarios where memory alignment is useful or even needed. For example, some architectures might have strict requirements how the memory can be accessed. In such cases, the padding helps to allocate memory fulfilling the platform constraints.

What is data alignment? Why and when should I be worried when typecasting pointers in C? [duplicate]

1 Answers

I'll try to explain in short.

What is data alignment?

The architecture in you computer is composed of processor and memory. Memory is organized in cells, so:

 0x00 |   data  |    0x01 |   ...   |  0x02 |   ...   |

Each memory cell has a specified size, amount of bits it can store. This is architecture dependent.

When you define a variable in your C/C++ program, one or more different cells are occupied by your program.

For example

int variable = 12;

Suppose each cell contains 32 bits and the int type size is 32 bits, then in somewhere in your memory:

variable: | 0 0 0 c |  // c is hexadecimal of 12.

When your CPU has to operate on that variable it needs to bring it inside its register. A CPU can take in "1 clock" a small amount of bit from the memory, that size is usually called WORD. This dimension is architecture dependent as well.

Now suppose you have a variable which is stored, because of some offset, in two cells.

For example I have two different pieces data to store (I'm going to use a "string representation to make more clear"):

data1: "ab" data2: "cdef"

So the memory will be composed in that way (2 different cells):

|a b c d|     |e f 0 0|

That is, data1 occupies just half of the cell, so data2 occupies the remaining part and a part of a second cell.

Now suppose you CPU wants to read data2. The CPU needs 2 clocks in order to access the data, because within one clock it reads the first cell and within the other clock it reads the remaining part in the second cell.

If we align data2 in accordance with this memory-example, we can introduce a sort of padding and shift data2 all in the second cell.

|a b 0 0|     |c d e f|      ---    padding

In that way the CPU will lose only "1 clock" in order to access to data2.

What an align system does

An align system just introduces that padding in order to align the data with the memory of the system, in accordance with the architecture.

Why should I care about alignment?

I will not go deep in this answer. However, broadly speaking, memory alignment comes from the requirements of the context.

In the example above, having padding (so the data is memory-aligned) can save CPU cycles in order to retrieve the data. This might have an impact on the execution performance of the program because of minor number of memory access.

However, beyond the above example (made only for sake of the explanation), there are many other scenarios where memory alignment is useful or even needed.

For example, some architectures might have strict requirements how the memory can be accessed. In such cases, the padding helps to allocate memory fulfilling the platform constraints.

199

answered Oct 12 '22 19:10

BiagioF

Related questions
                            
                                how do I parse an iso 8601 date (with optional milliseconds) to a struct tm in C++?
                            
                                LRU implementation in production code
                            
                                where should "include" be put in C++
                            
                                boost Shared_pointer NULL
                            
                                what does the error mean when I am compiling c++ with g++ compiler?
                            
                                VS2012 C++ warning C4005: '__useHeader': macro redefinition
                            
                                error: no matching function for call to ‘min(long unsigned int&, unsigned int&)’
                            
                                static_cast<int>(foo) vs. (int)foo
                            
                                Does std::map::iterator return a copy of value or a value itself?
                            
                                How to export a C++ class from a dll? [duplicate]
                            
                                Why is rsize_t defined?
                            
                                Error1 error LNK1107: invalid or corrupt file: cannot read at 0x2B0
                            
                                Why is it not a compiler error to assign to the result of a substr call?
                            
                                How can I know the real maximum size of a vector? (Not using std::vector::max_size)
                            
                                How to write portable code in c++?
                            
                                Why is using "vector.at(x)" better than "vector[x]" in C++?
                            
                                Compiling C++ code (Xcode) for iOS and Android. Is it real? [closed]
                            
                                What are practical applications of weak linking?
                            
                                What's the difference between ordering and sorting?
                            
                                Metaprograming: Failure of Function Definition Defines a Separate Function

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is data alignment? Why and when should I be worried when typecasting pointers in C? [duplicate]

Tags:

c++

c

memory

Dogus Ural

People also ask