Why does the <code>sizeof</code> operator return a size larger for a structure than the total sizes of the structure's members?

Packing and byte alignment, as described in the C FAQ here: <blockquote> It's for alignment. Many processors can't access 2- and 4-byte quantities (e.g. ints and long ints) if they're crammed in every-which-way. Suppose you have this structure: <pre class="prettyprint"><code>struct { char a[3]; short int b; long int c; char d[3]; }; </code></pre> Now, you might think that it ought to be possible to pack this structure into memory like this: <pre class="prettyprint"><code>+-------+-------+-------+-------+ | a | b | +-------+-------+-------+-------+ | b | c | +-------+-------+-------+-------+ | c | d | +-------+-------+-------+-------+ </code></pre> But it's much, much easier on the processor if the compiler arranges it like this: <pre class="prettyprint"><code>+-------+-------+-------+ | a | +-------+-------+-------+ | b | +-------+-------+-------+-------+ | c | +-------+-------+-------+-------+ | d | +-------+-------+-------+ </code></pre> In the packed version, notice how it's at least a little bit hard for you and me to see how the b and c fields wrap around? In a nutshell, it's hard for the processor, too. Therefore, most compilers will pad the structure (as if with extra, invisible fields) like this: <pre class="prettyprint"><code>+-------+-------+-------+-------+ | a | pad1 | +-------+-------+-------+-------+ | b | pad2 | +-------+-------+-------+-------+ | c | +-------+-------+-------+-------+ | d | pad3 | +-------+-------+-------+-------+ </code></pre> </blockquote>

Why isn't sizeof for a struct equal to the sum of sizeof of each member?

2 Answers

This is because of padding added to satisfy alignment constraints. Data structure alignment impacts both performance and correctness of programs:

Mis-aligned access might be a hard error (often SIGBUS).
Mis-aligned access might be a soft error.
- Either corrected in hardware, for a modest performance-degradation.
- Or corrected by emulation in software, for a severe performance-degradation.
- In addition, atomicity and other concurrency-guarantees might be broken, leading to subtle errors.

Here's an example using typical settings for an x86 processor (all used 32 and 64 bit modes):

struct X {     short s; /* 2 bytes */              /* 2 padding bytes */     int   i; /* 4 bytes */     char  c; /* 1 byte */              /* 3 padding bytes */ };  struct Y {     int   i; /* 4 bytes */     char  c; /* 1 byte */              /* 1 padding byte */     short s; /* 2 bytes */ };  struct Z {     int   i; /* 4 bytes */     short s; /* 2 bytes */     char  c; /* 1 byte */              /* 1 padding byte */ };  const int sizeX = sizeof(struct X); /* = 12 */ const int sizeY = sizeof(struct Y); /* = 8 */ const int sizeZ = sizeof(struct Z); /* = 8 */

One can minimize the size of structures by sorting members by alignment (sorting by size suffices for that in basic types) (like structure Z in the example above).

IMPORTANT NOTE: Both the C and C++ standards state that structure alignment is implementation-defined. Therefore each compiler may choose to align data differently, resulting in different and incompatible data layouts. For this reason, when dealing with libraries that will be used by different compilers, it is important to understand how the compilers align data. Some compilers have command-line settings and/or special #pragma statements to change the structure alignment settings.

144

answered Sep 19 '22 19:09

6 revs, 6 users 79%

Packing and byte alignment, as described in the C FAQ here:

It's for alignment. Many processors can't access 2- and 4-byte quantities (e.g. ints and long ints) if they're crammed in every-which-way.

Suppose you have this structure:
struct {     char a[3];     short int b;     long int c;     char d[3]; }; 
Now, you might think that it ought to be possible to pack this structure into memory like this:
+-------+-------+-------+-------+ |           a           |   b   | +-------+-------+-------+-------+ |   b   |           c           | +-------+-------+-------+-------+ |   c   |           d           | +-------+-------+-------+-------+ 
But it's much, much easier on the processor if the compiler arranges it like this:
+-------+-------+-------+ |           a           | +-------+-------+-------+ |       b       | +-------+-------+-------+-------+ |               c               | +-------+-------+-------+-------+ |           d           | +-------+-------+-------+ 
In the packed version, notice how it's at least a little bit hard for you and me to see how the b and c fields wrap around? In a nutshell, it's hard for the processor, too. Therefore, most compilers will pad the structure (as if with extra, invisible fields) like this:
+-------+-------+-------+-------+ |           a           | pad1  | +-------+-------+-------+-------+ |       b       |     pad2      | +-------+-------+-------+-------+ |               c               | +-------+-------+-------+-------+ |           d           | pad3  | +-------+-------+-------+-------+ 

answered Sep 18 '22 19:09

EmmEff

Related questions
                            
                                What is meant with "const" at end of function declaration? [duplicate]
                            
                                What is the easiest way to initialize a std::vector with hardcoded elements?
                            
                                How do I achieve the theoretical maximum of 4 FLOPs per cycle?
                            
                                How to determine CPU and memory consumption from inside a process
                            
                                How do I detect unsigned integer multiply overflow?
                            
                                How to call a parent class function from derived class function?
                            
                                Can code that is valid in both C and C++ produce different behavior when compiled in each language?
                            
                                C++ code file extension? What is the difference between .cc and .cpp [closed]
                            
                                How can I get the list of files in a directory using C or C++?
                            
                                Read file line by line using ifstream in C++
                            
                                How to find out if an item is present in a std::vector?
                            
                                How to concatenate a std::string and an int
                            
                                What does the C++ standard state the size of int, long type to be?
                            
                                Sleep for milliseconds
                            
                                Difference between `constexpr` and `const`
                            
                                Why use static_cast<int>(x) instead of (int)x?
                            
                                What is a segmentation fault?
                            
                                Appending a vector to a vector [duplicate]
                            
                                What are the rules for calling the base class constructor?
                            
                                Why is my program slow when looping over exactly 8192 elements?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why isn't sizeof for a struct equal to the sum of sizeof of each member?

Tags:

c++

c

c++-faq

struct

sizeof

Kevin

People also ask

2 Answers

6 revs, 6 users 79%

EmmEff

Recent Activity

Donate For Us