When I print the size of a union like this: <pre class="prettyprint"><code>union u { char c[5]; int i; } un; </code></pre> using this: <pre class="prettyprint"><code>int _tmain(int argc, _TCHAR* argv[]) { printf("size of union = %d ",sizeof(un)); return 0; } </code></pre> I get an answer of 8 using Visual C++, but I expected 5. Why? Well, for the same example, i did something like this: <pre class="prettyprint"><code>int i1 = 0x98761234; un.i = i1; printf("\n un.c[0] = %x ",un.c[0]); printf("\n un.c[1] = %x ",un.c[1]); printf("\n un.c[2]= %x ",un.c[2]); printf("\n un.c[3] = %x ",un.c[3]); printf("\n un.c[4] = %x ",un.c[4]); printf("size of union = %d ",sizeof(un)); </code></pre> i got results like <pre class="prettyprint"><code>un.c[0] = 34; un.c[1] = 12; un.c[2] = 76; un.c[3] = ffffff98; </code></pre> why are there 6fs at un.c[3]

The <code>sizeof</code> operator produces the size of a variable or type, including any padding necessary to separate elements in an array of that type such that everything is still correctly aligned. Since your union has an <code>int</code> member, it needs to be 4-byte aligned, so its "natural" size gets rounded upwards to the next multiple of 4 bytes. <hr> The <code>ffffff98</code> is because you're compiling with signed <code>char</code>. Using <code>%x</code> with an argument that is not <code>unsigned int</code> causes undefined behaviour; what you're seeing is sometimes called sign-extension. The result of your aliasing is <code>0x98</code> reinterpreted as <code>char</code>, which is <code>-104</code>. This retains its value on being promoted to <code>int</code> (this is called the default argument promotions), and the int <code>-104</code> when aliased as <code>unsigned int</code> becomes <code>0xffffff98</code>.

The alignment of your union must be the largest alignment of any of its members. This is 4. Therefore, the size of the union must be aligned to that size. It could have been 5 (as <code>c</code> is the largest member of the union), but because the alignment of the union as a whole is 4, the size of the union is padded to 8. Note that this is just for VC++. The standard does not specifically require it. Though it does allow implementations to pad types as needed, which VC++ does. GCC could do something different, and there could be compile-time switches you could employ to change this behavior.

Why is my union's size bigger than I expected?

Tags:

c++

unions

When I print the size of a union like this:

union u {
  char c[5];
  int i;
} un;

using this:

int _tmain(int argc, _TCHAR* argv[])
{
    printf("size of union = %d ",sizeof(un));
    return 0;
}

I get an answer of 8 using Visual C++, but I expected 5. Why?

Well, for the same example, i did something like this:

int i1 = 0x98761234;
un.i = i1;
printf("\n un.c[0] = %x ",un.c[0]);
printf("\n un.c[1] = %x ",un.c[1]);
printf("\n un.c[2]= %x ",un.c[2]);
printf("\n un.c[3] = %x ",un.c[3]);
printf("\n un.c[4] = %x ",un.c[4]);
printf("size of union = %d ",sizeof(un));

i got results like

un.c[0] = 34;
un.c[1] = 12;
un.c[2] = 76;
un.c[3] = ffffff98;

why are there 6fs at un.c[3]

705

asked Aug 27 '11 04:08

Saurabh Ghorpade

2 Answers

The sizeof operator produces the size of a variable or type, including any padding necessary to separate elements in an array of that type such that everything is still correctly aligned. Since your union has an int member, it needs to be 4-byte aligned, so its "natural" size gets rounded upwards to the next multiple of 4 bytes.

The ffffff98 is because you're compiling with signed char. Using %x with an argument that is not unsigned int causes undefined behaviour; what you're seeing is sometimes called sign-extension. The result of your aliasing is 0x98 reinterpreted as char, which is -104. This retains its value on being promoted to int (this is called the default argument promotions), and the int -104 when aliased as unsigned int becomes 0xffffff98.

answered Oct 17 '22 13:10

hmakholm left over Monica

The alignment of your union must be the largest alignment of any of its members. This is 4. Therefore, the size of the union must be aligned to that size. It could have been 5 (as c is the largest member of the union), but because the alignment of the union as a whole is 4, the size of the union is padded to 8.

Note that this is just for VC++. The standard does not specifically require it. Though it does allow implementations to pad types as needed, which VC++ does. GCC could do something different, and there could be compile-time switches you could employ to change this behavior.

answered Oct 17 '22 15:10

Nicol Bolas

Related questions
                            
                                Memory allocation in C++
                            
                                Do I kill a kitten each time I use struct everywhere instead of class?
                            
                                Why does the print command in gdb return \035 for C++ std::strings?
                            
                                MFC dlg class link errors for MyClass::GetMessageMap() and MyClass::GetRuntimeClass (MSVC 2008)
                            
                                Qt moveToThread() vs calling new thread when do we use each
                            
                                Indenting Paragraph With cout
                            
                                OpenGL: How to render perfect rectangular gradient?
                            
                                C++, Multilanguage/Localisation support
                            
                                inheriting ostream and streambuf problem with xsputn and overflow
                            
                                Serial Comm using WriteFile/ReadFile
                            
                                How to test whether expression is a temporary?
                            
                                OpenCV 2.2 SURF Feature matching problems
                            
                                operator new inside namespace
                            
                                Initialisation lists in constructors trying to initialize a structure
                            
                                Why is my constructor with non const reference as argument allowed to be called with temporary objects?
                            
                                Getting std::map allocator to work
                            
                                Hex character to int in C++
                            
                                C++ Qt Reflection with Copy and Assignment
                            
                                One reader. One writer. Some general questions about mutexes and atomic-builtins
                            
                                libstdc++ GLIBCXX version errors

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With