Union and Struct Initialization

Tags:

c

unions

I stumbled across some code that uses unions in C. Here it is:

    union    {  
        struct  {  
            char ax[2];  
            char ab[2];  
        } s;  
        struct  {  
            int a;  
            int b;  
        } st;  
    } u ={12, 1}; 

    printf("%d %d", u.st.a, u.st.b);

I couldn't understand why the output was 268 0. How were the values initialized? How does the union work here? Shouldn't the output be 12 1? It would be great if anyone could explain in detail what exactly is happening here.

I am using a 32-bit processor on Windows 7.

asked Dec 07 '11 15:12

h4ck3d

4 Answers

The code doesn't do what you think. A brace initializer initializes the first union member, i.e. u.s. However, the initializer is then incomplete and missing braces, since u.s contains two arrays. It should be something like: u = { { {'a', 'b'}, { 'c', 'd' } } };

You should always compile with all warnings enabled; a decent compiler should have told you that something was amiss. For instance, GCC says missing braces around initialiser (near initialisation for ‘u.s’) and missing initialiser (near initialisation for ‘u.s.ab’). Very helpful.

In C99 you can use a designated initializer to initialize the second union member: u = { .st = {12, 1} }; (This was not possible in C++ before C++20, by the way.) The corresponding syntax for the first case is u = { .s = { {'a', 'b'}, { 'c', 'd' } } };, which is arguably more explicit and readable!
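As a minimal sketch of the two initializer forms described in this answer, the block below gives the question's union a tag so it can be reused. The tag `overlay` and the helper names `make_chars` and `make_ints` are illustrative assumptions, not part of the original code.

```c
/* The question's union with a tag added; `overlay` is a hypothetical name. */
union overlay {
    struct { char ax[2]; char ab[2]; } s;
    struct { int a; int b; } st;
};

/* Fully braced initializer: targets the first member, s, one array at a time. */
union overlay make_chars(void)
{
    union overlay u = { { {'a', 'b'}, {'c', 'd'} } };
    return u;
}

/* C99 designated initializer: targets the second member, st, explicitly. */
union overlay make_ints(void)
{
    union overlay u = { .st = {12, 1} };
    return u;
}
```

Reading `make_ints().st.a` and `make_ints().st.b` gives 12 and 1, which is what the question expected from its original initializer.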

answered Oct 09 '22 13:10

Kerrek SB

Your code relies on the default initialization rule for a union, which initializes its first member. Both 12 and 1 go into the characters of ax, hence the result that you see (which is very much implementation-dependent).

If you wanted to initialize through the second member (st), you would use a designated initializer:

    union {
        struct {
            char ax[2];
            char ab[2];
        } s;
        struct {
            int a;
            int b;
        } st;
    } u = { .st = {12, 1} };

answered Oct 09 '22 13:10

Sergey Kalinichenko

The code sets u.s.ax[0] to 12 and u.s.ax[1] to 1. u.s.ax is overlaid onto u.st.a, so the least-significant byte of u.st.a is set to 12 and the next byte to 1 (so you must be running on a little-endian architecture), giving a value of 0x010C or 268.
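The arithmetic above can be reproduced without the union. This is a rough sketch that copies two bytes into a zeroed `unsigned int` with `memcpy` (a substitute for the union overlay, chosen to keep the example simple); the function names are hypothetical.

```c
#include <string.h>

/* Place two bytes at the start of a zeroed unsigned int and read it back.
   The result depends on the machine's byte order.
   Assumes sizeof(unsigned int) >= 2. */
unsigned int overlay_bytes(unsigned char first, unsigned char second)
{
    unsigned int value = 0;
    unsigned char bytes[2] = { first, second };
    memcpy(&value, bytes, sizeof bytes);
    return value;
}

/* What a little-endian machine sees: the first byte is the low-order byte,
   the second byte is worth 256 times as much. */
unsigned int little_endian_value(unsigned char first, unsigned char second)
{
    return first + 256u * second;
}
```

`little_endian_value(12, 1)` is 12 + 256 = 268 on any machine, while `overlay_bytes(12, 1)` returns 268 only where the low-order byte is stored first.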

answered Oct 09 '22 15:10

Borodin

A union is at least as large as its largest member. So in this case, your union type has a size of 8 bytes on a 32-bit platform where int is 4 bytes. The first member of the union, s, takes up only 4 bytes, and its first array, ax, overlaps the first 2 bytes of st.a. Since you are on a little-endian system, those are the two lower-order bytes of st.a. Thus, when you initialize the union with the values {12, 1}, you explicitly set only the two lower-order bytes of st.a; the rest of s (the array ab) is zero-initialized, which clears the two upper bytes of st.a, and st.b came out as 0 as well. So when you print the int members of the union rather than the char members, you get 268 and 0.
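To check the sizes and overlap on a given platform, a small sketch along these lines may help. The struct and union tags (`chars`, `ints`, `overlay`) and the function names are assumptions added for illustration; the exact sizes printed depend on the implementation.

```c
#include <stddef.h>
#include <stdio.h>

/* Tagged versions of the question's anonymous types; the tags are hypothetical. */
struct chars { char ax[2]; char ab[2]; };
struct ints  { int a; int b; };
union overlay { struct chars s; struct ints st; };

size_t chars_size(void) { return sizeof(struct chars); }
size_t ints_size(void)  { return sizeof(struct ints); }
size_t union_size(void) { return sizeof(union overlay); }

/* Print where each piece lives inside the union. */
void print_layout(void)
{
    printf("sizeof s  = %zu\n", chars_size());
    printf("sizeof st = %zu\n", ints_size());
    printf("sizeof u  = %zu\n", union_size());
    printf("offset of s.ab within s  = %zu\n", offsetof(struct chars, ab));
    printf("offset of st.b within st = %zu\n", offsetof(struct ints, b));
}
```

Both union members start at offset 0, so everything in s shares storage with the beginning of st, and st.b begins past the end of s whenever int is wider than two bytes.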

answered Oct 09 '22 14:10

Jason
