Why does this struct padding trick work?

Tags: c++, inheritance, padding

Consider this simple program

#include <iostream>

struct A
{
    int   x1234;
    short x56;
    char  x7;
};

struct B : A
{
    char x8;
};

int main()
{
    std::cout << sizeof(A) << ' ' << sizeof(B) << '\n';
    return 0;
}

This prints 8 12. Even though B could be packed into 8 bytes without breaking alignment requirements, instead it takes up a greedy 12 bytes.

It would be nice to have sizeof(B) == 8, but the answer to "Is the size of a struct required to be an exact multiple of the alignment of that struct?" suggests that there isn't a way.

I was therefore surprised when the following

struct MakePackable
{
};

struct A : MakePackable
{
    int   x1234;
    short x56;
    char  x7;
};

struct B : A
{
    char x8;
};

printed 8 8.

What is going on here? I suspect that standard-layout types have something to do with it. If so, then what is the rationale for it causing the above behaviour, when the only purpose of that feature is to ensure binary compatibility with C?

EDIT: As others have pointed out this is ABI or compiler specific, so I ought to add that this behaviour was observed on x86_64-unknown-linux-gnu with the following compilers:

clang 3.6
gcc 5.1

I have also noticed something strange from clang's struct dumper. If we ask for the data size without tail padding ("dsize"),

          A   B
first     8   9
second    7   8

then in the first example we get dsize(A) == 8. Why is this not 7?

932

asked Jul 07 '15 00:07

PBS

2 Answers

This is a data point although not a complete answer.

Say we have (as a complete translation unit, not a snippet):

struct X {};

struct A
{
    int   x1234;
    short x56;
    char  x7;
};

void func(A &dst, A const &src)
{
    dst = src;
}

With g++, this function is compiled to:

movq    (%rdx), %rax
movq    %rax, (%rcx)

However if struct A : X is used instead, then this function is:

movl    (%rdx), %eax
movl    %eax, (%rcx)
movzwl  4(%rdx), %eax
movw    %ax, 4(%rcx)
movzbl  6(%rdx), %eax
movb    %al, 6(%rcx)

These two cases actually correspond to the sizes being 8 12 and 8 8 respectively in OP's example.

The reason for this is fairly clear: A might be used as a base for some class B, and then the call func(b, a); must be careful not to disturb other members of b that might reside in the padding area (b.x8 in OP's example).

I cannot see any particular property of A : X in the C++ standard which would make g++ decide that the padding is re-usable in struct A : X, but not in struct A. Both A and A : X are trivially copyable, standard layout and POD.

I guess it must just be an optimization decision based on typical usage. The version without re-use will be faster to copy. Maybe a g++ ABI designer could comment?

Interestingly, this example shows that being trivially copyable does not imply that memcpy(&b, &a, sizeof b) is equivalent to b = a!

131

answered Oct 23 '22 19:10

M.M

I'm not a real C++ language lawyer, but here is what I've found so far:

Going by the answers to this question, a struct remains a standard-layout POD only while at most one class among itself and its base classes has non-static data members. Under that rule, A has a guaranteed layout in both cases, but B does not in either case.

This is supported by the fact that std::is_pod is true for A and false for B in both cases.

First case: http://ideone.com/jyPb5J
Second case: http://ideone.com/bYcLXa

So if I'm understanding this correctly myself, the compiler is allowed some room to do what it wants with the layout of B in both cases. And apparently in the second case it feels like making use of what would otherwise have been the padding byte of A.

answered Oct 23 '22 18:10

TheUndeadFish
