David Hollman recently tweeted the following example (which I've slightly reduced): <pre class="prettyprint"><code>struct FooBeforeBase { double d; bool b[4]; }; struct FooBefore : FooBeforeBase { float value; }; static_assert(sizeof(FooBefore) > 16); //---------------------------------------------------- struct FooAfterBase { protected: double d; public: bool b[4]; }; struct FooAfter : FooAfterBase { float value; }; static_assert(sizeof(FooAfter) == 16); </code></pre> You can examine the layout in clang on godbolt and see that the reason the size changed is that in <code>FooBefore</code>, the member <code>value</code> is placed at offset 16 (maintaining a full alignment of 8 from <code>FooBeforeBase</code>) whereas in <code>FooAfter</code>, the member <code>value</code> is placed at offset 12 (effectively using <code>FooAfterBase</code>'s tail-padding). It is clear to me that <code>FooBeforeBase</code> is standard-layout, but <code>FooAfterBase</code> is not (because its non-static data members do not all have the same access control, [class.prop]/3). But what is it about <code>FooBeforeBase</code>'s being standard-layout that requires this respect of padding bytes? Both gcc and clang reuse <code>FooAfterBase</code>'s padding, ending up with <code>sizeof(FooAfter) == 16</code>. But MSVC does not, ending up with 24. Is there a required layout per the standard and, if not, why do gcc and clang do what they do? <hr> There is some confusion, so just to clear up: <ul> <li> <code>FooBeforeBase</code> is standard-layout</li> <li> <code>FooBefore</code> is not (both it and a base class have non-static data members, similar to <code>E</code> in this example)</li> <li> <code>FooAfterBase</code> is not (it has non-static data members of differing access)</li> <li> <code>FooAfter</code> is not (for both of the above reasons)</li> </ul>

The answer to this question doesn't come from the standard but rather from the Itanium ABI (which is why gcc and clang have one behavior but msvc does something else). That ABI defines a layout, the relevant parts of which for the purposes of this question are: <blockquote> For purposes internal to the specification, we also specify: <ul> <li> dsize(O): the data size of an object, which is the size of O without tail padding.</li> </ul> </blockquote> and <blockquote> We ignore tail padding for PODs because an early version of the standard did not allow us to use it for anything else and because it sometimes permits faster copying of the type. </blockquote> Where the placement of members other than virtual base classes is defined as: <blockquote> Start at offset dsize(C), incremented if necessary for alignment to nvalign(D) for base classes or to align(D) for data members. Place D at this offset unless [... not relevant ...]. </blockquote> The term POD has disappeared from the C++ standard, but it means standard-layout and trivially copyable. In this question, <code>FooBeforeBase</code> is a POD. The Itanium ABI ignores tail padding - hence <code>dsize(FooBeforeBase)</code> is 16. But <code>FooAfterBase</code> is not a POD (it is trivially copyable, but it is not standard-layout). As a result, tail padding is not ignored, so <code>dsize(FooAfterBase)</code> is just 12, and the <code>float</code> can go right there. This has interesting consequences, as pointed out by Quuxplusone in a related answer, implementors also typically assume that tail padding isn't reused, which wreaks havoc on this example: <blockquote> <pre class="prettyprint"><code>#include <algorithm> #include <stdio.h> struct A { int m_a; }; struct B : A { int m_b1; char m_b2; }; struct C : B { short m_c; }; int main() { C c1 { 1, 2, 3, 4 }; B& b1 = c1; B b2 { 5, 6, 7 }; printf("before operator=: %d\n", int(c1.m_c)); // 4 b1 = b2; printf("after operator=: %d\n", int(c1.m_c)); // 4 printf("before std::copy: %d\n", int(c1.m_c)); // 4 std::copy(&b2, &b2 + 1, &b1); printf("after std::copy: %d\n", int(c1.m_c)); // 64, or 0, or anything but 4 } </code></pre> </blockquote> Here, <code>=</code> does the right thing (it does not override <code>B</code>'s tail padding), but <code>copy()</code> has a library optimization that reduces to <code>memmove()</code> - which does not care about tail padding because it assumes it does not exist.

Standard-layout and tail padding

Tags:

David Hollman recently tweeted the following example (which I've slightly reduced):

struct FooBeforeBase {
    double d;
    bool b[4];
};

struct FooBefore : FooBeforeBase {
    float value;
};

static_assert(sizeof(FooBefore) > 16);

//----------------------------------------------------

struct FooAfterBase {
protected:
    double d;
public:  
    bool b[4];
};

struct FooAfter : FooAfterBase {
    float value;
};

static_assert(sizeof(FooAfter) == 16);

You can examine the layout in clang on godbolt and see that the reason the size changed is that in FooBefore, the member value is placed at offset 16 (maintaining a full alignment of 8 from FooBeforeBase) whereas in FooAfter, the member value is placed at offset 12 (effectively using FooAfterBase's tail-padding).

It is clear to me that FooBeforeBase is standard-layout, but FooAfterBase is not (because its non-static data members do not all have the same access control, [class.prop]/3). But what is it about FooBeforeBase's being standard-layout that requires this respect of padding bytes?

Both gcc and clang reuse FooAfterBase's padding, ending up with sizeof(FooAfter) == 16. But MSVC does not, ending up with 24. Is there a required layout per the standard and, if not, why do gcc and clang do what they do?

There is some confusion, so just to clear up:

FooBeforeBase is standard-layout
FooBefore is not (both it and a base class have non-static data members, similar to E in this example)
FooAfterBase is not (it has non-static data members of differing access)
FooAfter is not (for both of the above reasons)

927

asked Dec 18 '18 16:12

Barry

1 Answers

The answer to this question doesn't come from the standard but rather from the Itanium ABI (which is why gcc and clang have one behavior but msvc does something else). That ABI defines a layout, the relevant parts of which for the purposes of this question are:

For purposes internal to the specification, we also specify:

dsize(O): the data size of an object, which is the size of O without tail padding.

and

We ignore tail padding for PODs because an early version of the standard did not allow us to use it for anything else and because it sometimes permits faster copying of the type.

Where the placement of members other than virtual base classes is defined as:

Start at offset dsize(C), incremented if necessary for alignment to nvalign(D) for base classes or to align(D) for data members. Place D at this offset unless [... not relevant ...].

The term POD has disappeared from the C++ standard, but it means standard-layout and trivially copyable. In this question, FooBeforeBase is a POD. The Itanium ABI ignores tail padding - hence dsize(FooBeforeBase) is 16.

But FooAfterBase is not a POD (it is trivially copyable, but it is not standard-layout). As a result, tail padding is not ignored, so dsize(FooAfterBase) is just 12, and the float can go right there.

This has interesting consequences, as pointed out by Quuxplusone in a related answer, implementors also typically assume that tail padding isn't reused, which wreaks havoc on this example:

#include <algorithm>
#include <stdio.h>

struct A {
    int m_a;
};

struct B : A {
    int m_b1;
    char m_b2;
};

struct C : B {
    short m_c;
};

int main() {
    C c1 { 1, 2, 3, 4 };
    B& b1 = c1;
    B b2 { 5, 6, 7 };

    printf("before operator=: %d\n", int(c1.m_c));  // 4
    b1 = b2;
    printf("after operator=: %d\n", int(c1.m_c));  // 4

    printf("before std::copy: %d\n", int(c1.m_c));  // 4
    std::copy(&b2, &b2 + 1, &b1);
    printf("after std::copy: %d\n", int(c1.m_c));  // 64, or 0, or anything but 4
}

Here, = does the right thing (it does not override B's tail padding), but copy() has a library optimization that reduces to memmove() - which does not care about tail padding because it assumes it does not exist.

147

answered Sep 18 '22 01:09

Barry

Related questions
                            
                                How to detect when browser throttles timers and websockets disconnection after a user leaves a tab or turns off the screen? (javascript)
                            
                                What do I need to escape when sending a query?
                            
                                HTML Editor in a Windows Forms Application [closed]
                            
                                How to get the source file name and the line number of a type member?
                            
                                MSBuild - can it work out project dependencies in a solution file? If so how?
                            
                                What's the purpose of claims-based authorization?
                            
                                NUnit "missing" GPSVC.DLL on Windows 7/64
                            
                                Retrieve Facebook Fan Names
                            
                                Why does .NET decimal.ToString(string) round away from zero, apparently inconsistent with the language spec?
                            
                                In MVVM with WPF how do I unit test the link between the ViewModel and the View
                            
                                Integrating Emacs org-mode with email?
                            
                                IPhone OAuth Tutorial? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Standard-layout and tail padding

Tags:

Barry

People also ask

1 Answers

Barry

Recent Activity

Donate For Us