In C++, most of the optimizations are derived from the as-if rule. That is, as long as the program behaves as-if no optimization had taken place, then they are valid. The Empty Base Optimization is one such trick: in some conditions, if the base class is empty (does not have any non-static data member), then the compiler may elide its memory representation. Apparently it seems that the standard forbids this optimization on data members, that is even if a data member is empty, it must still take at least one byte worth of place: from n3225, [class] <blockquote> 4 - Complete objects and member subobjects of class type shall have nonzero size. </blockquote> Note: this leads to the use of private inheritance for Policy Design in order to have EBO kick in when appropriate I was wondering if, using the as-if rule, one could still be able to perform this optimization. <hr> edit: following a number of answers and comments, and to make it clearer what I am wondering about. First, let me give an example: <pre class="prettyprint"><code>struct Empty {}; struct Foo { Empty e; int i; }; </code></pre> My question is, why is <code>sizeof(Foo) != sizeof(int)</code> ? In particular, unless you specify some packing, chances are due to alignment issues that Foo will be twice the size of int, which seems ridiculously inflated. Note: my question is not why is <code>sizeof(Foo) != 0</code>, this is not actually required by EBO either According to C++, it is because no sub-object may have a zero size. However a base is authorized to have a zero size (EBO) therefore: <pre class="prettyprint"><code>struct Bar: Empty { int i; }; </code></pre> is likely (thanks to EBO) to obey <code>sizeof(Bar) == sizeof(int)</code>. Steve Jessop seems to be of an opinion that it is so that no two sub-objects would have the same address. I thought about it, however it doesn't actually prevent the optimization in most cases: If you have "unused" memory, then it is trivial: <pre class="prettyprint"><code>struct UnusedPadding { Empty e; Empty f; double d; int i; }; // chances are that the layout will leave some memory after int </code></pre> But in fact, it's even "worse" than that, because <code>Empty</code> space is never written to (you'd better not if EBO kicks in...) and therefore you could actually place it at an occupied place that is not the address of another object: <pre class="prettyprint"><code>struct Virtual { virtual ~Virtual() {} Empty e; Empty f; int i; }; // most compilers will reserve some space for a virtual pointer! </code></pre> Or, even in our original case: <pre class="prettyprint"><code>struct Foo { Empty e; int i; }; // deja vu! </code></pre> One could have <code>(char*)foo.e == (char*)foo.i + 1</code> if all we wanted were different address.

It is coming to c++20 with the <code>[[no_unique_address]]</code> attribute. The proposal P0840r2 has been accepted into the draft standard. It has this example: <pre class="prettyprint"><code>template<typename Key, typename Value, typename Hash, typename Pred, typename Allocator> class hash_map { [[no_unique_address]] Hash hasher; [[no_unique_address]] Pred pred; [[no_unique_address]] Allocator alloc; Bucket *buckets; // ... public: // ... }; </code></pre>

Empty Data Member Optimization: would it be possible?

Tags:

c++

compiler-optimization

In C++, most of the optimizations are derived from the as-if rule. That is, as long as the program behaves as-if no optimization had taken place, then they are valid.

The Empty Base Optimization is one such trick: in some conditions, if the base class is empty (does not have any non-static data member), then the compiler may elide its memory representation.

Apparently it seems that the standard forbids this optimization on data members, that is even if a data member is empty, it must still take at least one byte worth of place: from n3225, [class]

4 - Complete objects and member subobjects of class type shall have nonzero size.

Note: this leads to the use of private inheritance for Policy Design in order to have EBO kick in when appropriate

I was wondering if, using the as-if rule, one could still be able to perform this optimization.

edit: following a number of answers and comments, and to make it clearer what I am wondering about.

First, let me give an example:

struct Empty {};

struct Foo { Empty e; int i; };

My question is, why is sizeof(Foo) != sizeof(int) ? In particular, unless you specify some packing, chances are due to alignment issues that Foo will be twice the size of int, which seems ridiculously inflated.

Note: my question is not why is sizeof(Foo) != 0, this is not actually required by EBO either

According to C++, it is because no sub-object may have a zero size. However a base is authorized to have a zero size (EBO) therefore:

struct Bar: Empty { int i; };

is likely (thanks to EBO) to obey sizeof(Bar) == sizeof(int).

Steve Jessop seems to be of an opinion that it is so that no two sub-objects would have the same address. I thought about it, however it doesn't actually prevent the optimization in most cases:

If you have "unused" memory, then it is trivial:

struct UnusedPadding { Empty e; Empty f; double d; int i; };
// chances are that the layout will leave some memory after int

But in fact, it's even "worse" than that, because Empty space is never written to (you'd better not if EBO kicks in...) and therefore you could actually place it at an occupied place that is not the address of another object:

struct Virtual { virtual ~Virtual() {} Empty e; Empty f; int i; };
// most compilers will reserve some space for a virtual pointer!

Or, even in our original case:

struct Foo { Empty e; int i; }; // deja vu!

One could have (char*)foo.e == (char*)foo.i + 1 if all we wanted were different address.

480

asked Jan 07 '11 09:01

Matthieu M.

2 Answers

It is coming to c++20 with the [[no_unique_address]] attribute.

The proposal P0840r2 has been accepted into the draft standard. It has this example:

template<typename Key, typename Value, typename Hash, typename Pred, typename Allocator>
class hash_map {
  [[no_unique_address]] Hash hasher;
  [[no_unique_address]] Pred pred;
  [[no_unique_address]] Allocator alloc;
  Bucket *buckets;
  // ...
public:
  // ...
};

answered Sep 17 '22 13:09

Ozirus

Under the as-if rule:

struct A {
    EmptyThing x;
    int y;
};

A a;
assert((void*)&(a.x) != (void*)&(a.y));

The assert must not be triggered. So I don't see any benefit in secretly making x have size 0, when you'd just need to add padding to the structure anyway.

I suppose in theory a compiler could track whether pointers might be taken to the members, and make the optimization only if they definitely aren't. This would have limited use, since there'd be two different versions of the struct with different layouts: one for the optimized case and one for general code.

But for example if you create an instance of A on the stack, and do something with it that is entirely inlined (or otherwise visible to the optimizer), yes, parts of the struct could be completely omitted. This isn't specific to empty objects, though - an empty object is just a special case of an object whose storage isn't accessed, and therefore could in some situations never be allocated at all.

answered Sep 20 '22 13:09

Steve Jessop

Related questions
                            
                                Is it OK to define operator<< or operator>> for FILE&?
                            
                                Generic way of lazily evaluating (short-circuiting) template conditional types
                            
                                How do I convert an armadillo matrix to a vector of vectors?
                            
                                Does the function override the base function?
                            
                                Constant expression initializer for static class member of type double
                            
                                emplace_back() issue under VS2013
                            
                                std::regex, to match begin/end of string
                            
                                atoi() for int128_t type
                            
                                Why does clang still need libgcc.a to compile my code?
                            
                                What is the purpose of std::forward()'s rvalue reference overload?
                            
                                Does malloc return an "invalid pointer value" in C++17? [duplicate]
                            
                                How do c++ compilers find an extern variable?
                            
                                What's a good way to store a small, fixed size, hierarchical set of static data?
                            
                                Profiler for Visual Studio 2008, C++?
                            
                                How do I examine the contents of an std::vector in gdb, using the icc compiler?
                            
                                Singleton - Why use classes?
                            
                                Class template specializations with shared functionality
                            
                                Unsequenced value computations (a.k.a sequence points)
                            
                                Strings and character encoding in C++
                            
                                QuadTree for 2D collision detection

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With