How can I zero just the padding bytes of a class?

Tags:

I want to set the padding bytes of a class to 0, since I am saving/loading/comparing/hashing instances at a byte level, and garbage-initialised padding introduces non-determinism in each of those operations.

I know that this will achieve what I want (for trivially copyable types):

struct Example
{
    Example(char a_, int b_)
    {
        memset(this, 0, sizeof(*this));
        a = a_;
        b = b_;
    }
    char a;
    int b;
};

I don't like doing that though, for two reasons: I like constructor initialiser lists, and I know that setting the bits to 0 isn't always the same as zero-initialisation (e.g. pointers and floats don't necessarily have zero values that are all 0 bits).

As an aside, it's obviously limited to types that are trivially copyable, but that's not an issue for me since the operations I listed above (loading/saving/comparing/hashing at a byte level) require trivially copyable types anyway.

What I would like is something like this [magical] snippet:

struct Example
{
    Example(char a_, int b_) : a(a_), b(b_)
    {
        // Leaves all members alone, and sets all padding bytes to 0.
        memset_only_padding_bytes(this, 0);
    }
    char a;
    int b;
};

I doubt such a thing is possible, so if anyone can suggest a non-ugly alternative... I'm all ears :)

595

asked Oct 23 '13 15:10

Ben Hymers

3 Answers

There's no way I know of to do this fully automatically in pure C++. We use a custom code generation system to accomplish this (among other things). You could potentially accomplish this with a macro to which you fed all your member variable names; it would simply look for holes between offsetof(memberA)+sizeof(memberA) and offsetof(memberB).

Alternatively, serialize/hash on a memberwise basis, rather than as a binary blob. That's ten kinds of cleaner.

Oh, one other option -- you could provide an operator new which explicitly cleared the memory before returning it. I'm not a fan of that approach, though..... it doesn't work for stack allocation.

answered Oct 14 '22 00:10

Sneftel

You should never use padded structs when binary writing/reading them. Simply because the padding can vary from one platform to another which will lead to binary incompatibility.

Use some compiler options, like #pragma pack (push, 1) to disable padding when defining those writable structs and restore it with #pragma pack(pop).

This sadly means you're losing the optimization provided by it. If that is a concern, by carefully designing your structs you can manually "pad" them by inserting dummy variables. Then zero-initialization becomes obvious, you just assign zeros to those dummies. I don't recommend that "manual" approach as it's very error-prone, but as you're using binary blob write you're probably concerned already. But by all means, benchmark unpadded structs before.

answered Oct 14 '22 00:10

Agent_L

I faced a similar problem - and simply saying that this is a poor design decision (as per dasblinkenlight's comment) doesn't necessarily help as you may have no control over the hashing code (in my case I was using an external library).

One solution is to write a custom iterator for your class, which iterates through the bytes of the data and skips the padding. You then modify your hashing algorithm to use your custom iterator instead of a pointer. One simple way to do this is to templatize the pointer so that it can take an iterator - since the semantics of a pointer and an iterator are the same, you shouldn't have to modify any code beyond the templatizing.

EDIT: Boost provides a nice library which makes it simple to add custom iterators to your container: Boost.Iterator.

Whichever solution you go for, it is highly preferable to avoid hashing the padding as doing so means that your hashing algorithm is highly coupled with your data structure. If you switch data structures (or as Agent_L mentions, use the same data structure on a different platform which pads differently), then it will produce different hashes. On the other hand, if you only hash the actual data itself, then you will always produce the same hash values no matter what data structure you use later.

answered Oct 14 '22 00:10

JBentley

Related questions
                            
                                Can I use C++ class members initialized in the initializer list, later in the list?
                            
                                why does --list.end() compile?
                            
                                Master Include Files - Good or Bad Practice
                            
                                C++ function returns a rvalue, but that can be assigned a new value?
                            
                                C++: pass function with arbitrary number of parameters as a parameter
                            
                                Linking FreeImage as a static library in VS2010?
                            
                                Signal execution order with Qt::QueuedConnection
                            
                                Brace initialization for class with virtual function
                            
                                What is the size of this class and Why?
                            
                                how to know if a binary contains debugging symbols or not without file, objdump or gdb?
                            
                                c/c++ : header file not found
                            
                                How to link glew in xcode
                            
                                Generating Bitcoin address from ECDSA Public Key
                            
                                Complicated test to check which object instantiates a function call
                            
                                Is there a null character at end of a string object in C++?
                            
                                Cython c++ example fails to recognize c++, why?
                            
                                Algorithm to compute mode
                            
                                Build a 3rd party library from source within an existing Qt project
                            
                                Why can an unnamed struct not be used as a trailing return type?
                            
                                Casting a variadic parameter pack to (void)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I zero just the padding bytes of a class?

Tags:

c++

padding

Ben Hymers

People also ask

3 Answers

Sneftel

Agent_L

JBentley

Recent Activity

Donate For Us