From this comment in GCC bug #53119: <blockquote> In C, <code>{0}</code> is the universal zero initializer equivalent to C++'s <code>{}</code> (the latter being invalid in C). It is necessary to use whenever you want a zero-initialized object of a complete but conceptually-opaque or implementation-defined type. The classic example in the C standard library is <code>mbstate_t</code>: <pre class="prettyprint"><code>mbstate_t state = { 0 }; /* correctly zero-initialized */ </code></pre> versus the common but nonportable: <pre class="prettyprint"><code>mbstate_t state; memset(&state, 0, sizeof state); </code></pre> </blockquote> It strikes me as odd that the latter version could be unportable (even for implementation-defined types, the compiler has to know the size). What is the issue here and when is a <code>memset(x, 0, sizeof x)</code> unportable?

Noting a difference in behavior between the two methods... In <code>...= {0};</code> if padding bytes exist, they will not be cleared. But a call to <code>memset()</code> will clear padding. From here <blockquote> "Possible implementation of mbstate_t is a struct type holding an array representing the incomplete multibyte character, an integer counter indicating the number of bytes in the array that have been processed, and a representation of the current shift state." </blockquote> In the case <code>mbstate_t</code> is implemented as a <code>struct</code> it is notable that <code>{0}</code> will not set padding bytes that may exist to <code>zero</code>, making the following assumption debatable: <pre class="prettyprint"><code>mbstate_t state = { 0 }; /* correctly zero-initialized */ </code></pre> <code>memset()</code> however does include padding bytes. <pre class="prettyprint"><code>memset(state , 0, sizeof state);//all bytes in memory region of test will be cleared </code></pre>

When is memset to 0 nonportable? [duplicate]

Tags:

c

portability

memset

From this comment in GCC bug #53119:

In C, {0} is the universal zero initializer equivalent to C++'s {} (the latter being invalid in C). It is necessary to use whenever you want a zero-initialized object of a complete but conceptually-opaque or implementation-defined type. The classic example in the C standard library is mbstate_t:
mbstate_t state = { 0 }; /* correctly zero-initialized */
versus the common but nonportable:
mbstate_t state;
memset(&state, 0, sizeof state);

It strikes me as odd that the latter version could be unportable (even for implementation-defined types, the compiler has to know the size). What is the issue here and when is a memset(x, 0, sizeof x) unportable?

990

asked Nov 30 '21 13:11

Felix Dombek

Video Answer

3 Answers

memset(p, 0, n) sets to all-bits-0.
An initializer of { 0 } sets to the value 0.
On just about any machine you've ever heard of, the two concepts are equivalent.

However, there have been machines where the floating-point value 0.0 was not represented by a bit pattern of all-bits-0. And there have been machines where a null pointer was not represented by a bit pattern of all-bits-0, either. On those machines, an initializer of { 0 } would always get you the zero initialization you wanted, while memset might not.

See also question 7.31 and question 5.17 in the C FAQ list.

Postscript: One other difference, as pointed out by @ryker: memset will set any "holes" in a padded structure to 0, while setting that structure to { 0 } might not.

179

answered Oct 17 '22 14:10

Steve Summit

The reason for this has to do with how types are represented.

Section 6.7.9p10 of the C standard describes how fields are initialized as follows:

If an object that has automatic storage duration is not initialized explicitly, its value is indeterminate. If an object that has static or thread storage duration is not initialized

explicitly, then:

if it has pointer type, it is initialized to a null pointer;

if it has arithmetic type, it is initialized to (positive or unsigned) zero;

if it is an aggregate, every member is initialized (recursively) according to these rules, and any padding is initialized to zero bits;

if it is a union, the first named member is initialized (recursively) according to these rules, and any padding is initialized to zero bits

And p21 also states:

If there are fewer initializers in a brace-enclosed list than there are elements or members of an aggregate, or fewer characters in a string literal used to initialize an array of known size than there are elements in the array, the remainder of the aggregate shall be initialized implicitly the same as objects that have static storage duration

The difference between this and setting all bytes to zero is that some of the above values may not necessarily be represented by all bits zero.

For example, there are some architectures where the address 0 is a valid address. This means that a null pointer is not represented as all bits zero. (Note: (void *)0 is specified as a null pointer constant by the standard, however the implementation will treat this as whatever the representation of a null pointer is)

The standard also doesn't mandate a particular representation for floating point types. While the most common representation, IEEE754, does use all bits 0 to represent the value +0, this is not necessarily true for other representations.

answered Oct 17 '22 14:10

dbush

Noting a difference in behavior between the two methods...

In ...= {0}; if padding bytes exist, they will not be cleared.
But a call to memset() will clear padding.

From here

"Possible implementation of mbstate_t is a struct type holding an array representing the incomplete multibyte character, an integer counter indicating the number of bytes in the array that have been processed, and a representation of the current shift state."

In the case mbstate_t is implemented as a struct it is notable that {0} will not set padding bytes that may exist to zero, making the following assumption debatable:

mbstate_t state = { 0 }; /* correctly zero-initialized */

memset() however does include padding bytes.

memset(state , 0, sizeof state);//all bytes in memory region of test will be cleared

answered Oct 17 '22 12:10

ryyker

Related questions
                            
                                How to include header AND source files folder in Visual Studio
                            
                                When writing a python extension in C, how does one pass a python function in to a C function?
                            
                                What is the point of the {U,}INTn_C macros in stdint.h?
                            
                                (How) can I predict the runtime of a code snippet using LLVM Machine Code Analyzer?
                            
                                How to exclude an C enum case when using the type in Swift
                            
                                What happens to a pointer to another pointer when the first one is freed? [duplicate]
                            
                                Initializing an atomic_flag
                            
                                why floor, ceil implementation return x + x when x is NaN or inf?
                            
                                Is using sentinel pointer values near the maximum underlying integer value safe?
                            
                                How to set the host exception port using the Mach kernel on macOS?
                            
                                How does one prove simple equalities of non-deterministic values in Frama-C + EVA?
                            
                                Why does the return from "system" not match the return of the script that was called?
                            
                                Minimal example using LLVM's C API yields error: function and module have different contexts
                            
                                Memory location of variables defined in a shared library
                            
                                Is there any way of checking if file system is case-insensitive in the preprocessors?
                            
                                Is it safe to read and write to an array at different positions from multiple threads in C with phtreads?
                            
                                How to use exit() safely from any thread
                            
                                Why can't the size of a static array be made variable?
                            
                                Write the prototype for a function that takes an array of exactly 16 integers
                            
                                How to test for lossless double / integer conversion?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With