This question is related to another post of mine: why allocate_shared and make_shared so slow In here I can describe the question more clearly. Think about the following code: <pre class="prettyprint"><code>struct A { char data_[0x10000]; }; class C { public: C() : a_() { } A a_; }; int main() { C c; return 0; } </code></pre> I found for the code <code>C() : a_()</code>, the compiler uses <code>memset(addr,0,0x10000)</code> as the constructor of the A. And if the type A has a empty constructor, the asm code is right. To describe the issue more clearly, I wrote some test code: <pre class="prettyprint"><code>#include <stdlib.h> struct A { //A() {} char data_[0x10000]; void dummy() { // avoid optimize erase by compiler data_[rand() % sizeof(data_)] = 1; } int dummy2() { // avoid optimize erase by compiler return data_[0]; } }; class B { public: template<class ... T> B(T&...t) : a_(std::forward<T>(t)...) { } A a_; }; class C { public: C() : a_() { } A a_; }; template<class ... T> int test(T&...t) { A a(t...); a.dummy(); return a.dummy2(); } int main() { A a; a.dummy(); auto r1 = a.dummy2(); auto r2 = test(); B b; b.a_.dummy(); auto r3 = b.a_.dummy2(); C c; c.a_.dummy(); auto r4 = c.a_.dummy2(); return r1 + r2 + r3 + r4; } </code></pre> I compiled the code with vs2017, in windows 10, x86 release build. Then I checked the asm code: <pre class="prettyprint"><code>template<class ... T> int test(T&...t) { 00E510B8 call _chkstk (0E51CE0h) 00E510BD mov eax,dword ptr [__security_cookie (0E53004h)] 00E510C2 xor eax,ebp 00E510C4 mov dword ptr [ebp-4],eax A a(t...); 00E510C7 push 10000h 00E510CC lea eax,[a] 00E510D2 push 0 00E510D4 push eax 00E510D5 call _memset (0E51C3Ah) 00E510DA add esp,0Ch a.dummy(); 00E510DD call dword ptr [__imp__rand (0E520B4h)] } 00E510E3 mov ecx,dword ptr [ebp-4] </code></pre> It is very clear that the function <code>test()</code> calls <code>memset(p, 0, 0x10000)</code>. And if I add an empty constructor in A (line <code>A(){}</code>), the compiler removes the memset. So why does the code call memset when type A does not have constructor but does not call memset when A has a constructor? Is it part of the c++ standard, or just a compiler bug? Obviously the memset(p, 0, sizeof(T)) is useless and harmful which slows down the program. How do I workaround it?

Doesn't this explain it? We can see that: <blockquote> Zero initialization is performed [...] as part of value-initialization sequence for [...] members of value-initialized class types that have no constructors, including value initialization of elements of aggregates for which no initializers are provided. </blockquote> ... <blockquote> Value initialization is performed [...] when a non-static data member or a base class is initialized using a member initializer with an empty pair of parentheses or braces (since C++11); </blockquote> So putting <code>a_()</code> in the member initializer list falls into the latter case, that as a result invokes zero initialization of the array. To answer your question: to me this seems to be a standard behavior not a compiler bug.

why c++ use memset(addr,0,sizeof(T)) to construct a object? Standard or compiler bug?

This question is related to another post of mine: why allocate_shared and make_shared so slow

In here I can describe the question more clearly.

Think about the following code:

struct A {
    char data_[0x10000];
};

class C {
public:
    C() : a_() { }
    A a_;
};

int main() {
    C c;
    return 0;
}

I found for the code C() : a_(), the compiler uses memset(addr,0,0x10000) as the constructor of the A. And if the type A has a empty constructor, the asm code is right.

To describe the issue more clearly, I wrote some test code:

#include <stdlib.h>

struct A {
    //A() {}
    char data_[0x10000];
    void dummy() { // avoid optimize erase by compiler
        data_[rand() % sizeof(data_)] = 1;
    }
    int dummy2() { // avoid optimize erase by compiler
        return data_[0];
    }
};

class B {
public:
    template<class ... T> B(T&...t) 
        : a_(std::forward<T>(t)...) {
    }
    A a_;
};

class C {
public:
    C() : a_() {
    }
    A a_;
};

template<class ... T>
int test(T&...t) {
    A a(t...);
    a.dummy();
    return a.dummy2();
}

int main() {
    A a;
    a.dummy();
    auto r1 = a.dummy2();

    auto r2 = test();

    B b;
    b.a_.dummy();
    auto r3 = b.a_.dummy2();

    C c;
    c.a_.dummy();
    auto r4 = c.a_.dummy2();
    return r1 + r2 + r3 + r4;
}

I compiled the code with vs2017, in windows 10, x86 release build. Then I checked the asm code:

template<class ... T>
int test(T&...t) {
00E510B8  call        _chkstk (0E51CE0h)  
00E510BD  mov         eax,dword ptr [__security_cookie (0E53004h)]  
00E510C2  xor         eax,ebp  
00E510C4  mov         dword ptr [ebp-4],eax  
    A a(t...);
00E510C7  push        10000h  
00E510CC  lea         eax,[a]  
00E510D2  push        0  
00E510D4  push        eax  
00E510D5  call        _memset (0E51C3Ah)  
00E510DA  add         esp,0Ch  
    a.dummy();
00E510DD  call        dword ptr [__imp__rand (0E520B4h)]  
}
00E510E3  mov         ecx,dword ptr [ebp-4]

It is very clear that the function test() calls memset(p, 0, 0x10000).

And if I add an empty constructor in A (line A(){}), the compiler removes the memset.

So why does the code call memset when type A does not have constructor but does not call memset when A has a constructor?

Is it part of the c++ standard, or just a compiler bug?

Obviously the memset(p, 0, sizeof(T)) is useless and harmful which slows down the program. How do I workaround it?

What is the use of memset in C?

Description. The memset() function sets the first count bytes of dest to the value c. The value of c is converted to an unsigned character.

Why do we need to memset?

memset() is used to fill a block of memory with a particular value. The syntax of memset() function is as follows : // ptr ==> Starting address of memory to be filled // x ==> Value to be filled // n ==> Number of bytes to be filled starting // from ptr to be filled void *memset(void *ptr, int x, size_t n);

What can I use instead of memset?

There isn't a standard function for this - you will just need to call memcpy() in a loop: my_stuff *my_array = malloc(MAX * sizeof(my_stuff)); my_stuff tmp; size_t i; tmp.

Why does memset take an int?

memset predates (by quite a bit) the addition of function prototypes to C. Without a prototype, you can't pass a char to a function -- when/if you try, it'll be promoted to int when you pass it, and what the function receives is an int .

A a(t...);

Will be parsed as initializing a with t....^† When t... is empty, as when you call it, this will be understood as value-initializing a.

For A without a user-provided default constructor, value-initialize is to zero all its members, hence the memset.

When you provide a constructor for A, value-initialize is to call the default constructor, which you defined to be do nothing, therefore no memset will be called.

This is not a bug in the compiler, this is required behaviour. To remove the redundant memset, you could just write A a;. In this case a is default-initialized and no automatic zeroing occurs, with or without the user-provided constructor.

_{† This is important since A a() will be parsed as a function called a with return type A}

Doesn't this explain it?

We can see that:

Zero initialization is performed [...] as part of value-initialization sequence for [...] members of value-initialized class types that have no constructors, including value initialization of elements of aggregates for which no initializers are provided.

...

Value initialization is performed [...] when a non-static data member or a base class is initialized using a member initializer with an empty pair of parentheses or braces (since C++11);

So putting a_() in the member initializer list falls into the latter case, that as a result invokes zero initialization of the array.

To answer your question: to me this seems to be a standard behavior not a compiler bug.

why c++ use memset(addr,0,sizeof(T)) to construct a object? Standard or compiler bug?

Tags:

c++

constructor

c++11

perfect-forwarding

alpha

People also ask

2 Answers

Passer By

K. Kirsz

Recent Activity

Donate For Us

why c++ use memset(addr,0,sizeof(T)) to construct a object? Standard or compiler bug?

Tags:

c++

constructor

c++11

perfect-forwarding

alpha

People also ask

2 Answers

Passer By

K. Kirsz

Related questions

Recent Activity

Donate For Us