This question is related to another post of mine: why allocate_shared and make_shared so slow
In here I can describe the question more clearly.
Think about the following code:
struct A {
char data_[0x10000];
};
class C {
public:
C() : a_() { }
A a_;
};
int main() {
C c;
return 0;
}
I found for the code C() : a_()
, the compiler uses memset(addr,0,0x10000)
as the constructor of the A. And if the type A has a empty constructor, the asm code is right.
To describe the issue more clearly, I wrote some test code:
#include <stdlib.h>
struct A {
//A() {}
char data_[0x10000];
void dummy() { // avoid optimize erase by compiler
data_[rand() % sizeof(data_)] = 1;
}
int dummy2() { // avoid optimize erase by compiler
return data_[0];
}
};
class B {
public:
template<class ... T> B(T&...t)
: a_(std::forward<T>(t)...) {
}
A a_;
};
class C {
public:
C() : a_() {
}
A a_;
};
template<class ... T>
int test(T&...t) {
A a(t...);
a.dummy();
return a.dummy2();
}
int main() {
A a;
a.dummy();
auto r1 = a.dummy2();
auto r2 = test();
B b;
b.a_.dummy();
auto r3 = b.a_.dummy2();
C c;
c.a_.dummy();
auto r4 = c.a_.dummy2();
return r1 + r2 + r3 + r4;
}
I compiled the code with vs2017, in windows 10, x86 release build. Then I checked the asm code:
template<class ... T>
int test(T&...t) {
00E510B8 call _chkstk (0E51CE0h)
00E510BD mov eax,dword ptr [__security_cookie (0E53004h)]
00E510C2 xor eax,ebp
00E510C4 mov dword ptr [ebp-4],eax
A a(t...);
00E510C7 push 10000h
00E510CC lea eax,[a]
00E510D2 push 0
00E510D4 push eax
00E510D5 call _memset (0E51C3Ah)
00E510DA add esp,0Ch
a.dummy();
00E510DD call dword ptr [__imp__rand (0E520B4h)]
}
00E510E3 mov ecx,dword ptr [ebp-4]
It is very clear that the function test()
calls memset(p, 0, 0x10000)
.
And if I add an empty constructor in A (line A(){}
), the compiler removes the memset.
So why does the code call memset when type A does not have constructor but does not call memset when A has a constructor?
Is it part of the c++ standard, or just a compiler bug?
Obviously the memset(p, 0, sizeof(T)) is useless and harmful which slows down the program. How do I workaround it?
Description. The memset() function sets the first count bytes of dest to the value c. The value of c is converted to an unsigned character.
memset() is used to fill a block of memory with a particular value. The syntax of memset() function is as follows : // ptr ==> Starting address of memory to be filled // x ==> Value to be filled // n ==> Number of bytes to be filled starting // from ptr to be filled void *memset(void *ptr, int x, size_t n);
There isn't a standard function for this - you will just need to call memcpy() in a loop: my_stuff *my_array = malloc(MAX * sizeof(my_stuff)); my_stuff tmp; size_t i; tmp.
memset predates (by quite a bit) the addition of function prototypes to C. Without a prototype, you can't pass a char to a function -- when/if you try, it'll be promoted to int when you pass it, and what the function receives is an int .
A a(t...);
Will be parsed as initializing a
with t...
.† When t...
is empty, as when you call it, this will be understood as value-initializing a
.
For A
without a user-provided default constructor, value-initialize is to zero all its members, hence the memset
.
When you provide a constructor for A
, value-initialize is to call the default constructor, which you defined to be do nothing, therefore no memset
will be called.
This is not a bug in the compiler, this is required behaviour. To remove the redundant memset
, you could just write A a;
. In this case a
is default-initialized and no automatic zeroing occurs, with or without the user-provided constructor.
† This is important since A a()
will be parsed as a function called a
with return type A
Doesn't this explain it?
We can see that:
Zero initialization is performed [...] as part of value-initialization sequence for [...] members of value-initialized class types that have no constructors, including value initialization of elements of aggregates for which no initializers are provided.
...
Value initialization is performed [...] when a non-static data member or a base class is initialized using a member initializer with an empty pair of parentheses or braces (since C++11);
So putting a_()
in the member initializer list falls into the latter case, that as a result invokes zero initialization of the array.
To answer your question: to me this seems to be a standard behavior not a compiler bug.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With