Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

char four[4] = "four"; What are the correct semantics for this statement?

int main(void)
{
    char four[4] = "four";
    return 0;
}

When compiled as a C++ program, G++ reports

xxx.cpp: In function int main():

xxx.cpp:3: error: initializer-string for array of chars is too long

When compiled a a C program, GCC reports no error.

It appears to me, that the assignment is correctly copying all 4 bytes into the variable, as I expected.

So my question boils down to.....

Is the observed behavior in C correct or am I touching an undefined behavior somewhere, or is it something else altogether?

like image 212
EvilTeach Avatar asked Aug 19 '10 16:08

EvilTeach


2 Answers

Short answer: your code is valid C, but not valid C++.

Long Aswer:

"four" is actually 5 characters long - there is a \0 added there for you. In section 6.7.8 Initialization, paragraph 13, the C standard says:

An array of character type may be initialized by a character string literal, optionally enclosed in braces. Successive characters of the character string literal (including the terminating null character if there is room or if the array is of unknown size) initialize the elements of the array.

So the \0 is just ignored in your program when it is compiled as C. C++ is treating it differently. In fact, this particular case is called out explicitly in the C++ spec (Section 8.5.2 Character arrays, paragraph 2):

There shall not be more initializers than there are array elements. [ Example:

char cv[4] = "asdf";  // error

is ill-formed since there is no space for the implied trailing ’\0’. — end example ]

like image 116
Carl Norum Avatar answered Sep 21 '22 06:09

Carl Norum


The string "four" actually contains five bytes: the four letters plus a zero byte (\0) as a string terminator. It's been a while since I've written C or C++, but I would guess the C compiler is silently ignoring it for whatever reason.

like image 45
fizban Avatar answered Sep 20 '22 06:09

fizban