The following compiles and prints "string" as an output. <pre class="prettyprint"><code>#include <stdio.h> struct S { int x; char c[7]; }; struct S bar() { struct S s = {42, "string"}; return s; } int main() { printf("%s", bar().c); } </code></pre> Apparently this seems to invokes an undefined behavior according to <blockquote> C99 6.5.2.2/5 If an attempt is made to modify the result of a function call or to access it after the next sequence point, the behavior is undefined. </blockquote> I don't understand where it says about "next sequence point". What's going on here?

You've run into a subtle corner of the language. An expression of array type is, in most contexts, implicitly converted to a pointer to the first element of the array object. The exceptions, none of which apply here, are: <ul> <li>When the array expression is the operand of a unary <code>&</code> operator (which yields the address of the entire array);</li> <li>When it's the operand of a unary <code>sizeof</code> <strike>or (as of C11) <code>_Alignof</code></strike> operator (<code>sizeof arr</code> yields the size of the array, not the size of a pointer); and</li> <li>When it's a string literal in an initializer used to initialize an array object (<code>char str[6] = "hello";</code> doesn't convert <code>"hello"</code> to a <code>char*</code>.)</li> </ul> (The N1570 draft incorrectly adds <code>_Alignof</code> to the list of exceptions. In fact, for reasons that are not clear, <code>_Alignof</code> can only be applied to a type name, not to an expression.) Note that there's an implicit assumption: that the array expression refers to an array object in the first place. In most cases, it does (the simplest case is when the array expression is the name of a declared array object) -- but in this one case, there is no array object. If a function returns a struct, the struct result is returned by value. In this case, the struct contains an array, giving us an array value with no corresponding array object, at least logically. So the array expression <code>bar().c</code> decays to a pointer to the first element of ... er, um, ... an array object that doesn't exist. The 2011 ISO C standard addresses this by introducing "temporary lifetime", which applies only to "A non-lvalue expression with structure or union type, where the structure or union contains a member with array type" (N1570 6.2.4p8). Such an object may not be modified, and its lifetime ends at the end of the containing full expression or full declarator. So as of C2011, your program's behavior is well defined. The <code>printf</code> call gets a pointer to the first element of an array that's part of a struct object with temporary lifetime; that object continues to exist until the <code>printf</code> call finishes. But as of C99, the behavior is undefined -- not necessarily because of the clause you quote (as far as I can tell, there is no intervening sequence point), but because C99 doesn't define the array object that would be necessary for the <code>printf</code> to work. If your goal is to get this program to work, rather than to understand why it might fail, you can store the result of the function call in an explicit object: <pre class="prettyprint"><code>const struct s result = bar(); printf("%s", result.c); </code></pre> Now you have a struct object with automatic, rather than temporary, storage duration, so it exists during and after the execution of the <code>printf</code> call.

Undefined behavior: when attempting to access the result of function call

Tags:

c

undefined-behavior

function-calls

c99

The following compiles and prints "string" as an output.

#include <stdio.h>

struct S { int x; char c[7]; };

struct S bar() {
    struct S s = {42, "string"};
    return s;
}

int main()
{
    printf("%s", bar().c);
}

Apparently this seems to invokes an undefined behavior according to

C99 6.5.2.2/5 If an attempt is made to modify the result of a function call or to access it after the next sequence point, the behavior is undefined.

I don't understand where it says about "next sequence point". What's going on here?

399

asked Dec 07 '12 01:12

cpx

1 Answers

You've run into a subtle corner of the language.

An expression of array type is, in most contexts, implicitly converted to a pointer to the first element of the array object. The exceptions, none of which apply here, are:

When the array expression is the operand of a unary & operator (which yields the address of the entire array);
When it's the operand of a unary sizeof ~~or (as of C11) _Alignof~~ operator (sizeof arr yields the size of the array, not the size of a pointer); and
When it's a string literal in an initializer used to initialize an array object (char str[6] = "hello"; doesn't convert "hello" to a char*.)

(The N1570 draft incorrectly adds _Alignof to the list of exceptions. In fact, for reasons that are not clear, _Alignof can only be applied to a type name, not to an expression.)

Note that there's an implicit assumption: that the array expression refers to an array object in the first place. In most cases, it does (the simplest case is when the array expression is the name of a declared array object) -- but in this one case, there is no array object.

If a function returns a struct, the struct result is returned by value. In this case, the struct contains an array, giving us an array value with no corresponding array object, at least logically. So the array expression bar().c decays to a pointer to the first element of ... er, um, ... an array object that doesn't exist.

The 2011 ISO C standard addresses this by introducing "temporary lifetime", which applies only to "A non-lvalue expression with structure or union type, where the structure or union contains a member with array type" (N1570 6.2.4p8). Such an object may not be modified, and its lifetime ends at the end of the containing full expression or full declarator.

So as of C2011, your program's behavior is well defined. The printf call gets a pointer to the first element of an array that's part of a struct object with temporary lifetime; that object continues to exist until the printf call finishes.

But as of C99, the behavior is undefined -- not necessarily because of the clause you quote (as far as I can tell, there is no intervening sequence point), but because C99 doesn't define the array object that would be necessary for the printf to work.

If your goal is to get this program to work, rather than to understand why it might fail, you can store the result of the function call in an explicit object:

const struct s result = bar();
printf("%s", result.c);

Now you have a struct object with automatic, rather than temporary, storage duration, so it exists during and after the execution of the printf call.

110

answered Sep 27 '22 21:09

Keith Thompson

Related questions
                            
                                How do I make sure that I understand C pointers? [closed]
                            
                                Timeout Function
                            
                                How to tell a C or a C++ compiler that pointers are not aliased
                            
                                utf8 aware strncpy
                            
                                Convert double value to a char array in C
                            
                                Difference between nice and setpriority in unix
                            
                                Forward FFT an image and backward FFT an image to get the same result
                            
                                Stopping a receiver thread that blocks on recv()
                            
                                Storing a number greater than 20! (factorial)
                            
                                C/C++ most efficient if statement evaluation
                            
                                Is there way to check the type of a preprocessor symbol value in C/C++
                            
                                When is the file loaded into memory - for fread, fopen and fwrite calls?
                            
                                can anyone explain why size_t type is used with an example?
                            
                                va_list misbehavior on Linux
                            
                                Why is a typedef not allowed in the inner struct?
                            
                                Fscanf or Fgets? Reading a file line after line
                            
                                What is meaning of ":" in struct C [duplicate]
                            
                                Casting NULL to a struct pointer in C?
                            
                                How to properly choose rng seed for parallel processes
                            
                                Eclipse can't link to kernel32.lib

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With