We know that logical-AND operator (<code>&&</code>) guarantees left-to-right evaluation. But I am wondering if the compiler optimizer can ever reorder the memory access instructions for <code>*a</code> and <code>b->foo</code> in the following code, i.e. the optimizer writes instructions that try to access <code>*b</code> before accessing <code>*a</code>. (Consider both <code>a</code> and <code>b</code> to be pointers to memory regions in the heap.) <pre class="prettyprint"><code>if (*a && b->foo) { /* do something */ } </code></pre> One might think that <code>&&</code> causes a sequence point, so the compiler must emit instructions to access <code>*a</code> before accessing <code>*b</code> but after reading the accepted answer at https://stackoverflow.com/a/14983432/1175080, I am not so sure. If you look at this answer, there are semi-colons between statements and they also establish sequence points and therefore they should also prevent reordering, but the answer there seems to indicate that they need compiler level memory barrier despite the presence of semicolons. I mean if you claim that <code>&&</code> establishes a sequence point, then that is true for semicolons in the code at https://stackoverflow.com/a/14983432/1175080. Then why is a compiler-level memory barrier required in that code?

The system can evaluate <code>b->foo</code> until such time as it hits something that exceeds its ability to execute speculatively. Most modern systems can handle a speculative fault and ignore the fault if it turns out that the results of the operation are never used. So it's purely up to the capabilities of the compiler, CPU, and other system components. So long as it can ensure there are no visible consequences to conforming code, it can execute (almost) anything it wants (almost) any time it wants.

<blockquote> But I am wondering if the compiler optimizer can ever reorder the memory access instructions for *a and b->foo in the following code, i.e. the optimizer writes instructions that try to access *b before accessing *a. <pre class="prettyprint"><code>if (*a && b->foo) { /* do something */ } </code></pre> </blockquote> The C semantics for the expression require that <code>*a</code> be evaluated first, and that <code>b->foo</code> be evaluated only if <code>*a</code> evaluated to nonzero. @Jack's answer provides the basis for that in the standard. But your question is about optimizations that compiler performs, and the standard specifies that <blockquote> The semantic descriptions in this International Standard describe the behavior of an abstract machine in which issues of optimization are irrelevant. </blockquote> (C2013, 5.1.2.3/1) An optimizing compiler can produce code that does not conform to the abstract semantics if it produces the same external behavior. In particular, in your example code, if the compiler can prove (or is willing to assume) that the evaluations of <code>*a</code> and <code>b->foo</code> have no externally visible behavior and are independent -- neither has a side effect that impacts the evaluation or side effects of the other -- then it may emit code that evaluates <code>b->foo</code> unconditionally, either before or after evaluating <code>*a</code>. Note that if <code>b</code> is NULL or contains an invalid pointer value then evaluating <code>b->foo</code> has undefined behavior. In that case, evaluation of <code>b->foo</code> is not independent of any other evaluation in the program. As @DavidSchwartz observes, however, even if <code>b</code>'s value may be null or invalid, the compiler may still be able to emit code that speculatively proceeds as if it were valid, and backtracks in the event that that turns out not to be the case. The key point here is that the externally-visible behavior is unaffected by valid optimizations.

Can the C compiler optimizer violate short-circuiting and reorder memory accesses for operands in a logical-AND expression?

Tags:

c

logical-operators

We know that logical-AND operator (&&) guarantees left-to-right evaluation.

But I am wondering if the compiler optimizer can ever reorder the memory access instructions for *a and b->foo in the following code, i.e. the optimizer writes instructions that try to access *b before accessing *a.

(Consider both a and b to be pointers to memory regions in the heap.)

if (*a && b->foo) {
  /* do something */
}

One might think that && causes a sequence point, so the compiler must emit instructions to access *a before accessing *b but after reading the accepted answer at https://stackoverflow.com/a/14983432/1175080, I am not so sure. If you look at this answer, there are semi-colons between statements and they also establish sequence points and therefore they should also prevent reordering, but the answer there seems to indicate that they need compiler level memory barrier despite the presence of semicolons.

I mean if you claim that && establishes a sequence point, then that is true for semicolons in the code at https://stackoverflow.com/a/14983432/1175080. Then why is a compiler-level memory barrier required in that code?

350

asked Jun 27 '16 18:06

Lone Learner

2 Answers

The system can evaluate b->foo until such time as it hits something that exceeds its ability to execute speculatively. Most modern systems can handle a speculative fault and ignore the fault if it turns out that the results of the operation are never used.

So it's purely up to the capabilities of the compiler, CPU, and other system components. So long as it can ensure there are no visible consequences to conforming code, it can execute (almost) anything it wants (almost) any time it wants.

159

answered Sep 20 '22 02:09

David Schwartz

But I am wondering if the compiler optimizer can ever reorder the memory access instructions for *a and b->foo in the following code, i.e. the optimizer writes instructions that try to access *b before accessing *a.
if (*a && b->foo) {
  /* do something */
}

The C semantics for the expression require that *a be evaluated first, and that b->foo be evaluated only if *a evaluated to nonzero. @Jack's answer provides the basis for that in the standard. But your question is about optimizations that compiler performs, and the standard specifies that

The semantic descriptions in this International Standard describe the behavior of an abstract machine in which issues of optimization are irrelevant.

(C2013, 5.1.2.3/1)

An optimizing compiler can produce code that does not conform to the abstract semantics if it produces the same external behavior.

In particular, in your example code, if the compiler can prove (or is willing to assume) that the evaluations of *a and b->foo have no externally visible behavior and are independent -- neither has a side effect that impacts the evaluation or side effects of the other -- then it may emit code that evaluates b->foo unconditionally, either before or after evaluating *a. Note that if b is NULL or contains an invalid pointer value then evaluating b->foo has undefined behavior. In that case, evaluation of b->foo is not independent of any other evaluation in the program.

As @DavidSchwartz observes, however, even if b's value may be null or invalid, the compiler may still be able to emit code that speculatively proceeds as if it were valid, and backtracks in the event that that turns out not to be the case. The key point here is that the externally-visible behavior is unaffected by valid optimizations.

answered Sep 23 '22 02:09

John Bollinger

Related questions
                            
                                Can't run c/c++ codes in code::blocks 13.12 on linuxmint 17: Status 255
                            
                                Does Postfix operator really has a higher precedence than prefix? [closed]
                            
                                How to decode this information from strace output
                            
                                Set the i-th bit to zero? [duplicate]
                            
                                Is it safe to call dlclose after dlsym
                            
                                Unsafe C functions and the replacement
                            
                                Generic bidimensional array
                            
                                MQTT Library on Microcontroller
                            
                                Does freeing an int* which was assigned to a char* (allocated by `malloc`) invoke Undefined Behavior?
                            
                                Reentrancy and Reentrant in C?
                            
                                Is it OK to share the same epoll file descriptor among threads?
                            
                                struct without typedef keyword
                            
                                Debugging in Dev-C++ makes IDE crash and not respond
                            
                                How to perform uint32/float conversion with SSE?
                            
                                Pass char pointer array to function
                            
                                About listen(), accept() in network socket programming(3-way handshaking)
                            
                                printf a float value with precision (number of decimal digits) passed in a variable [duplicate]
                            
                                The width specifier in printf does not work properly with accented characters
                            
                                What is isatty() in C for?
                            
                                implicit declaration of function ‘sched_setaffinity’

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With