When would the compiler be conservative regarding pointer dereferencing optimization, if at all?

Tags: c, optimization, pointers, gcc, dereference

So, I recently took an interest in how good the compiler (gcc (GCC) 4.8.3 being the one in question) is at optimizing pointers and pointer dereferences.

Initially I created a simple integer and an integer pointer and performed operations on them so I could print the result. As expected, all the hard-coded operations were optimized away, whether they went through a dereferenced pointer or not.

call    __main
leaq    .LC0(%rip), %rcx
movl    $1, %edx
call    printf

Even after I created a function that takes an int pointer, dereferences it and changes the value, the code was still perfectly optimized.

call    __main
leaq    .LC0(%rip), %rcx
movl    $-1, %edx
call    printf

Now, when I treated my pointer as a void * and made changes by casting it to char * and dereferencing it, it actually still optimized well (with an 'extra' mov instruction, since the value is first written as a 4-byte int and then as a 1-byte char through the dereferenced pointer)

call    __main
movl    $4, 44(%rsp)
movb    $2, 44(%rsp)
leaq    .LC0(%rip), %rcx
movl    44(%rsp), %eax
leal    1(%rax), %edx
call    printf

So onto my question(s):

How consistent is compiler optimization regarding pointer dereferencing? What would be some cases where it would choose to be conservative?
If all of the pointers in a project were declared with the restrict keyword, could I trust that the code would be as well optimized as if 'no pointers were being used at all'?

(assuming there are no volatile cases )

Ps¹.: I am aware the compiler generally does a good enough job, and that a programmer worrying about aiding the compiler in minor optimizations is, in general, unproductive (as so many point out in Stack Overflow answers to questions regarding optimization). Yet I am still curious about the matter.

Ps².: gcc -O3 -S -c main.c was the command used to generate the assembly code

C Code: (as requested)

#include <stdio.h>

int main (void)
{
    int a = 4;
    int *ap = &a;

    *ap = 0;
    a += 1;

    printf("%d\n", a);
    return 0;
}

#include <stdio.h>

void change(int *p) {
    *p -= 2;
}

int main (void)
{
    int a = 4;
    int *ap = &a;

    *ap = 0;
    change(ap);
    a += 1;

    printf("%d\n", a);
    return 0;
}

#include <stdio.h>

void change(void *p) {
    *((char*)p) += 2;
}

int main (void)
{
    int a = 4;
    void *ap = (void*) &a;

    *((char*)(ap)) = 0;
    change(ap);
    a += 1;

    printf("%d\n", a);
    return 0;
}

asked Jul 20 '15 22:07

SSWilks

1 Answer

LLVM and GCC both convert code to static single assignment (SSA) form as part of optimization analysis. One useful property of SSA form is that it shows the flow of influence between assignments precisely: the compiler knows which assignments lead to which other assignments, and so can work out which values can influence which others.

The first influence chain looks something like

a₁ -> constant(0) -> ap -> a₂

The second looks like

a₁ -> constant(0) -> ap -> p -> a₂

The third is pretty similar to the second. (Sorry, this notation is pretty much made up, but I hope it illustrates my point.)

Because it is fairly simple to prove that the influence of a on ap is deterministic, the compiler will feel free to dereference 'early' and combine the instructions into one. (In the first two cases this isn't the most accurate statement, since the constant overwrites the original value and lets the compiler prove that the original assignment does not flow to the end of the code.)

Causing the compiler to be more conservative about dereferencing would involve getting complicated enough to escape the compiler's understanding (difficult in a static program, I think) or, more likely, causing the compiler to insert a phi function during SSA construction (in layman's terms, making the assignment depend on multiple previous assignments) in a way that cannot be resolved at compile time.
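As a rough illustration of a case that escapes the compiler's understanding, here is a minimal sketch, assuming a function that is compiled on its own and so cannot see where its pointer arguments came from. The names store_both and demo are made up for this example and are not from the question.

```c
#include <stdio.h>

/* Hypothetical example: the compiler cannot prove that x and y point to
 * different ints, so after the store through y it has to reload *x from
 * memory instead of assuming it still holds 1. */
int store_both(int *x, int *y)
{
    *x = 1;
    *y = 2;
    return *x;   /* 1 if x and y are distinct, 2 if they alias */
}

/* Usage sketch: the result depends on whether the arguments alias. */
void demo(void)
{
    int a = 0, b = 0;
    printf("%d\n", store_both(&a, &b));  /* distinct objects */
    printf("%d\n", store_both(&a, &a));  /* same object passed twice */
}
```

If the call is inlined into a caller where both arguments are visible, as in the question's programs, the compiler can usually resolve the aliasing itself and fold the result again; the conservative reload is expected mainly when the function is compiled without that context.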

The restrict keyword is a promise to the compiler that, for the lifetime of the pointer, an object modified through it is accessed only through that pointer (or pointers derived from it), so two restrict pointers can be assumed not to alias. This wouldn't remove the dereference at runtime if the code that produced the pointer still had a nondeterministic source (for example, if runtime-created data influenced which pointer value was dereferenced; I think this could happen if a serialized pointer was sent into the program from an external source).
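A hedged sketch of what restrict buys in the simplest aliasing case; store_both_restrict is a made-up name, and the code assumes a C99 or later compiler mode (for example -std=c99, since older gcc defaults do not treat restrict as a keyword).

```c
#include <stdio.h>

/* Hypothetical example: restrict promises that the int written through x
 * is not also accessed through y inside this function, so the compiler is
 * allowed to return the constant 1 without reloading *x. */
int store_both_restrict(int *restrict x, int *restrict y)
{
    *x = 1;
    *y = 2;
    return *x;   /* may be folded to 1 by the compiler */
}

/* Usage sketch: only ever call this with pointers to distinct objects. */
void demo_restrict(void)
{
    int a = 0, b = 0;
    printf("%d\n", store_both_restrict(&a, &b));
}
```

The promise is the programmer's responsibility: passing the same address for both arguments here is undefined behavior, and the compiler is not required to diagnose it. The keyword also says nothing about which address the pointer holds, so it does not help when the pointer value itself is only known at run time.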

answered Oct 02 '22 09:10

argentage
