c99 __restrict and compiler optimization

Tags: c, optimization, compiler-construction, c99, restrict-qualifier

typedef struct {
    void * field1;
} s1;

void func1(void) {
    s1 my_s1;
    s1 * __restrict my_s1_ptr = &my_s1;
    *((int*)((char*)my_s1_ptr->field1 + 4))  = 0;
    *((int*)((char*)my_s1_ptr->field1 + 8))  = 1;
    *((int*)((char*)my_s1_ptr->field1 + 12)) = 2;
    *((int*)((char*)my_s1_ptr->field1 + 16)) = 3;
}

Both version 11.1 of the Intel compiler and version 4.6 of gcc appear to reload my_s1_ptr->field1 for each of the 4 store statements. My understanding of __restrict suggests that the last 3 loads are redundant and could be eliminated. Yes, I know the code is weird, but there is a reason it is structured this way. I would just like to get the compiler to eliminate the redundant loads. Any idea how to convince it to do that?


asked Jun 08 '12 01:06

DrTodd13


1 Answer

s1 * __restrict means that this is the only pointer to a particular s1, so there is no alias of that type. It does not mean that there will be no aliases through other pointer types, such as void*, int*, or char*.

Using a char* is especially troublesome for the compiler, because a char* is specifically allowed to access the bytes of any other type (char doubles as the language's byte type).

If the compiler cannot prove that your assignments will never, ever change the value stored in field1, it will have to reload the pointer each time. For example, how can it tell that void* field1 isn't pointing to itself?
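To make the "pointing to itself" case concrete, here is a minimal sketch. The function name is hypothetical, and it deliberately writes through an unsigned char* so that the store is well defined; it only illustrates that a store made through field1 can land on field1's own bytes.

```c
#include <string.h>

typedef struct {
    void * field1;
} s1;

/* Hypothetical demo: returns 1 if a store made through field1
   changed the bytes of field1 itself, 0 otherwise. */
int store_through_field1_changes_field1(void) {
    s1 s;
    unsigned char before[sizeof s.field1];
    unsigned char *bytes;

    s.field1 = &s;  /* field1 now points at its own storage */

    /* Snapshot the pointer's bytes before the store. */
    memcpy(before, &s.field1, sizeof before);

    /* Character types may access the bytes of any object,
       so this store legally overwrites part of field1. */
    bytes = (unsigned char *)s.field1;
    bytes[0] ^= 0xFFu;

    /* Compare raw bytes rather than reading the modified pointer. */
    return memcmp(before, &s.field1, sizeof before) != 0;
}
```

Because a situation like this is possible in general, a compiler that cannot rule it out may choose to reload field1 after every store.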

And wouldn't something like this work without all the casts?

int* p = my_s1.field1;
p[1] = 0;
p[2] = 1;
p[3] = 2;
p[4] = 3;

This assumes that an int is 4 bytes and that field1 actually points to an array of them.
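If the byte offsets from the question have to stay, the same idea can be sketched by loading field1 once into a local variable. This is only an illustration: store_four and base are hypothetical names, and it assumes field1 points to int-aligned storage of at least 20 bytes with a 4-byte int.

```c
typedef struct {
    void * field1;
} s1;

/* Hypothetical helper: stores 0..3 at byte offsets 4, 8, 12 and 16
   from field1, as in the question, but reads field1 only once. */
void store_four(s1 *s) {
    /* The address of this local is never taken, so none of the
       stores below can modify it and no reload is needed. */
    char *base = (char *)s->field1;

    *(int *)(base + 4)  = 0;
    *(int *)(base + 8)  = 1;
    *(int *)(base + 12) = 2;
    *(int *)(base + 16) = 3;
}
```

Whether a given compiler then emits a single load is worth confirming in the generated assembly; the source at least no longer asks for field1 to be read four times.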


answered Sep 17 '22 04:09

Bo Persson
