Does this pointer casting break strict aliasing rule?

Tags:

c

casting

pointers

strict-aliasing

c99

This is the fast inverse square root implementation from Quake III Arena:

float Q_rsqrt( float number )
{
        long i;
        float x2, y;
        const float threehalfs = 1.5F;

        x2 = number * 0.5F;
        y  = number;
        i  = * ( long * ) &y;                       // evil floating point bit level hacking
        i  = 0x5f3759df - ( i >> 1 );               // what?
        y  = * ( float * ) &i;
        y  = y * ( threehalfs - ( x2 * y * y ) );   // 1st iteration
//      y  = y * ( threehalfs - ( x2 * y * y ) );   // 2nd iteration, this can be removed

        return y;
}

I noticed that long int i takes the dereferenced value at the address (cast to a long *) of float y. The code then performs operations on i before storing the dereferenced value at the address (cast to a float *) of i into y.

Would this break the strict aliasing rule since i is not the same type as y?

I think that perhaps it doesn't since the value is dereferenced and copied; so the operations are performed on a copy rather than the original.

asked Apr 04 '13 18:04

Vilhelm Gray

3 Answers

Yes, this code is badly broken and invokes undefined behavior. In particular, notice these two lines:

    y  = number;
    i  = * ( long * ) &y;                       // evil floating point bit level hacking

Since the lvalue *(long *)&y has type long, the compiler is free to assume it cannot alias an object of type float; thus, the compiler could reorder the store to y and the read through the cast pointer with respect to one another.

To fix it, a union should be used.
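As a minimal sketch of what a union-based fix could look like, the helpers below write one member and read the other. The names float_bits, float_to_bits and bits_to_float are hypothetical and not part of the original code. The sketch assumes a 32-bit float (checked at compile time with C11 static_assert) and uses uint32_t instead of long, because long is 64 bits wide on many 64-bit platforms.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper type: both members occupy the same storage. */
union float_bits {
    float    f;
    uint32_t u;
};

/* Assumption: float is exactly as wide as uint32_t on this implementation. */
static_assert(sizeof(float) == sizeof(uint32_t), "float must be 32 bits wide");

/* Return the object representation of f as an unsigned 32-bit integer. */
static uint32_t float_to_bits(float f)
{
    union float_bits pun;
    pun.f = f;      /* store the value as a float */
    return pun.u;   /* read the same bytes back as an integer */
}

/* Reinterpret a 32-bit pattern as a float. */
static float bits_to_float(uint32_t u)
{
    union float_bits pun;
    pun.u = u;
    return pun.f;
}
```

With helpers like these, the two casts in Q_rsqrt would become i = float_to_bits(y); and y = bits_to_float(i);, with i declared as uint32_t.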

answered Oct 17 '22 13:10

R.. GitHub STOP HELPING ICE

Yes, it breaks aliasing rules.

In modern C, you can change i = * (long *) &y; to:

i = (union { float f; long l; }) {y} .l;

and y = * (float *) &i; to:

y = (union { long l; float f; }) {i} .f;

Provided you have guarantees that, in the C implementation being used, long and float have suitable sizes and representations, then the behavior is defined by the C standard: The bytes of the object of one type will be reinterpreted as the other type.
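Putting the two replacements into the whole function might look like the sketch below. It is one possible rewrite under stated assumptions, not the original Quake III source: the name Q_rsqrt_union is made up for illustration, uint32_t stands in for long so that both union members have the same size, and the static_assert (C11) makes that size requirement explicit. It is only meaningful for positive, finite inputs, like the original.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: float is exactly as wide as uint32_t on this implementation. */
static_assert(sizeof(float) == sizeof(uint32_t), "float must be 32 bits wide");

/* Hypothetical name; same algorithm as Q_rsqrt, with union-based punning. */
float Q_rsqrt_union(float number)
{
    const float threehalfs = 1.5F;
    float x2 = number * 0.5F;

    /* Compound literal: initialize the float member, read the integer member. */
    uint32_t i = (union { float f; uint32_t u; }) { number }.u;
    i = 0x5f3759df - (i >> 1);          /* initial guess from the bit pattern */

    /* Compound literal: initialize the integer member, read the float member. */
    float y = (union { uint32_t u; float f; }) { i }.f;

    y = y * (threehalfs - (x2 * y * y)); /* one Newton-Raphson iteration */
    return y;
}
```

Calling Q_rsqrt_union(4.0F) should give a value close to 0.5, within the approximation error of a single iteration.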

answered Oct 17 '22 15:10

Eric Postpischil

Yes, it breaks aliasing rules.

The cleanest fix for things like i = * ( long * ) &y; would be this:

  memcpy(&i, &y, sizeof(i)); // assuming sizeof(i) == sizeof(y)

It avoids issues with alignment and aliasing. And with optimization enabled, the call to memcpy() should normally be replaced with just a few instructions.

Like the other suggested methods, this approach does not fix any problems related to trap representations. On most platforms, however, integers have no trap representations, and if you know your floating-point format you can avoid its trap representations, if it has any.
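A sketch of the whole function with memcpy() in both directions could look like this. The name Q_rsqrt_memcpy is hypothetical, and uint32_t replaces long so that the size assumption in the comment above holds even where long is 64 bits wide; the static_assert (C11) turns that assumption into a compile-time check.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Assumption: float is exactly as wide as uint32_t on this implementation. */
static_assert(sizeof(float) == sizeof(uint32_t), "float must be 32 bits wide");

/* Hypothetical name; same algorithm as Q_rsqrt, with memcpy-based punning. */
float Q_rsqrt_memcpy(float number)
{
    const float threehalfs = 1.5F;
    float x2 = number * 0.5F;
    float y = number;
    uint32_t i;

    memcpy(&i, &y, sizeof i);            /* copy the float's bytes into i */
    i = 0x5f3759df - (i >> 1);           /* initial guess from the bit pattern */
    memcpy(&y, &i, sizeof y);            /* copy the bytes back into y */

    y = y * (threehalfs - (x2 * y * y)); /* one Newton-Raphson iteration */
    return y;
}
```

Because only bytes are copied, no object is ever accessed through an lvalue of an incompatible type, so the aliasing rule is not involved.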

answered Oct 17 '22 13:10

Alexey Frunze
