I am converting a bunch of code over to use C++-style casts (with the help of <code>-Wold-style-cast</code>). I'm not entirely sold on its use for primitive variables, but I'm new to C++-style casts in general. One issue occurs in some endian converting code. The current code looks like this: <pre class="prettyprint"><code>#define REINTERPRET_VARIABLE(VAR,TYPE) (*((TYPE*)(&VAR))) //... uint16_t reverse(uint16_t val) { /*stuff to reverse uint16_t*/ } int16_t reverse( int16_t val) { uint16_t temp = reverse(REINTERPRET_VARIABLE(val,uint16_t)); return REINTERPRET_VARIABLE(temp,int16_t); } </code></pre> Now, endianness doesn't care about signedness. Therefore, to reverse an <code>int16_t</code>, we can treat it exactly like a <code>uint16_t</code> for the purposes of the reversal. This suggests code like this: <pre class="prettyprint"><code> int16_t reverse( int16_t val) { return reinterpret_cast<int16_t>(reverse(reinterpret_cast<uint16_t>(val))); } </code></pre> However, as described in this and in particular this question, <code>reinterpret_cast</code> requires a reference or a pointer (unless it's casting to itself). This suggests: <pre class="prettyprint"><code> int16_t reverse( int16_t val) { return reinterpret_cast<int16_t&>(reverse(reinterpret_cast<uint16_t&>(val))); } </code></pre> This doesn't work because, as my compiler tells me, the outside cast wants an lvalue. To fix this, you'd need to do something like: <pre class="prettyprint"><code> int16_t reverse( int16_t val) { uint16_t temp = reverse(reinterpret_cast<uint16_t&>(val)); return reinterpret_cast<int16_t&>(temp); } </code></pre> This is not much different from the original code, and indeed the temporary variable exists for the same reason, but four questions were raised for me: <ol> <li>Why is a temporary even necessary for a <code>reinterpret_cast</code>? I can understand a dumb compiler's needing to have a temporary to support the pointer nastiness of <code>REINTERPRET_VARIABLE</code>, but <code>reinterpret_cast</code> is supposed to just reinterpret bits. Is this clashing with RVO or something?</li> <li>Will requiring that temporary incur a performance penalty, or is it likely that the compiler can figure out that the temporary really should just be the return value?</li> <li>The second <code>reinterpret_cast</code> looks like it's returning a reference. Since the function return value isn't a reference, I'm pretty sure this is okay; the return value will be a copy, not a reference. However, I would still like to know what casting to a reference really even means? It is appropriate in this case, right?</li> <li>Are there any other performance implications I should be aware of? I'd guess that <code>reinterpret_cast</code> would be, if anything, faster since the compiler doesn't need to figure out that the bits should be reinterpreted--I just tell it that they should?</li> </ol>

<ol> <li> <code>temp</code> is required because the <code>&</code> (address-of) operator is applied to it on the next line. This operator requires an lvalue (the object to take the address of). </li> <li> I'd expect the compiler to optimize it out. </li> <li> <code>reinterpret_cast<T&>(x)</code> is the same as <code>* reinterpret_cast<T *>(&x)</code>, it is an lvalue designating the same memory location as <code>x</code> occupies. Note that the type of an expression is never a reference; but the result of casting to <code>T&</code>, or of using the <code>*</code> operator is an lvalue. </li> <li> I wouldn't expect any performance issues. </li> </ol> There are no strict aliasing problems with this particular piece of code, because it is allowed to alias an integer type as the signed or unsigned variation of the same type. But you suggest the codebase is full of reinterpret casts, so you should keep your eye out for strict aliasing violations elsewhere, perhaps compile with <code>-fno-strict-aliasing</code> until it is sorted out.

Since no one has answered this with language-lawyery facts in two years, I'll answer it instead with my educated guesses. <ol> <li>Who knows. But it's apparently necessary, as you've surmised. To avoid issues with strict aliasing, it would be safest to use <code>memcpy</code>, which will be optimized correctly by any compiler.</li> <li> The answer to any such question is always to profile it and to check the disassembly. In the example you gave, e.g. GCC will optimize it to: <pre class="prettyprint"><code>reverse(short): mov eax, edi rol ax, 8 ret </code></pre> Which looks pretty optimal (the <code>mov</code> is for copying from the input register; if you inline your function and use it, you'll see it is absent entirely). </li> <li>This is a language lawyer question. Probably has some useful semantic meaning. Don't worry about it. You haven't written code like this since.</li> <li>Again, profile. Maybe reinterpret casting gets in the way of certain optimizations. You should follow the same guidelines as you would for strict aliasing, mentioned above.</li> </ol>

reinterpret_cast rvalue and optimization

Tags:

c++

casting

I am converting a bunch of code over to use C++-style casts (with the help of -Wold-style-cast). I'm not entirely sold on its use for primitive variables, but I'm new to C++-style casts in general.

One issue occurs in some endian converting code. The current code looks like this:

#define REINTERPRET_VARIABLE(VAR,TYPE) (*((TYPE*)(&VAR)))

//...

uint16_t reverse(uint16_t val) { /*stuff to reverse uint16_t*/ }
 int16_t reverse( int16_t val) {
    uint16_t temp = reverse(REINTERPRET_VARIABLE(val,uint16_t));
    return REINTERPRET_VARIABLE(temp,int16_t);
}

Now, endianness doesn't care about signedness. Therefore, to reverse an int16_t, we can treat it exactly like a uint16_t for the purposes of the reversal. This suggests code like this:

 int16_t reverse( int16_t val) {
    return reinterpret_cast<int16_t>(reverse(reinterpret_cast<uint16_t>(val)));
}

However, as described in this and in particular this question, reinterpret_cast requires a reference or a pointer (unless it's casting to itself). This suggests:

 int16_t reverse( int16_t val) {
    return reinterpret_cast<int16_t&>(reverse(reinterpret_cast<uint16_t&>(val)));
}

This doesn't work because, as my compiler tells me, the outside cast wants an lvalue. To fix this, you'd need to do something like:

 int16_t reverse( int16_t val) {
    uint16_t temp = reverse(reinterpret_cast<uint16_t&>(val));
    return reinterpret_cast<int16_t&>(temp);
}

This is not much different from the original code, and indeed the temporary variable exists for the same reason, but four questions were raised for me:

Why is a temporary even necessary for a reinterpret_cast? I can understand a dumb compiler's needing to have a temporary to support the pointer nastiness of REINTERPRET_VARIABLE, but reinterpret_cast is supposed to just reinterpret bits. Is this clashing with RVO or something?
Will requiring that temporary incur a performance penalty, or is it likely that the compiler can figure out that the temporary really should just be the return value?
The second reinterpret_cast looks like it's returning a reference. Since the function return value isn't a reference, I'm pretty sure this is okay; the return value will be a copy, not a reference. However, I would still like to know what casting to a reference really even means? It is appropriate in this case, right?
Are there any other performance implications I should be aware of? I'd guess that reinterpret_cast would be, if anything, faster since the compiler doesn't need to figure out that the bits should be reinterpreted--I just tell it that they should?

390

asked Sep 05 '14 18:09

imallett

2 Answers

temp is required because the & (address-of) operator is applied to it on the next line. This operator requires an lvalue (the object to take the address of).
I'd expect the compiler to optimize it out.
reinterpret_cast<T&>(x) is the same as * reinterpret_cast<T *>(&x), it is an lvalue designating the same memory location as x occupies. Note that the type of an expression is never a reference; but the result of casting to T&, or of using the * operator is an lvalue.
I wouldn't expect any performance issues.

There are no strict aliasing problems with this particular piece of code, because it is allowed to alias an integer type as the signed or unsigned variation of the same type. But you suggest the codebase is full of reinterpret casts, so you should keep your eye out for strict aliasing violations elsewhere, perhaps compile with -fno-strict-aliasing until it is sorted out.

161

answered Sep 29 '22 06:09

M.M

Since no one has answered this with language-lawyery facts in two years, I'll answer it instead with my educated guesses.

Who knows. But it's apparently necessary, as you've surmised. To avoid issues with strict aliasing, it would be safest to use memcpy, which will be optimized correctly by any compiler.
The answer to any such question is always to profile it and to check the disassembly. In the example you gave, e.g. GCC will optimize it to:
```
reverse(short):
    mov     eax, edi
    rol     ax, 8
    ret
```
Which looks pretty optimal (the mov is for copying from the input register; if you inline your function and use it, you'll see it is absent entirely).
This is a language lawyer question. Probably has some useful semantic meaning. Don't worry about it. You haven't written code like this since.
Again, profile. Maybe reinterpret casting gets in the way of certain optimizations. You should follow the same guidelines as you would for strict aliasing, mentioned above.

answered Sep 29 '22 06:09

imallett

Related questions
                            
                                How to swap two parameters of a variadic template at compile time?
                            
                                Why does memory_order_relaxed use atomic (lock-prefixed) instructions on x86?
                            
                                aliasing a variadic template function
                            
                                "could not convert template argument" error for pointer parameters even with cast
                            
                                What are "sequence point"/"sequenced-before" rules in Rust?
                            
                                C++11 Kill threads when main() returns?
                            
                                Definition of large integer value
                            
                                GStreamer pipeline in C++
                            
                                Unresolved external symbol when linking to static lib with namespace
                            
                                Calling subclass methods from superclass in a vector C++
                            
                                Should a function go in one/both of two classes or be free-standing?
                            
                                Why is a templated user-defined conversion operator able to determine its return type?
                            
                                Getting class and member type from pointer to member variable
                            
                                Is explicit alignment necessary?
                            
                                RGB to greyscale conversion using CUDA
                            
                                Is there good substitution for interface in has-a relationship in c++
                            
                                operator << interprets arithmetic operation if the result is unsigned int or unsigned short
                            
                                Create Win32 window without WinMain function [duplicate]
                            
                                Understanding signed vs unsigned comparison [duplicate]
                            
                                How to use SFINAE to restrict overload to input iterators

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

reinterpret_cast rvalue and optimization

Tags:

c++

casting

imallett

People also ask

2 Answers

M.M

imallett

Recent Activity

Donate For Us