
Bitwise '&' with signed vs unsigned operand

I faced an interesting scenario in which I got different results depending on the right operand type, and I can't really understand the reason for it.

Here is the minimal code:

#include <iostream>
#include <cstdint>

int main() {
    uint16_t check = 0x8123U;

    uint64_t new_check = (check & 0xFFFF) << 16;
    std::cout << std::hex << new_check << std::endl;

    new_check = (check & 0xFFFFU) << 16;
    std::cout << std::hex << new_check << std::endl;

    return 0;
}

I compiled this code with g++ (gcc version 4.5.2) on 64-bit Linux: g++ -std=c++0x -Wall example.cpp -o example

The output was:

ffffffff81230000

81230000

I can't really understand the reason for the output in the first case.

Why would any of the intermediate results be promoted to a signed 64-bit value (int64_t) at some point, resulting in the sign extension?

I would accept a result of '0' in both cases if a 16-bit value were shifted 16 bits left first and then promoted to a 64-bit value. I would also accept the second output if the compiler first promoted check to uint64_t and then performed the other operations.

But how come & with 0xFFFF (int32_t) vs. 0xFFFFU (uint32_t) would result in those two different outputs?

asked Aug 03 '16 by Alex Lop.


1 Answer

That's indeed an interesting corner case. It only occurs here because you use uint16_t for the unsigned type on an architecture that uses 32 bits for int.

Here is an extract from Clause 5, Expressions, of draft n4296 for C++14 (emphasis mine):

10 Many binary operators that expect operands of arithmetic or enumeration type cause conversions ... This pattern is called the usual arithmetic conversions, which are defined as follows:
...
(10.5.3) — Otherwise, if the operand that has unsigned integer type has rank greater than or equal to the rank of the type of the other operand, the operand with signed integer type shall be converted to the type of the operand with unsigned integer type.
(10.5.4) — Otherwise, if the type of the operand with signed integer type can represent all of the values of the type of the operand with unsigned integer type, the operand with unsigned integer type shall be converted to the type of the operand with signed integer type.

You are in the 10.5.4 case:

  • uint16_t is only 16 bits while int is 32
  • int can represent all the values of uint16_t

So the uint16_t check = 0x8123U operand is converted to the signed int 0x8123, and the result of the bitwise & is still 0x8123.
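
A minimal sketch of those conversions, checkable at compile time (assuming a platform where int is 32 bits, as in the question):

#include <cstdint>
#include <type_traits>

int main() {
    uint16_t check = 0x8123U;
    // uint16_t has lower rank than int, so check is promoted to (signed) int;
    // with the signed literal 0xFFFF the whole expression stays int (case 10.5.4).
    static_assert(std::is_same<decltype(check & 0xFFFF), int>::value, "");
    // With the unsigned literal 0xFFFFU, the promoted int operand is instead
    // converted to unsigned int (case 10.5.3).
    static_assert(std::is_same<decltype(check & 0xFFFFU), unsigned int>::value, "");
    return 0;
}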

But the shift (a bitwise operation, so it happens at the representation level) produces the intermediate unsigned value 0x81230000, which, converted back to an int, gives a negative value (technically that conversion is implementation-defined, but this behavior is the common one):

5.8 Shift operators [expr.shift]
...
Otherwise, if E1 has a signed type and non-negative value, and E1 × 2^E2 is representable in the corresponding unsigned type of the result type, then that value, converted to the result type, is the resulting value;...

and

4.7 Integral conversions [conv.integral]
...
3 If the destination type is signed, the value is unchanged if it can be represented in the destination type; otherwise, the value is implementation-defined.

(beware this was true undefined behaviour in C++11...)
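
That step can be sketched in isolation (hedged: the negative result below is what common two's-complement implementations produce; the standard only says the conversion is implementation-defined in C++14):

#include <cstdint>
#include <iostream>

int main() {
    uint16_t check = 0x8123U;
    int shifted = (check & 0xFFFF) << 16;  // int arithmetic; bit pattern 0x81230000
    std::cout << std::boolalpha << (shifted < 0) << std::endl;  // true on common platforms
    return 0;
}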

So you end up with a conversion of the signed int 0x81230000 (a negative value) to a uint64_t, which as expected gives 0xFFFFFFFF81230000, because

4.7 Integral conversions [conv.integral]
...
2 If the destination type is unsigned, the resulting value is the least unsigned integer congruent to the source integer (modulo 2^n where n is the number of bits used to represent the unsigned type).
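
The modular conversion can be seen directly by reusing the values from the question (the shift itself is implementation-defined, as discussed above):

#include <cstdint>
#include <iostream>

int main() {
    uint16_t check = 0x8123U;
    int shifted = (check & 0xFFFF) << 16;  // negative on common platforms
    uint64_t widened = shifted;            // least uint64_t congruent modulo 2^64
    std::cout << std::hex << widened << std::endl;  // prints ffffffff81230000
    return 0;
}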

TL/DR: There is no undefined behaviour here; what causes the result is the conversion of a signed 32-bit int to an unsigned 64-bit int. The only questionable part is the shift that overflows into the sign bit, but all common implementations agree on it, and it is implementation-defined in the C++14 standard.

Of course, if you force the second operand to be unsigned, everything stays unsigned and you evidently get the expected 0x81230000 result.
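
Two equivalent fixes, sketched below; the cast-first variant is my own addition, not from the question, but it follows the same conversion rules:

#include <cstdint>
#include <iostream>

int main() {
    uint16_t check = 0x8123U;
    // Force the mask to be unsigned, so & and << happen in unsigned int:
    uint64_t a = (check & 0xFFFFU) << 16;
    // Or widen first, so everything happens in uint64_t arithmetic:
    uint64_t b = (static_cast<uint64_t>(check) & 0xFFFF) << 16;
    std::cout << std::hex << a << ' ' << b << std::endl;  // 81230000 81230000
    return 0;
}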

[EDIT] As explained by MSalters, the result of the shift has only been implementation-defined since C++14; it was indeed undefined behaviour in C++11. The shift operator paragraph read:

...
Otherwise, if E1 has a signed type and non-negative value, and E1 × 2^E2 is representable in the result type, then that is the resulting value; otherwise, the behavior is undefined.

answered Sep 29 '22 by Serge Ballesta