In the current C++ Standard Draft, the left shift operator is defined as follows [expr.shift]: <blockquote> The value of <code>E1 << E2</code> is the unique value congruent to <code>E1×2^E2</code> modulo <code>2^N</code>, where <code>N</code> is the width of the type of the result. </blockquote> Consider <code>int E1 = 2^31-1 = 2'147'483'647</code>, <code>E2 = 1</code>, and <code>int</code> having 32 bits. Then there is an infinite number of numbers congruent to <code>E1×2^E2 = 4'294'967'294</code> modulo <code>2^N = 2^32</code>, namely, all the numbers <code>4'294'967'294 + k×2^32</code> where <code>k</code> is an arbitrary integer. Examples are <code>4'294'967'294</code> (<code>k=0</code>) or <code>-2</code> (<code>k=-1</code>). I don't understand what the Standard means by the unique value out of these numbers. Does it mean the unique value that can be represented by the resulting data type? Then, I suppose the result is defined as <code>-2</code>. Is this interpretation correct? Until C++20, the definition was different and this case would cause undefined behavior. I suppose the change is related to the mandatory 2's-complement representation of negative signed integers. In fact, there is now no more requirement for <code>E1</code> to be non-negative. It therefore seems that <code>-1 << 1</code> is defined as <code>-2</code>. Is that right as well?

<blockquote> Does it mean the unique value that can be represented by the resulting data type </blockquote> Yes. The set of numbers congruent to <code>E1×2^E2</code> modulo <code>2^N</code> is infinite, but there is only one value in any interval of size <code>2^N</code>, therefore there is only one value representable in an integer type of width <code>N</code>. If we look in the "p0907R1 Signed Integers are Two’s Complement" proposal we find a similar phrase with "unique representation" which makes this more clear: <blockquote> Conversion from signed to unsigned is always well-defined: the result is the unique value of the destination type that is congruent to the source integer modulo 2N. </blockquote> <blockquote> Then, I suppose the result is defined as <code>-2</code>. Is this interpretation correct? </blockquote> Yes On x64 the equivalent asm instruction is <code>shlx</code> (logical shift left) <blockquote> I suppose the change is related to the mandatory 2-complement representation of negative signed integers. </blockquote> Correct. As was the case with unsigned types, now also signed types they mathematically represent equivalence classes (well, it's not clear to me how much this is true as it looks like they want to still keep some UB cases on overflow).

Does C++20 well-define left shift for signed integers that "overflow"?

Q: What happens when a signed integer overflows?

In contrast, the C standard says that signed integer overflow leads to undefined behavior where a program can do anything, including dumping core or overrunning a buffer. The misbehavior can even precede the overflow. Such an overflow can occur during addition, subtraction, multiplication, division, and left shift.

Q: How do you overcome signed integer overflow?

Use 64-bits integers. One very good way to prevent integer overflows is to use int64_t to implement integers. In most case, 64-bits ints will not commit overflow, unlike their 32-bits counterparts. There is actually very few downsides in using int64_t instead of int32_t .

Q: Which left shift operation works on signed numbers?

For signed numbers, the sign bit is used to fill the vacated bit positions. In other words, if the number is positive, 0 is used, and if the number is negative, 1 is used.

Q: What happens when int overflows C++?

An integer overflow can cause the value to wrap and become negative, which violates the program's assumption and may lead to unexpected behavior (for example, 8-bit integer addition of 127 + 1 results in −128, a two's complement of 128).

Tags:

c++

language-lawyer

bit-shift

c++20

In the current C++ Standard Draft, the left shift operator is defined as follows [expr.shift]:

The value of E1 << E2 is the unique value congruent to E1×2^E2 modulo 2^N, where N is the width of the type of the result.

Consider int E1 = 2^31-1 = 2'147'483'647, E2 = 1, and int having 32 bits. Then there is an infinite number of numbers congruent to E1×2^E2 = 4'294'967'294 modulo 2^N = 2^32, namely, all the numbers 4'294'967'294 + k×2^32 where k is an arbitrary integer. Examples are 4'294'967'294 (k=0) or -2 (k=-1).

I don't understand what the Standard means by the unique value out of these numbers. Does it mean the unique value that can be represented by the resulting data type? Then, I suppose the result is defined as -2. Is this interpretation correct?

Until C++20, the definition was different and this case would cause undefined behavior. I suppose the change is related to the mandatory 2's-complement representation of negative signed integers.

In fact, there is now no more requirement for E1 to be non-negative. It therefore seems that -1 << 1 is defined as -2. Is that right as well?

776

asked Apr 02 '19 07:04

Daniel Langr

1 Answers

Does it mean the unique value that can be represented by the resulting data type

Yes. The set of numbers congruent to E1×2^E2 modulo 2^N is infinite, but there is only one value in any interval of size 2^N, therefore there is only one value representable in an integer type of width N.

If we look in the "p0907R1 Signed Integers are Two’s Complement" proposal we find a similar phrase with "unique representation" which makes this more clear:

Conversion from signed to unsigned is always well-defined: the result is the unique value of the destination type that is congruent to the source integer modulo 2^N.

Then, I suppose the result is defined as -2. Is this interpretation correct?

Yes

On x64 the equivalent asm instruction is shlx (logical shift left)

I suppose the change is related to the mandatory 2-complement representation of negative signed integers.

Correct. As was the case with unsigned types, now also signed types they mathematically represent equivalence classes (well, it's not clear to me how much this is true as it looks like they want to still keep some UB cases on overflow).

148

answered Sep 28 '22 05:09

bolov

Related questions
                            
                                Why don't complex-number literals work in clang?
                            
                                Is there a way to forward argument to inner constexpr function?
                            
                                Returning member unique_ptr from class method
                            
                                What is the rationale for self-assignment-unsafe move assignment operators in the standard library?
                            
                                Why can't lambda, when cast to function pointer, be used in constexpr context?
                            
                                Does casting a char array to another type violate strict-aliasing rules?
                            
                                Inserting into vector by reference to element of same vector
                            
                                Is there any reason C++ 11+ std::mutex should be declared as a global variable instead of passed into a std::thread as a function parameter?
                            
                                Under what conditions is the pure virtual method stub generated?
                            
                                BracketAlignmentStyle: Break before closing parenthesis
                            
                                Can template parameter deduction be used in class data members?
                            
                                How do I make a C++ (shared) library compatible with clang and GCC?
                            
                                Are cv-qualifiers allowed on decltype(auto) variables?
                            
                                Can user defined numeric literals be immediately followed by a dot? [duplicate]
                            
                                gcc over-aligned new support (alignas )
                            
                                C++ static factory method vs constructor: how to avoid copying?
                            
                                Why is this constructor not giving an incomplete type error?
                            
                                std::vector::push_back() doesn't compile on MSVC for an object with deleted move constructor
                            
                                Binding failure with objcopy --redefine-syms
                            
                                How to do function overloading with std::shared_ptr<void> and another type of std::shared_ptr?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With