I think the question is self-explanatory. I guess it has something to do with overflow, but I still do not quite get it. What is happening, bitwise, under the hood? Why does -(-2147483648) = -2147483648 (at least when compiled as C)?


Why is -(-2147483648) = - 2147483648 in a 32-bit machine?

2 Answers

Note: this answer does not apply as such to the obsolete ISO C90 standard, which is still used by many compilers.

First of all, in C99 and C11, the expression -(-2147483648) == -2147483648 is in fact false:

int is_it_true = (-(-2147483648) == -2147483648);
printf("%d\n", is_it_true);

prints

0

So how is it possible that this evaluates to true? The machine is using 32-bit two's complement integers. 2147483648 is an integer constant that doesn't quite fit in 32 bits, so its type will be either long int or long long int, whichever is the first in which it fits. Negating it results in -2147483648. Again, even though the number -2147483648 can fit in a 32-bit integer, the expression -2147483648 consists of a >32-bit positive integer preceded by unary -!

You can try the following program:

#include <stdio.h>

int main(void)
{
    printf("%zu\n", sizeof(2147483647));
    printf("%zu\n", sizeof(2147483648));
    printf("%zu\n", sizeof(-2147483648));
}

The output on such a machine would most probably be 4, 8 and 8.

Now, -2147483648 negated will again result in +2147483648, which is still of type long int or long long int, and everything is fine.

In C99 and C11, the integer constant expression -(-2147483648) is well-defined on all conforming implementations.

Now, when this value is assigned to a variable of type int, with 32 bits and two's complement representation, the value is not representable in it: a 32-bit two's complement int ranges from -2147483648 to 2147483647.

The C11 standard 6.3.1.3p3 says the following of integer conversions:

[When] the new type is signed and the value cannot be represented in it; either the result is implementation-defined or an implementation-defined signal is raised.

That is, the C standard doesn't actually define what the value would be in this case, and it doesn't preclude the possibility that execution of the program stops due to a signal being raised. Instead, it leaves it to the implementations (i.e. compilers) to decide how to handle it (C11 3.4.1):

implementation-defined behavior

unspecified behavior where each implementation documents how the choice is made

and (3.19.1):

implementation-defined value

unspecified value where each implementation documents how the choice is made

In your case, the implementation-defined behaviour is that the value is the 32 lowest-order bits [*]. The (long) long int value 2147483648 is 0x80000000, which has bit 31 set and all other bits cleared. In a 32-bit two's complement integer, bit 31 is the sign bit, meaning that the number is negative; with all value bits zero, the value is the minimum representable number, i.e. INT_MIN.

[*] GCC documents its implementation-defined behaviour in this case as follows:

The result of, or the signal raised by, converting an integer to a signed integer type when the value cannot be represented in an object of that type (C90 6.2.1.2, C99 and C11 6.3.1.3).

For conversion to a type of width N, the value is reduced modulo 2^N to be within range of the type; no signal is raised.

answered Oct 10 '22 06:10

Antti Haapala -- Слава Україні

Negating an (unsuffixed) integer constant:

The expression -(-2147483648) is perfectly well-defined in C; however, it may not be obvious why.

When you write -2147483648, it is parsed as the unary minus operator applied to an integer constant. If 2147483648 can't be represented as int, then it is represented as long or long long* (whichever fits first), where the latter type is guaranteed by the C Standard to cover that value†.

You can confirm this with:

printf("%zu\n", sizeof(-2147483648));

which yields 8 on my machine.

The next step is to apply the second - operator, in which case the final value is 2147483648L (assuming that the constant was represented as long). If you try to assign it to an int object, as follows:

int n = -(-2147483648);

then the actual behavior is implementation-defined. Referring to the Standard:

C11 §6.3.1.3/3 Signed and unsigned integers

Otherwise, the new type is signed and the value cannot be represented in it; either the result is implementation-defined or an implementation-defined signal is raised.

The most common approach is to simply cut off the higher bits. For instance, GCC documents it as:

For conversion to a type of width N, the value is reduced modulo 2^N to be within range of the type; no signal is raised.

Conceptually, the conversion to a type of width 32 can be illustrated by a bitwise AND operation (here 2^32 denotes exponentiation, not C's XOR operator):

value & (2^32 - 1) // preserve 32 least significant bits

In two's complement arithmetic, the resulting value of n has all bits zero except the MSB (sign bit), which is set. That pattern represents -2^31, that is, -2147483648.

Negating an int object:

If you try to negate an int object that holds the value -2147483648, then, assuming a two's complement machine, the program will exhibit undefined behavior:

n = -n; // UB if n == INT_MIN and INT_MAX == 2147483647

C11 §6.5/5 Expressions

If an exceptional condition occurs during the evaluation of an expression (that is, if the result is not mathematically defined or not in the range of representable values for its type), the behavior is undefined.

Additional references:

INT32-C. Ensure that operations on signed integers do not result in overflow

*) In the withdrawn C90 Standard, there was no long long type and the rules were different. Specifically, the sequence for an unsuffixed decimal constant was int, long int, unsigned long int (C90 §6.1.3.2 Integer constants).

†) This is due to LLONG_MAX, which must be at least +9223372036854775807 (C11 §5.2.4.2.1/1).

answered Oct 10 '22 06:10

Grzegorz Szpetkowski
