I was reading a chapter on bitwise operators, I came across 1's complement operator program and decided to run it on Visual C++. <pre class="prettyprint"><code>int main () { unsigned char c = 4, d; d = ~c; printf("%d\n", d); } </code></pre> It gives the valid output: <code>251</code> Then instead of using <code>d</code> as a variable to hold the value of <code>~c</code>, I decided to directly print the value of <code>~c</code>. <pre class="prettyprint"><code>int main () { unsigned char c=4; printf("%d\n", ~c); } </code></pre> It gives the output <code>-5</code>. Why didn't it work?

In this statement: <pre class="prettyprint"><code>printf("%d",~c); </code></pre> the <code>c</code> is converted to <code>int</code>1 type before <code>~</code> (bitwise complement) operator is applied. This is because of integer promotions, that are invoked to operand of the <code>~</code>. In this case an object of <code>unsigned char</code> type is promoted to (signed) <code>int</code>, which is then (after <code>~</code> operator evaluation) used by <code>printf</code> function, with matching <code>%d</code> format specifier. Notice that default argument promotions (as <code>printf</code> is a variadic function) does not play any role here, as object is already of type <code>int</code>. On the other hand, in this code: <pre class="prettyprint"><code>unsigned char c = 4, d; d = ~c; printf("%d", d); </code></pre> the following steps occur: <ul> <li> <code>c</code> is a subject to integer promotions because of <code>~</code> (in the same way, as described above)</li> <li> <code>~c</code> rvalue is evaluated as (signed) <code>int</code> value (e.g. <code>-5</code>)</li> <li> <code>d=~c</code> makes an implicit conversion from <code>int</code> to <code>unsigned char</code>, as <code>d</code> has such type. You may think of it as the same as <code>d = (unsigned char) ~c</code>. Notice that <code>d</code> cannot be negative (this is general rule for all unsigned types).</li> <li> <code>printf("%d", d);</code> invokes default argument promotions, thus <code>d</code> is converted to <code>int</code> and the (nonnegative) value is preserved (i.e. the <code>int</code> type can represent all values of <code>unsigned char</code> type).</li> </ul> <hr> 1) assuming that <code>int</code> can represent all values of the <code>unsigned char</code> (see T.C.'s comment below), but it is very likely to happen in this way. More specifically, we assume that <code>INT_MAX >= UCHAR_MAX</code> holds. Typically the <code>sizeof(int) > sizeof(unsigned char)</code> holds and byte consist of eight bits. Otherwise the <code>c</code> would be converted to <code>unsigned int</code> (as by C11 subclause §6.3.1.1/p2), and the format specifier should be also changed accordingly to <code>%u</code> in order to avoid getting an UB (C11 §7.21.6.1/p9).

<code>char</code> is promoted to <code>int</code> in <code>printf</code> statement before the operation <code>~</code> in second snippet. So <code>c</code>, which is <pre class="prettyprint"><code>0000 0100 (2's complement) </code></pre> in binary is promoted to (assuming 32-bit machine) <pre class="prettyprint"><code>0000 0000 0000 0000 0000 0000 0000 0100 // Say it is x </code></pre> and its bit-wise complement is equal to the two's complement of the value minus one (<code>~x = −x − 1</code>) <pre class="prettyprint"><code>1111 1111 1111 1111 1111 1111 1111 1011 </code></pre> which is <code>-5</code> in decimal in 2's complement form. Note that the default promotion of <code>char</code> <code>c</code> to <code>int</code> is also performed in <pre class="prettyprint"><code>d = ~c; </code></pre> before complement operation but the result is converted back to <code>unsigned char</code> as <code>d</code> is of type <code>unsigned char</code>. <h3>C11: 6.5.16.1 Simple assignment (p2):</h3> <blockquote> In simple assignment (<code>=</code>), the value of the right operand is converted to the type of the assignment expression and replaces the value stored in the object designated by the left operand. </blockquote> and <h3>6.5.16 (p3):</h3> <blockquote> The type of an assignment expression is the type the left operand would have after lvalue conversion. </blockquote>

To understand behavior of your code, you need to learn the concept called 'Integer Promotions' (that happens in your code implicitly before bit wise NOT operation on an <code>unsigned char</code> operand) As mentioned in N1570 committee draft: <blockquote> <h3> § 6.5.3.3 Unary arithmetic operators</h3> <ol start="4"> <li>The result of the <code>~</code> operator is the bitwise complement of its (promoted) operand (that is, each bit in the result is set if and only if the corresponding bit in the converted operand is not set). The integer promotions are performed on the operand, and the result has the promoted type. If the promoted type is an " 'unsigned type', the expression <code>~E</code> is equivalent to the maximum value representable in that type minus <code>E</code>".</li> </ol> </blockquote> Because <code>unsigned char</code> type is narrower than (as it requires fewer bytes) <code>int</code> type, - implicit type promotion performed by abstract machine(compiler) and value of variable <code>c</code> is promoted to <code>int</code> at the time of compilation (before application of the complement operation <code>~</code>). It is required for the correct execution of the program because <code>~</code> need an integer operand. <blockquote> <h3>§ 6.5 Expressions</h3> <ol start="4"> <li> Some operators (the unary operator <code>~</code>, and the binary operators <code><<</code>, <code>>></code>, <code>&</code>, <code>^</code>, and <code>|</code>, collectively described as bitwise operators) are required to have operands that have integer type. These operators yield values that depend on the internal representations of integers, and have implementation-deﬁned and undeﬁned aspects for signed types.</li> </ol> </blockquote> Compilers are smart-enough to analyze expressions, checks semantics of expressions, perform type checking and arithmetic conversions if required. That's the reason that to apply <code>~</code> on <code>char</code> type we don't need to explicitly write <code>~(int)c</code> — called explicit type casting (and do avoid errors). Note: <ol> <li>Value of <code>c</code> is promoted to <code>int</code> in expression <code>~c</code>, but type of <code>c</code> is still <code>unsigned char</code> - its type does not. Don't be confused. </li> <li> Important: result of <code>~</code> operation is of <code>int</code> type!, check below code (I don't have vs-compiler, I am using gcc): <pre class="prettyprint"><code>#include<stdio.h> #include<stdlib.h> int main(void){ unsigned char c = 4; printf(" sizeof(int) = %zu,\n sizeof(unsigned char) = %zu", sizeof(int), sizeof(unsigned char)); printf("\n sizeof(~c) = %zu", sizeof(~c)); printf("\n"); return EXIT_SUCCESS; } </code></pre> compile it, and run: <pre class="prettyprint"><code>$ gcc -std=gnu99 -Wall -pedantic x.c -o x $ ./x sizeof(int) = 4, sizeof(unsigned char) = 1 sizeof(~c) = 4 </code></pre> Notice: size of result of <code>~c</code> is same as of <code>int</code>, but not equals to <code>unsigned char</code> — result of <code>~</code> operator in this expression is <code>int</code>! that as mentioned 6.5.3.3 Unary arithmetic operators <blockquote> <ol start="3"> <li>The result of the unary <code>-</code> operator is the negative of its (promoted) operand. The integer promotions are performed on the operand, and the result has the promoted type. </li> </ol> </blockquote> </li> </ol> Now, as @haccks also explained in his answer -that result of <code>~c</code> on 32-bit machine and for value of <code>c = 4</code> is: <pre class="prettyprint"><code>1111 1111 1111 1111 1111 1111 1111 1011 </code></pre> in decimal it is <code>-5</code> — that is the output of your second code! In your first code, one more line is interesting to understand <code>b = ~c;</code>, because <code>b</code> is an <code>unsigned char</code> variable and result of <code>~c</code> is of <code>int</code> type, so to accommodate value of result of <code>~c</code> to <code>b</code> result value (~c) is truncated to fit into the unsigned char type as follows: <pre class="prettyprint"><code> 1111 1111 1111 1111 1111 1111 1111 1011 // -5 & 0xFF & 0000 0000 0000 0000 0000 0000 1111 1111 // - one byte ------------------------------------------- 1111 1011 </code></pre> Decimal equivalent of <code>1111 1011</code> is <code>251</code>. You could get same effect using: <pre class="prettyprint"><code>printf("\n ~c = %d", ~c & 0xFF); </code></pre> or as suggested by @ouah in his answer using explicitly casting.

Why does the complement behave differently through printf?

Tags:

c

variables

types

bitwise-operators

unsigned-char

I was reading a chapter on bitwise operators, I came across 1's complement operator program and decided to run it on Visual C++.

int main ()
{
   unsigned char c = 4, d;
   d = ~c;
   printf("%d\n", d);
}

It gives the valid output: 251

Then instead of using d as a variable to hold the value of ~c, I decided to directly print the value of ~c.

int main ()
{
   unsigned char c=4;
   printf("%d\n", ~c);
}

It gives the output -5.

Why didn't it work?

439

asked Feb 17 '15 10:02

Sanketssj5

3 Answers

In this statement:

printf("%d",~c);

the c is converted to int¹ type before ~ (bitwise complement) operator is applied. This is because of integer promotions, that are invoked to operand of the ~. In this case an object of unsigned char type is promoted to (signed) int, which is then (after ~ operator evaluation) used by printf function, with matching %d format specifier.

Notice that default argument promotions (as printf is a variadic function) does not play any role here, as object is already of type int.

On the other hand, in this code:

unsigned char c = 4, d;
d = ~c;
printf("%d", d);

the following steps occur:

c is a subject to integer promotions because of ~ (in the same way, as described above)
~c rvalue is evaluated as (signed) int value (e.g. -5)
d=~c makes an implicit conversion from int to unsigned char, as d has such type. You may think of it as the same as d = (unsigned char) ~c. Notice that d cannot be negative (this is general rule for all unsigned types).
printf("%d", d); invokes default argument promotions, thus d is converted to int and the (nonnegative) value is preserved (i.e. the int type can represent all values of unsigned char type).

^{1) assuming that int can represent all values of the unsigned char (see T.C.'s comment below), but it is very likely to happen in this way. More specifically, we assume that INT_MAX >= UCHAR_MAX holds. Typically the sizeof(int) > sizeof(unsigned char) holds and byte consist of eight bits. Otherwise the c would be converted to unsigned int (as by C11 subclause §6.3.1.1/p2), and the format specifier should be also changed accordingly to %u in order to avoid getting an UB (C11 §7.21.6.1/p9).}

123

answered Nov 10 '22 08:11

Grzegorz Szpetkowski

char is promoted to int in printf statement before the operation ~ in second snippet. So c, which is

0000 0100 (2's complement)

in binary is promoted to (assuming 32-bit machine)

0000 0000 0000 0000 0000 0000 0000 0100 // Say it is x

and its bit-wise complement is equal to the two's complement of the value minus one (~x = −x − 1)

1111 1111 1111 1111 1111 1111 1111 1011

which is -5 in decimal in 2's complement form.

Note that the default promotion of char c to int is also performed in

d = ~c;

before complement operation but the result is converted back to unsigned char as d is of type unsigned char.

C11: 6.5.16.1 Simple assignment (p2):

In simple assignment (=), the value of the right operand is converted to the type of the assignment expression and replaces the value stored in the object designated by the left operand.

and

6.5.16 (p3):

The type of an assignment expression is the type the left operand would have after lvalue conversion.

answered Nov 10 '22 09:11

haccks

To understand behavior of your code, you need to learn the concept called 'Integer Promotions' (that happens in your code implicitly before bit wise NOT operation on an unsigned char operand) As mentioned in N1570 committee draft:

§ 6.5.3.3 Unary arithmetic operators

The result of the ~ operator is the bitwise complement of its (promoted) operand (that is, each bit in the result is set if and only if the corresponding bit in the converted operand is not set). The integer promotions are performed on the operand, and the result has the promoted type. If the promoted type is an " 'unsigned type', the expression ~E is equivalent to the maximum value representable in that type minus E".

Because unsigned char type is narrower than (as it requires fewer bytes) int type, - implicit type promotion performed by abstract machine(compiler) and value of variable c is promoted to int at the time of compilation (before application of the complement operation ~). It is required for the correct execution of the program because ~ need an integer operand.

§ 6.5 Expressions

Some operators (the unary operator ~, and the binary operators <<, >>, &, ^, and |, collectively described as bitwise operators) are required to have operands that have integer type. These operators yield values that depend on the internal representations of integers, and have implementation-deﬁned and undeﬁned aspects for signed types.

Compilers are smart-enough to analyze expressions, checks semantics of expressions, perform type checking and arithmetic conversions if required. That's the reason that to apply ~ on char type we don't need to explicitly write ~(int)c — called explicit type casting (and do avoid errors).

Note:

Value of c is promoted to int in expression ~c, but type of c is still unsigned char - its type does not. Don't be confused.
Important: result of ~ operation is of int type!, check below code (I don't have vs-compiler, I am using gcc):
```
#include<stdio.h>
#include<stdlib.h>
int main(void){
 unsigned char c = 4;
 printf(" sizeof(int) = %zu,\n sizeof(unsigned char) = %zu",
 sizeof(int),
 sizeof(unsigned char));
 printf("\n sizeof(~c) = %zu", sizeof(~c)); 
 printf("\n");
 return EXIT_SUCCESS;
}
```
compile it, and run:
```
$ gcc -std=gnu99 -Wall -pedantic x.c -o x
$ ./x
sizeof(int) = 4,
sizeof(unsigned char) = 1
sizeof(~c) = 4
```
Notice: size of result of ~c is same as of int, but not equals to unsigned char — result of ~ operator in this expression is int! that as mentioned 6.5.3.3 Unary arithmetic operators
1. The result of the unary - operator is the negative of its (promoted) operand. The integer promotions are performed on the operand, and the result has the promoted type.

Now, as @haccks also explained in his answer -that result of ~c on 32-bit machine and for value of c = 4 is:

1111 1111 1111 1111 1111 1111 1111 1011

in decimal it is -5 — that is the output of your second code!

In your first code, one more line is interesting to understand b = ~c;, because b is an unsigned char variable and result of ~c is of int type, so to accommodate value of result of ~c to b result value (~c) is truncated to fit into the unsigned char type as follows:

    1111 1111 1111 1111 1111 1111 1111 1011  // -5 & 0xFF
 &  0000 0000 0000 0000 0000 0000 1111 1111  // - one byte      
    -------------------------------------------          
                                  1111 1011

Decimal equivalent of 1111 1011 is 251. You could get same effect using:

printf("\n ~c = %d", ~c  & 0xFF);

or as suggested by @ouah in his answer using explicitly casting.

answered Nov 10 '22 07:11

Grijesh Chauhan

Related questions
                            
                                Determining C executable name
                            
                                Violating of strict-aliasing in C, even without any casting?
                            
                                Append to the end of a file in C
                            
                                What is the fundamental difference between source and header files in C?
                            
                                What is 1LL or 2LL in C and C++?
                            
                                Why isn't C/C++'s "#pragma once" an ISO standard?
                            
                                How to print pthread_t
                            
                                Is memory allocation a system call?
                            
                                Can GCC not complain about undefined references?
                            
                                Should a buffer of bytes be signed or unsigned char buffer?
                            
                                Is there a good reason for always enclosing a define in parentheses in C?
                            
                                How are variable names stored in memory in C?
                            
                                Reading in double values with scanf in c
                            
                                Fast way to generate pseudo-random bits with a given probability of 0 or 1 for each bit
                            
                                How does a 'const struct' differ from a 'struct'?
                            
                                How to simulate an EOF?
                            
                                What does tilde(~) operator do?
                            
                                What does this GCC error "... relocation truncated to fit..." mean?
                            
                                Compiler not detecting obviously uninitialized variable
                            
                                Can a const variable be used to declare the size of an array in C?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With