

Difference in results when using int and size_t

Tags:

c

pointers

32bit-64bit

size-t

64-bit

I was reading an article on the usage of the size_t and ptrdiff_t data types when I came across this example:

[Image: the example as shown in the article]

The code:

int A = -2;
unsigned B = 1;
int array[5] = { 1, 2, 3, 4, 5 };
int *ptr = array + 3;
ptr = ptr + (A + B); //Error
printf("%i\n", *ptr);

I am unable to understand a couple of things. First, how can adding a signed and an unsigned number convert the entire result to an unsigned type? If the result is indeed 0xFFFFFFFF of unsigned type, why, on a 32-bit system, is adding it to ptr interpreted as ptr - 1, given that the number is actually unsigned and its leading 1 bit should not signify a sign?

Second, why is the result different on a 64-bit system?

Can anyone explain this please?


asked Nov 15 '14 21:11

SexyBeast

2 Answers

1. I am unable to understand a couple of things. First, how can adding a signed and an unsigned number convert the entire result to an unsigned type?

This is defined by the usual arithmetic conversions, which depend on the integer conversion rank of the operands.

6.3.1.8 p1: Otherwise, if the operand that has unsigned integer type has rank greater or equal to the rank of the type of the other operand, then the operand with signed integer type is converted to the type of the operand with unsigned integer type.

In this case unsigned has the same rank as int, so the int operand is converted to unsigned.

The conversion of the int value -2 to unsigned is performed as described in:

6.3.1.3 p2: Otherwise, if the new type is unsigned, the value is converted by repeatedly adding or subtracting one more than the maximum value that can be represented in the new type until the value is in the range of the new type.

For a 32-bit unsigned, -2 therefore becomes -2 + 0x100000000 = 0xFFFFFFFE, and adding B (which is 1) yields 0xFFFFFFFF.
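The conversion rule quoted above can be checked with a small sketch. The function names (to_unsigned, mixed_sum, check_conversion) are made up for illustration; the checks are written against UINT_MAX, so they do not assume a particular width for unsigned.

```c
#include <assert.h>
#include <limits.h>

/* Converts an int to unsigned; per 6.3.1.3 p2 the value is reduced
   modulo UINT_MAX + 1. */
unsigned to_unsigned(int value)
{
    return (unsigned)value;
}

/* Mirrors A + B from the question: the int operand is converted to
   unsigned before the addition takes place. */
unsigned mixed_sum(int a, unsigned b)
{
    return a + b;
}

/* Hypothetical self-check using the values from the question. */
void check_conversion(void)
{
    assert(to_unsigned(-2) == UINT_MAX - 1u); /* 0xFFFFFFFE if unsigned has 32 bits */
    assert(mixed_sum(-2, 1u) == UINT_MAX);    /* 0xFFFFFFFF if unsigned has 32 bits */
}
```

Calling check_conversion() from a main function should pass silently on any conforming implementation, because unsigned arithmetic is defined to wrap.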

2. If the result is indeed 0xFFFFFFFF of unsigned type, why, on a 32-bit system, is adding it to ptr interpreted as ptr - 1, given that the number is actually unsigned and its leading 1 bit should not signify a sign?

This is undefined behavior and should not be relied on, since C doesn't define the result of pointer arithmetic that overflows or leaves the bounds of the array.

6.5.6 p8: If both the pointer operand and the result point to elements of the same array object, or one past the last element of the array object, the evaluation shall not produce an overflow; otherwise, the behavior is undefined.

3. Second, why is the result different on a 64-bit system?

(This assumes, as does the picture, that int and unsigned are 4 bytes.)

The result of A + B is the same as described in 1.; that result is then added to the pointer. Since the pointer is 8 bytes wide, and assuming the addition doesn't overflow (it still could if ptr held a large address, giving the same undefined behavior as in 2.), the result is an address.

This is undefined behavior because the pointer points way outside of the bounds of the array.
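As a hedged sketch of how the undefined behavior could be avoided (this is not taken from the article or the answers), the offset can be computed in a signed type such as ptrdiff_t before it is added to the pointer. The function name element_at_signed_offset is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Returns the element reached from array + 3 by the signed offset A + B. */
int element_at_signed_offset(void)
{
    int A = -2;
    unsigned B = 1;
    int array[5] = { 1, 2, 3, 4, 5 };
    int *ptr = array + 3;

    /* Convert both operands to ptrdiff_t first, so the sum is -1
       instead of UINT_MAX. */
    ptrdiff_t offset = (ptrdiff_t)A + (ptrdiff_t)B;

    /* From array + 3, only offsets -3..1 land on a dereferenceable element. */
    assert(offset >= -3 && offset <= 1);

    ptr = ptr + offset;
    return *ptr; /* array[2] */
}
```

With the offset held in a signed type, the pointer stays inside the array on both 32-bit and 64-bit systems, so the result no longer depends on pointer width.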

answered Sep 19 '22 13:09

2501

The operands of the expression A + B are subject to the usual arithmetic conversions, covered in C11 (n1570) 6.3.1.8 p1:

[...]

Otherwise, the integer promotions [which leave int and unsigned int unchanged] are performed on both operands. Then the following rules are applied to the promoted operands:

If both operands have the same type, [...]

Otherwise, if both operands have signed integer types or both have unsigned integer types, [...]

Otherwise, if the operand that has unsigned integer type has rank greater or equal to the rank of the type of the other operand, then the operand with signed integer type is converted to the type of the operand with unsigned integer type.

[...]

The types int and unsigned int have the same rank (ibid. 6.3.1.1 p1, 4th bullet); the result of the addition has type unsigned int.
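The resulting type can be confirmed at compile time with a C11 generic selection. This is a minimal sketch assuming a C11 (or later) compiler; the function name sum_is_unsigned_int is made up for illustration.

```c
#include <assert.h>

/* Compile-time check (C11): int + unsigned int has type unsigned int. */
static_assert(_Generic((-2 + 1u), unsigned int: 1, default: 0),
              "int + unsigned int should have type unsigned int");

/* Returns 1 if A + B from the question has type unsigned int, 0 otherwise.
   The operand of _Generic is not evaluated; only its type is inspected. */
int sum_is_unsigned_int(void)
{
    int A = -2;
    unsigned B = 1;
    return _Generic((A + B), unsigned int: 1, default: 0);
}
```

If the usual arithmetic conversions produced any other type, the static_assert would stop compilation.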

On 32-bit systems, int and pointers usually have the same size (32 bits). From a hardware-centric point of view (and assuming two's complement), subtracting 1 and adding -1u are the same operation (addition for signed and unsigned types is the same!), so the access to the array element appears to work.

However, this is undefined behaviour, as array doesn't contain a 0x100000003rd element.

On 64-bit systems, int usually still has 32 bits, but pointers have 64 bits. Thus, from a hardware-centric point of view, there is no wraparound and no equivalence to subtracting 1. As far as the C standard is concerned, the behaviour is undefined in both cases.

To illustrate, say ptr is 0xabcd0123; adding 0xffffffff yields

    abcd0123
  + ffffffff
  ----------
   1abcd0122
   ^-- The 1 is truncated for a 32-bit calculation, but not for 64-bit.
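The same arithmetic can be reproduced with fixed-width unsigned integers, which wrap in a well-defined way. This is only a sketch of the address calculation: like the illustration, it ignores the scaling by sizeof(int) that real pointer arithmetic applies, and the function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Adds the offset in a 32-bit register: the carry out of bit 31 is lost. */
uint32_t add_in_32_bits(uint32_t address, uint32_t offset)
{
    return (uint32_t)(address + offset);
}

/* Adds the offset in a 64-bit register: the 32-bit offset is zero-extended,
   so the carry ends up in bit 32 instead of being discarded. */
uint64_t add_in_64_bits(uint64_t address, uint32_t offset)
{
    return address + offset;
}

/* Hypothetical self-check using the values from the illustration. */
void check_address_sums(void)
{
    assert(add_in_32_bits(0xabcd0123u, 0xffffffffu) == 0xabcd0122u);
    assert(add_in_64_bits(0xabcd0123u, 0xffffffffu) == 0x1abcd0122u);
}
```

In 32 bits the sum is one less than the starting address, which is why the access appears to work there; in 64 bits it lands roughly 4 GiB further on.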

answered Sep 21 '22 13:09

mafso
