I am currently enrolled in a CS107 class which makes the following assumptions: <ul> <li><code>sizeof(int) == 4</code></li> <li><code>sizeof(short) == 2</code></li> <li><code>sizeof(char) == 1</code></li> <li>big endianness</li> </ul> My professor showed the following code: <pre class="prettyprint"><code>int arr[5]; ((short*)(((char*) (&arr[1])) + 8))[3] = 100; </code></pre> Here are the 20 bytes representing <code>arr</code>: <pre class="prettyprint"><code>|....|....|....|....|....| </code></pre> My professor states that <code>&arr[1]</code> points here, which I agree with. <pre class="prettyprint"><code>|....|....|....|....|....| x </code></pre> I now understand that <code>(char*)</code> makes the pointer the width of a char (1 byte) instead of the width of an int (4 bytes). What I don't understand is the <code>+ 8</code>, which my professor says points here: <pre class="prettyprint"><code>|....|....|....|....|....| x </code></pre> But shouldn't it point here, since it is going forwards 8 times the size of a char (1 byte)? <pre class="prettyprint"><code>|....|....|....|....|....| x </code></pre>

Let's take it step by step. Your expression can be decomposed like this: <pre class="prettyprint"><code>((short*)(((char*) (&arr[1])) + 8))[3] ----------------------------------------------------- char *base = (char *) &arr[1]; char *base_plus_offset = base + 8; short *cast_into_short = (short *) base_plus_offset; cast_into_short[3] = 100; </code></pre> <code>base_plus_offset</code> points at byte location <code>12</code> within the array. <code>cast_into_short[3]</code> refers to a <code>short</code> value at location <code>12 + sizeof(short) * 3</code>, which, in your case is <code>18</code>.

Pointer arithmetic around cast

Tags:

c

casting

pointer-arithmetic

I am currently enrolled in a CS107 class which makes the following assumptions:

sizeof(int) == 4
sizeof(short) == 2
sizeof(char) == 1
big endianness

My professor showed the following code:

int arr[5];
((short*)(((char*) (&arr[1])) + 8))[3] = 100;

Here are the 20 bytes representing arr:

|....|....|....|....|....|

My professor states that &arr[1] points here, which I agree with.

|....|....|....|....|....|
     x

I now understand that (char*) makes the pointer the width of a char (1 byte) instead of the width of an int (4 bytes).

What I don't understand is the + 8, which my professor says points here:

|....|....|....|....|....|
                         x

But shouldn't it point here, since it is going forwards 8 times the size of a char (1 byte)?

|....|....|....|....|....|
               x

727

asked Feb 17 '15 17:02

Alexey

2 Answers

Let's take it step by step. Your expression can be decomposed like this:

((short*)(((char*) (&arr[1])) + 8))[3]
-----------------------------------------------------
char *base = (char *) &arr[1];
char *base_plus_offset = base + 8;
short *cast_into_short = (short *) base_plus_offset;
cast_into_short[3] = 100;

base_plus_offset points at byte location 12 within the array. cast_into_short[3] refers to a short value at location 12 + sizeof(short) * 3, which, in your case is 18.

189

answered Oct 12 '22 17:10

Blagovest Buyukliev

The expression will set the two bytes 18 bytes after the start of arr to the value 100.

#include <stdio.h>

int main() {

    int arr[5];

    char* start=(char*)&arr;
    char* end=(char*)&((short*)(((char*) (&arr[1])) + 8))[3];

    printf("sizeof(int)=%zu\n",sizeof(int));
    printf("sizeof(short)=%zu\n",sizeof(short));
    printf("offset=%td <- THIS IS THE ANSWER\n",(end-start));
    printf("100=%04x (hex)\n",100);

    for(size_t i=0;i<5;++i){

       printf("arr[%zu]=%d (%08x hex)\n",i,arr[i],arr[i]);

    }

}

Possible Output:

sizeof(int)=4
sizeof(short)=2
offset=18 <- THIS IS THE ANSWER
100=0064 (hex)
arr[0]=0 (00000000 hex)
arr[1]=0 (00000000 hex)
arr[2]=0 (00000000 hex)
arr[3]=0 (00000000 hex)
arr[4]=6553600 (00640000 hex)

In all your professors shenanigans he's shifted you 1 integer, 8 chars/bytes and 3 shorts that 4+8+6=18 bytes. Bingo.

Notice this output reveals the machine I ran this on to have 4 byte integers, 2 byte short (common) and be little-endian because the last two bytes of the array were set to 0x64 and 0x00 respectively.

I find your diagrams dreadfully confusing because it isn't very clear if you mean the '|' to be addresses or not.

|....|....|....|....|
012345678901234567890
    ^     1 ^     ^ 2
A   X       C     S B

Include the bars ('|') A is the start of Arr and B is 'one past the end' (a legal concept in C).

X is the address referred to by the expression &Arr[1]. C by the expression (((char*) (&arr[1])) + 8). S by the whole expression. S and the byte following are assigned to and what that means depends on the endian-ness of your platform.

I leave it as an exercise to determine what the output on a similar but big-endian platform who output. Anyone? I notice from the comments you're big-endian and I'm little-endian (stop sniggering). You only need to change one line of the output.

answered Oct 12 '22 17:10

Persixty

Related questions
                            
                                Force ignore duplicate symbols?
                            
                                Buffers and Memoryview Objects explained for the non-C programmer
                            
                                string array initialisation
                            
                                Relearning C: New idioms? [closed]
                            
                                shared library constructor not working
                            
                                sscanf & newlines
                            
                                Wrapping C function in Cython and NumPy
                            
                                What is the rationale for == having higher precedence than bitwise AND, XOR, and OR? [closed]
                            
                                Where to add a CFLAG, such as -std=gnu99, into an (Eclipse CDT) autotools project
                            
                                Calling Python code from a C thread
                            
                                How to copy a file in C/C++ with libssh and SFTP
                            
                                Are there Clojure-like STM libraries for C
                            
                                How can I pass an array as parameters to a vararg function?
                            
                                ARM-Kernel Testing Module
                            
                                color object tracking in openCV keeps detecting the skin
                            
                                What is different functions: `malloc()` and `kmalloc()`?
                            
                                What is the complexity of this sum algorithm?
                            
                                static library implementation vs including source code implementation
                            
                                Should I free strdup pointer after basename/dirname in C?
                            
                                Is there a way of passing macro names as arguments to nested macros without them being expanded when the outermost macro is expanded?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With