How do pointers work "under the hood" in C?

Tags:

Take a simple program like this:

int main(void)
{
    char p;
    char *q;

    q = &p;

    return 0;
}

How is &p determined? Does the compiler calculate all such references before-hand or is it done at runtime? If at runtime, is there some table of variables or something where it looks these things up? Does the OS keep track of them and it just asks the OS?

My question may not even make sense in the context of the correct explanation, so feel free to set me straight.

826

asked Mar 14 '14 17:03

Chris Middleton

1 Answers

How is &p determined? Does the compiler calculate all such references before-hand or is it done at runtime?

This is an implementation detail of the compiler. Different compilers can choose different techniques depending on the kind of operating system they are generating code for and the whims of the compiler writer.

Let me describe for you how this is typically done on a modern operating system like Windows.

When the process starts up, the operating system gives the process a virtual address space, of, let's say 2GB. Of that 2GB, a 1MB section of it is set aside as "the stack" for the main thread. The stack is a region of memory where everything "below" the current stack pointer is "in use", and everything in that 1MB section "above" it is "free". How the operating system chooses which 1MB chunk of virtual address space is the stack is an implementation detail of Windows.

(Aside: whether the free space is at the "top" or "bottom" of the stack, whether the "valid" space grows "up" or "down" is also an implementation detail. Different operating systems on different chips do it differently. Let's suppose the stack grows from high addresses to low addresses.)

The operating system ensures that when main is invoked, the register ESP contains the address of the dividing line between the valid and free portions of the stack.

(Aside: again, whether the ESP is the address of the first valid point or the first free point is an implementation detail.)

The compiler generates code for main that pushes the stack pointer by lets say five bytes, by subtracting from it if the stack is growing "down". It decreases by five because it needs one byte for p and four for q. So the stack pointer changes; there are now five more "valid" bytes and five fewer "free" bytes.

Let's say that q is the memory that is now in ESP through ESP+3 and p is the memory now in ESP+4. To assign the address of p to q, the compiler generates code that copies the four byte value ESP+4 into the locations ESP through ESP+3.

(Aside: Note that it is highly likely that the compiler lays out the stack so that everything that has its address taken is on an ESP+offset value that is divisible by four. Some chips have requirements that addresses be divisible by pointer size. Again, this is an implementation detail.)

If you do not understand the difference between an address used as a value and an address used as a storage location, figure that out. Without understanding that key difference you will not be successful in C.

That's one way it could work but like I said, different compilers can choose to do it differently as they see fit.

176

answered Oct 05 '22 05:10

Eric Lippert

Related questions
                            
                                Object file to binary code
                            
                                Detect mp4 files
                            
                                Declaring array causing error
                            
                                C compiling error: stray '##' in program
                            
                                Is it possible to generate and run TemplateHaskell generated code at runtime?
                            
                                why InterlockedAdd is not available in vs2010?
                            
                                1D array decays to pointer, but 2D array doesn't do so, why? [duplicate]
                            
                                does chroot() require root privileges?
                            
                                Why does MapViewOfFile fail with ERROR_ACCESS_DENIED?
                            
                                Multiple color object detection using OpenCV
                            
                                C Global and Static variable storing in memory
                            
                                Infix to postfix algorithm that takes care of unary operators
                            
                                Is null character included while allocating using malloc
                            
                                Inconsistency in using pointer to an array and address of an array directly
                            
                                keep getting implicit declaration error
                            
                                Eliminate branching when find median in a binary {0, 255} image
                            
                                difference between time() and gettimeofday() and why does one cause seg fault
                            
                                "Expected expression before ' { ' token"
                            
                                How to convert int to string with Pebble SDK in C
                            
                                gcc shared library failed linking to glibc

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do pointers work "under the hood" in C?

Tags:

c

pointers

memory-address

Chris Middleton

People also ask

1 Answers

Eric Lippert

Recent Activity

Donate For Us