I saw some usage of <code>(void*)</code> in <code>printf()</code>. If I want to print a variable's address, can I do it like this: <pre class="prettyprint"><code>int a = 19; printf("%d", &a); </code></pre> <ol> <li>I think, <code>&a</code> is <code>a</code>'s address which is just an integer, right?</li> <li> Many articles I read use something like this: <pre class="prettyprint"><code>printf("%p", (void*)&a); </code></pre> </li> </ol> <ol> <li>What does <code>%p</code> stand for? (A pointer?)</li> <li>Why use <code>(void*)</code>? Can't I use <code>(int)&a</code> instead?</li> </ol>

Pointers are not numbers. They are often internally represented that way, but they are conceptually distinct. <code>void*</code> is designed to be a generic pointer type. Any pointer value (other than a function pointer) may be converted to <code>void*</code> and back again without loss of information. This typically means that <code>void*</code> is at least as big as other pointer types. <code>printf</code>s <code>"%p"</code> format requires an argument of type <code>void*</code>. That's why an <code>int*</code> should be cast to <code>void*</code> in that context. (There's no implicit conversion because it's a variadic function; there's no declared parameter, so the compiler doesn't know what to convert it to.) Sloppy practices like printing pointers with <code>"%d"</code>, or passing an <code>int*</code> to <code>printf</code> with a <code>"%p"</code> format, are things that you can probably get away with on most current systems, but they render your code non-portable. (Note that it's common on 64-bit systems for <code>void*</code> and <code>int</code> to be different sizes, so printing pointers with <code>%d"</code> is really non-portable, not just theoretically.) Incidentally, the output format for <code>"%p"</code> is implementation-defined. Hexadecimal is common, (in upper or lower case, with or without a leading <code>"0x"</code> or <code>"0X"</code>), but it's not the only possibility. All you can count on is that, assuming a reasonable implementation, it will be a reasonable way to represent a pointer value in human-readable form (and that <code>scanf</code> will understand the output of <code>printf</code>). The article you read is entirely correct. The correct way to print an <code>int*</code> value is <pre class="prettyprint"><code>printf("%p", (void*)&a); </code></pre> Don't take the lazy way out; it's not at all difficult to get it right. Suggested reading: Section 4 of the comp.lang.c FAQ. (Further suggested reading: All the other sections. EDIT: In response to Alcott's question: <blockquote> There is still one thing I don't quite understand. <code>int a = 10; int *p = &a;</code>, so p's value is a's address in mem, right? If right, then p's value will range from 0 to 2^32-1 (if cpu is 32-bit), and an integer is 4-byte on 32-bit OS, right? then What's the difference between the p's value and an integer? Can p's value go out of the range? </blockquote> The difference is that they're of different types. Assume a system on which <code>int</code>, <code>int*</code>, <code>void*</code>, and <code>float</code> are all 32 bits (this is typical for current 32-bit systems). Does the fact that <code>float</code> is 32 bits imply that its range is 0 to 232-1? Or -231 to 231-1? Certainly not; the range of float (assuming IEEE representation) is approximately -3.40282e+38 to +3.40282e+38, with widely varying resolution across the range, plus exotic values like negative zero, subnormalized numbers, denormalized numbers, infinities, and NaNs (Not-a-Number). <code>int</code> and <code>float</code> are both 32 bits, and you can take the 32 bits of a <code>float</code> object and treat it as an <code>int</code> representation, but the result won't have any straightforward relationship to the value of the <code>float</code>. The second low-order bit of an <code>int</code>, for example, has a specific meaning; it contributes 0 to the value if it's 0, and 2 to the value if it's 1; the corresponding bit of a <code>float</code> has a meaning, but it's quite different (it contributes a value that depends on the value of the exponent). The situation with pointers is quite similar. A pointer value has a meaning: it's the address of some object (or any of several other things, but we'll set that aside for now). On most current systems, interpreting the bits of a pointer object as if it were an integer gives you something that makes sense on the machine level. But the language itself does not guarantee, or even hint, that that's the case. Pointers are not numbers. A concrete example: some years ago, I ran across some code that tried to compute the difference in bytes between two addresses by casting to integers. It was something like this: <pre class="prettyprint"><code>unsigned char *p0; unsigned char *p1; long difference = (unsigned long)p1 - (unsigned long)p0; </code></pre> If you assume that pointers are just numbers, representing addresses in a linear monolithic address space, then this code makes sense. But that assumption is not supported by the language. And in fact, there was a system on which that code was intended to run (the Cray T90) on which it simply would not have worked. The T90 had 64-bit pointers pointing to 64-bit words. Byte pointers were synthesized in software by storing an offset in the 3 high-order bits of a pointer object. Subtracting two pointers in the above manner, if they both had 0 offsets, would give you the number of words, not bytes, between the addresses. And if they had non-0 offsets, it would give you meaningless garbage. (Conversion from a pointer to an integer would just copy the bits; it could have done the work to give you a meaningful byte index, but it didn't.) The solution was simple: drop the casts and use pointer arithmetic: <pre class="prettyprint"><code>long difference = p1 - p0; </code></pre> Other addressing schemes are possible. For example, an address might consist of a descriptor that (perhaps indirectly) references a block of memory, plus an offset within that block. You can assume that addresses are just numbers, that the address space is linear and monolithic, that all pointers are the same size and have the same representation, that a pointer can be safely converted to <code>int</code>, or to <code>long</code>, and back again without loss of information. And the code you write based on those assumptions will probably work on most current systems. But it's entirely possible that some future systems will again use a different memory model, and your code will break. If you avoid making any assumptions beyond what the language actually guarantees, your code will be far more future-proof. And even leaving portability issues aside, it will probably be cleaner.

In C <code>void *</code> is an un-typed pointer. <code>void</code> does not mean void... it means anything. Thus casting to <code>void *</code> would be the same as casting to "pointer" in another language. Using <code>(int *)&a</code> should work too... but the stylistic point of saying <code>(void *)</code> is to say -- I don't care about the type -- just that it is a pointer. Note: It is possible for an implementation of C to cause this construct to fail and still meet the requirements of the standards. I don't know of any such implementations, but it is possible.

When printf is an address of a variable, why use void*?

Tags:

c

pointers

void

I saw some usage of (void*) in printf().

If I want to print a variable's address, can I do it like this:

int a = 19;
printf("%d", &a);

I think, &a is a's address which is just an integer, right?
Many articles I read use something like this:
```
printf("%p", (void*)&a);
```

What does %p stand for? (A pointer?)
Why use (void*)? Can't I use (int)&a instead?

297

asked Sep 03 '11 03:09

Alcott

3 Answers

Pointers are not numbers. They are often internally represented that way, but they are conceptually distinct.

void* is designed to be a generic pointer type. Any pointer value (other than a function pointer) may be converted to void* and back again without loss of information. This typically means that void* is at least as big as other pointer types.

printfs "%p" format requires an argument of type void*. That's why an int* should be cast to void* in that context. (There's no implicit conversion because it's a variadic function; there's no declared parameter, so the compiler doesn't know what to convert it to.)

Sloppy practices like printing pointers with "%d", or passing an int* to printf with a "%p" format, are things that you can probably get away with on most current systems, but they render your code non-portable. (Note that it's common on 64-bit systems for void* and int to be different sizes, so printing pointers with %d" is really non-portable, not just theoretically.)

Incidentally, the output format for "%p" is implementation-defined. Hexadecimal is common, (in upper or lower case, with or without a leading "0x" or "0X"), but it's not the only possibility. All you can count on is that, assuming a reasonable implementation, it will be a reasonable way to represent a pointer value in human-readable form (and that scanf will understand the output of printf).

The article you read is entirely correct. The correct way to print an int* value is

printf("%p", (void*)&a);

Don't take the lazy way out; it's not at all difficult to get it right.

Suggested reading: Section 4 of the comp.lang.c FAQ. (Further suggested reading: All the other sections.

EDIT:

In response to Alcott's question:

There is still one thing I don't quite understand. int a = 10; int *p = &a;, so p's value is a's address in mem, right? If right, then p's value will range from 0 to 2^32-1 (if cpu is 32-bit), and an integer is 4-byte on 32-bit OS, right? then What's the difference between the p's value and an integer? Can p's value go out of the range?

The difference is that they're of different types.

Assume a system on which int, int*, void*, and float are all 32 bits (this is typical for current 32-bit systems). Does the fact that float is 32 bits imply that its range is 0 to 2³²-1? Or -2³¹ to 2³¹-1? Certainly not; the range of float (assuming IEEE representation) is approximately -3.40282e+38 to +3.40282e+38, with widely varying resolution across the range, plus exotic values like negative zero, subnormalized numbers, denormalized numbers, infinities, and NaNs (Not-a-Number). int and float are both 32 bits, and you can take the 32 bits of a float object and treat it as an int representation, but the result won't have any straightforward relationship to the value of the float. The second low-order bit of an int, for example, has a specific meaning; it contributes 0 to the value if it's 0, and 2 to the value if it's 1; the corresponding bit of a float has a meaning, but it's quite different (it contributes a value that depends on the value of the exponent).

The situation with pointers is quite similar. A pointer value has a meaning: it's the address of some object (or any of several other things, but we'll set that aside for now). On most current systems, interpreting the bits of a pointer object as if it were an integer gives you something that makes sense on the machine level. But the language itself does not guarantee, or even hint, that that's the case.

Pointers are not numbers.

A concrete example: some years ago, I ran across some code that tried to compute the difference in bytes between two addresses by casting to integers. It was something like this:

unsigned char *p0;
unsigned char *p1;
long difference = (unsigned long)p1 - (unsigned long)p0;

If you assume that pointers are just numbers, representing addresses in a linear monolithic address space, then this code makes sense. But that assumption is not supported by the language. And in fact, there was a system on which that code was intended to run (the Cray T90) on which it simply would not have worked. The T90 had 64-bit pointers pointing to 64-bit words. Byte pointers were synthesized in software by storing an offset in the 3 high-order bits of a pointer object. Subtracting two pointers in the above manner, if they both had 0 offsets, would give you the number of words, not bytes, between the addresses. And if they had non-0 offsets, it would give you meaningless garbage. (Conversion from a pointer to an integer would just copy the bits; it could have done the work to give you a meaningful byte index, but it didn't.)

The solution was simple: drop the casts and use pointer arithmetic:

long difference = p1 - p0;

Other addressing schemes are possible. For example, an address might consist of a descriptor that (perhaps indirectly) references a block of memory, plus an offset within that block.

You can assume that addresses are just numbers, that the address space is linear and monolithic, that all pointers are the same size and have the same representation, that a pointer can be safely converted to int, or to long, and back again without loss of information. And the code you write based on those assumptions will probably work on most current systems. But it's entirely possible that some future systems will again use a different memory model, and your code will break.

If you avoid making any assumptions beyond what the language actually guarantees, your code will be far more future-proof. And even leaving portability issues aside, it will probably be cleaner.

145

answered Sep 27 '22 20:09

Keith Thompson

So much insanity present here...

%p is generally the correct format specifier to use if you just want to print out a representation of the pointer. Never, ever use %d.

The length of an int and the length of a pointer (void* or otherwise) have no relationship. Most data models on i386 just happen to have 32-bit ints AND 32-bit pointers -- other platforms, including x86-64, are not the same! (This is also historically known as "all the world's a VAX syndrome".) http://en.wikipedia.org/wiki/64-bit#64-bit_data_models

If for some reason you want to hold a memory address in an integral variable, use the right types! intptr_t and uintptr_t. They're in stdint.h. See http://en.wikipedia.org/wiki/Stdint.h#Integers_wide_enough_to_hold_pointers

answered Sep 27 '22 20:09

Nicholas Knight

In C void * is an un-typed pointer. void does not mean void... it means anything. Thus casting to void * would be the same as casting to "pointer" in another language.

Using (int *)&a should work too... but the stylistic point of saying (void *) is to say -- I don't care about the type -- just that it is a pointer.

Note: It is possible for an implementation of C to cause this construct to fail and still meet the requirements of the standards. I don't know of any such implementations, but it is possible.

answered Sep 27 '22 18:09

Hogan

Related questions
                            
                                how to run the .o file after make
                            
                                Is there an actual example where inline is detrimental to the performance of a C program?
                            
                                Why are i2c_smbus function not available? (I2C – Embedded Linux)
                            
                                Testing equality between two __m128i variables
                            
                                GLib-GIO-ERROR**: No GSettings schemas are installed on the system
                            
                                GTK3 and multithreading, replacing deprecated functions
                            
                                Sequence Points between printf function args; does the sequence point between conversions matter?
                            
                                Sleep | warning implicit declaration of function `sleep'?
                            
                                How to compile static .lib library for Windows in Linux or Macos
                            
                                Why is it bad to use short
                            
                                Normalize file path with WinAPI [duplicate]
                            
                                How to make a file descriptor blocking?
                            
                                Reserve RAM in C
                            
                                Access command line arguments without using char **argv in main
                            
                                How to find the current line position of file pointer in C?
                            
                                Ways to avoid Memory Leaks in C/C++
                            
                                Is it possible to access 32-bit registers in C?
                            
                                Are there any alternatives to C? [closed]
                            
                                Is there any harm in calling 'free' for the same pointer twice in a C program?
                            
                                Why does fopen/fgets use both mmap and read system calls to access the data?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With