Came across an interesting interview question: <pre class="prettyprint"><code>test 1: printf("test %s\n", NULL); printf("test %s\n", NULL); prints: test (null) test (null) test 2: printf("%s\n", NULL); printf("%s\n", NULL); prints Segmentation fault (core dumped) </code></pre> Though this might run fine on some systems, atleast mine is throwing a segmentation fault. What would be the best explanation of this behavior? Above code is in C. Following is my gcc info: <pre class="prettyprint"><code>deep@deep:~$ gcc --version gcc (Ubuntu/Linaro 4.6.3-1ubuntu5) 4.6.3 </code></pre>

First things first: <code>printf</code> is expecting a valid (i.e. non-NULL) pointer for its %s argument so passing it a NULL is officially undefined. It may print "(null)" or it may delete all files on your hard drive--either is correct behavior as far as ANSI is concerned (at least, that's what Harbison and Steele tells me.) That being said, yeah, this is really wierd behavior. It turns out that what's happening is that when you do a simple <code>printf</code> like this: <pre class="prettyprint"><code>printf("%s\n", NULL); </code></pre> gcc is (ahem) smart enough to deconstruct this into a call to <code>puts</code>. The first <code>printf</code>, this: <pre class="prettyprint"><code>printf("test %s\n", NULL); </code></pre> is complicated enough that gcc will instead emit a call to real <code>printf</code>. (Notice that gcc emits warnings about your invalid <code>printf</code> argument when you compile. That's because it long ago developed the ability to parse <code>*printf</code> format strings.) You can see this yourself by compiling with the <code>-save-temps</code> option and then looking through the resulting <code>.s</code> file. When I compiled the first example, I got: <pre class="prettyprint"><code>movl $.LC0, %eax movl $0, %esi movq %rax, %rdi movl $0, %eax call printf ; <-- Actually calls printf! </code></pre> (Comments were added by me.) But the second one produced this code: <pre class="prettyprint"><code>movl $0, %edi ; Stores NULL in the puts argument list call puts ; Calls puts </code></pre> The wierd thing is that it doesn't print the following newline. It's as though it's figured out that this is going to cause a segfault so it doesn't bother. (Which it has--it warned me when I compiled it.)

As far as the C language is concerned, the reason is that you're invoking undefined behavior and anything can happen. As for the mechanics of why this is happening, modern gcc optimizes <code>printf("%s\n", x)</code> to <code>puts(x)</code>, and <code>puts</code> does not have the silly code to print <code>(null)</code> when it sees a null pointer, whereas common implementations of <code>printf</code> have this special case. Since gcc can't optimize (in general) non-trivial format strings like this, <code>printf</code> actually gets called when the format string has other text present in it.

What is the behavior of printing NULL with printf's %s specifier?

Tags:

c

linux

language-lawyer

compiler-bug

Came across an interesting interview question:

test 1: printf("test %s\n", NULL); printf("test %s\n", NULL);  prints: test (null) test (null)  test 2: printf("%s\n", NULL); printf("%s\n", NULL); prints Segmentation fault (core dumped)

Though this might run fine on some systems, atleast mine is throwing a segmentation fault. What would be the best explanation of this behavior? Above code is in C.

Following is my gcc info:

deep@deep:~$ gcc --version gcc (Ubuntu/Linaro 4.6.3-1ubuntu5) 4.6.3

985

asked Jul 21 '12 04:07

Deepanjan Mazumdar

2 Answers

First things first: printf is expecting a valid (i.e. non-NULL) pointer for its %s argument so passing it a NULL is officially undefined. It may print "(null)" or it may delete all files on your hard drive--either is correct behavior as far as ANSI is concerned (at least, that's what Harbison and Steele tells me.)

That being said, yeah, this is really wierd behavior. It turns out that what's happening is that when you do a simple printf like this:

printf("%s\n", NULL);

gcc is (ahem) smart enough to deconstruct this into a call to puts. The first printf, this:

printf("test %s\n", NULL);

is complicated enough that gcc will instead emit a call to real printf.

(Notice that gcc emits warnings about your invalid printf argument when you compile. That's because it long ago developed the ability to parse *printf format strings.)

You can see this yourself by compiling with the -save-temps option and then looking through the resulting .s file.

When I compiled the first example, I got:

movl    $.LC0, %eax movl    $0, %esi movq    %rax, %rdi movl    $0, %eax call    printf      ; <-- Actually calls printf!

(Comments were added by me.)

But the second one produced this code:

movl    $0, %edi    ; Stores NULL in the puts argument list call    puts        ; Calls puts

The wierd thing is that it doesn't print the following newline. It's as though it's figured out that this is going to cause a segfault so it doesn't bother. (Which it has--it warned me when I compiled it.)

184

answered Sep 22 '22 07:09

Chris Reuter

As far as the C language is concerned, the reason is that you're invoking undefined behavior and anything can happen.

As for the mechanics of why this is happening, modern gcc optimizes printf("%s\n", x) to puts(x), and puts does not have the silly code to print (null) when it sees a null pointer, whereas common implementations of printf have this special case. Since gcc can't optimize (in general) non-trivial format strings like this, printf actually gets called when the format string has other text present in it.

answered Sep 20 '22 07:09

R.. GitHub STOP HELPING ICE

Related questions
                            
                                Download file using libcurl in C/C++
                            
                                How could I graphically display the memory layout from a .map file? [closed]
                            
                                Compile a DLL in C/C++, then call it from another program
                            
                                How to compile makefile using MinGW?
                            
                                How does GCC optimize out an unused variable incremented inside a loop?
                            
                                know if .lib is static or import
                            
                                Detached vs. Joinable POSIX threads
                            
                                When to use const char * and when to use const char []
                            
                                How does a NOP sled work?
                            
                                Can I get Unix's pthread.h to compile in Windows?
                            
                                Point in Polygon Algorithm
                            
                                C vs C++ struct alignment
                            
                                When is uint8_t ≠ unsigned char?
                            
                                Difference between files written in binary and text mode
                            
                                How to get the sign, mantissa and exponent of a floating point number
                            
                                Is a C compiler allowed to coalesce sequential assignments to volatile variables?
                            
                                How to build a release version binary in Go?
                            
                                Is it bad to declare a C-style string without const? If so, why?
                            
                                warning: left shift count >= width of type
                            
                                Global variables in header file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With