I discovered that Microsoft Visual Studio compiler and gcc preprocess the following small snippet differently: <pre class="prettyprint"><code># define M3(x, y, z) x + y + z # define M2(x, y) M3(x, y) # define P(x, y) {x, y} # define M(x, y) M2(x, P(x, y)) M(a, b) </code></pre> 'gcc -E' gives the following: <pre class="prettyprint"><code>a + {a + b} </code></pre> , while 'cl /E' issues a warning about missing macro argument and produces the following output: <pre class="prettyprint"><code>a + {a, b} + </code></pre> It seems that commas that came from nested macro expansions are not considered to be argument separators. Unfortunately, I found no description of the algorithm implemented in cl preprocessor, and so I'm not sure that my suggestion is correct. Does anyone know how cl preprocessor works and what's the difference between its algorithm and gcc's? And how the observed behaviour can be explained?

<pre class="prettyprint"><code># define M3(x, y, z) x + y + z # define M2(x, y) M3(x, y) # define P(x, y) {x, y} # define M(x, y) M2(x, P(x, y)) M(a, b) </code></pre> Let us roll this out manually, step by step: <pre class="prettyprint"><code>M(a, b) --> M2(a, P(a, b)) --> M2(a, {a, b}) </code></pre> The standard says: <blockquote> The individual arguments within the list are separated by comma preprocessing tokens, but comma preprocessing tokens between matching inner parentheses do not separate </blockquote> only parentheses are mentioned, so ... <pre class="prettyprint"><code>--> M3(a, {a, b}) --> a + {a + b} </code></pre> Important: <pre class="prettyprint"><code>M3(a, {a, b}) </code></pre> Here, according to the previous quote from the standard, three "arguments" are passed to M3 (using single-quotes to describe tokens/arguments): <pre class="prettyprint"><code>M3('a', '{a', 'b}') </code></pre> which are expanded to <pre class="prettyprint"><code>'a' + '{a' + 'b}' </code></pre> And this is what <code>cpp</code> (4.6.1) gives verbatim: <pre class="prettyprint"><code># 1 "cpp.cpp" # 1 "<built-in>" # 1 "<command-line>" # 1 "cpp.cpp" a + {a + b} </code></pre> <code>cpp</code> (or <code>gcc</code> and <code>g++</code>) are correct, MSVC isn't. As a nobleman make sure a bug report exists.

The only logic that explains such a behavior looks like this. CL way: <pre class="prettyprint"><code> M(a,b) M2(a,P(a,b)) M3(a,P(a,b)) M3(a,{a,b}) -> M3 gets 2 arguments ( 'a' and '{a,b}') instead of 3. | \ / arg1 | arg2 </code></pre> Gcc way: <pre class="prettyprint"><code>M(a,b) M2(a,P(a,b)) M3(a,P(a,b)) M3(a,{a,b}) -> Gcc probably thinks there are 3 arguments here ('a', '{a', 'b}'). | | | arg1 | | arg2 | arg3 </code></pre>

Difference between gcc and Microsoft preprocessor

Tags:

c

visual-studio

c-preprocessor

gcc

I discovered that Microsoft Visual Studio compiler and gcc preprocess the following small snippet differently:

# define M3(x, y, z) x + y + z
# define M2(x, y) M3(x, y)
# define P(x, y) {x, y}
# define M(x, y) M2(x, P(x, y))
M(a, b)

'gcc -E' gives the following:

a + {a + b}

, while 'cl /E' issues a warning about missing macro argument and produces the following output:

a + {a, b} +

It seems that commas that came from nested macro expansions are not considered to be argument separators. Unfortunately, I found no description of the algorithm implemented in cl preprocessor, and so I'm not sure that my suggestion is correct. Does anyone know how cl preprocessor works and what's the difference between its algorithm and gcc's? And how the observed behaviour can be explained?

608

asked Jul 13 '12 11:07

Sergey Syromyatnikov

2 Answers

# define M3(x, y, z) x + y + z
# define M2(x, y) M3(x, y)
# define P(x, y) {x, y}
# define M(x, y) M2(x, P(x, y))
M(a, b)

Let us roll this out manually, step by step:

M(a, b)
--> M2(a, P(a, b))
--> M2(a, {a, b})

The standard says:

The individual arguments within the list are separated by comma preprocessing tokens, but comma preprocessing tokens between matching inner parentheses do not separate

only parentheses are mentioned, so ...

--> M3(a, {a, b})
--> a + {a + b}

Important:

M3(a, {a, b})

Here, according to the previous quote from the standard, three "arguments" are passed to M3 (using single-quotes to describe tokens/arguments):

M3('a', '{a', 'b}')

which are expanded to

'a' + '{a' + 'b}'

And this is what cpp (4.6.1) gives verbatim:

# 1 "cpp.cpp"
# 1 "<built-in>"
# 1 "<command-line>"
# 1 "cpp.cpp"




a + {a + b}

cpp (or gcc and g++) are correct, MSVC isn't.

As a nobleman make sure a bug report exists.

176

answered Oct 12 '22 00:10

Sebastian Mach

The only logic that explains such a behavior looks like this.

CL way:

 M(a,b) 
 M2(a,P(a,b)) 
 M3(a,P(a,b))
 M3(a,{a,b}) -> M3 gets 2 arguments ( 'a' and '{a,b}') instead of 3.
    |  \ /
  arg1  |
      arg2

Gcc way:

M(a,b) 
M2(a,P(a,b)) 
M3(a,P(a,b))
M3(a,{a,b}) -> Gcc probably thinks there are 3 arguments here ('a', '{a', 'b}').
   |  | |
 arg1 | |
   arg2 |
     arg3

answered Oct 12 '22 01:10

SingerOfTheFall

Related questions
                            
                                C/C++ Linux GDB API [closed]
                            
                                What is the type of a pointer to a variable-length array in C?
                            
                                Programming for Young tableaux
                            
                                Force order of execution of C statements?
                            
                                Image scaling (KeepAspectRatioByExpanding) through OpenGL
                            
                                Compiling C programs using libssl on OS X El Capitan?
                            
                                Why do compilers insist on using a callee-saved register here?
                            
                                are C functions declared in <c____> headers guaranteed to be in the global namespace as well as std?
                            
                                How to read & understand C & C++ Standards and the language grammar used therein?
                            
                                Performance of array of functions over if and switch statements
                            
                                mmap with /dev/zero
                            
                                Can I write bytes directly to video memory under Linux, or is there a better way to get data onto the screen?
                            
                                Does Malloc only use the heap if requested memory space is large?
                            
                                fork after malloc in parent... does the child process need to free it?
                            
                                What are the reasons to check for error on close()?
                            
                                Why can't I "goto default;" or "goto case x;" within a switch selection structure?
                            
                                Const correctness for array pointers?
                            
                                Free static checker for C99 code
                            
                                What does C1x inherit from C++?
                            
                                Run preprocessor only but with only for certain statements

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With