What does a parser for C++ do until it can differentiate between comparisons and template instantiations?

Tags:

After reading this question I am left wondering what happens (regarding the AST) when major C++ compilers parse code like this:

struct foo 
{
  void method() { a<b>c; }

  // a b c may be declared here
};

Do they handle it like a GLR parser would or in a different way? What other ways are there to parse this and similar cases?

For example, I think it's possible to postpone parsing the body of the method until the whole struct has been parsed, but is this really possible and practical?

450

asked Oct 26 '18 14:10

panoskj

1 Answers

Although it is certainly possible to use GLR techniques to parse C++ (see a number of answers by Ira Baxter), I believe that the approach commonly used in commonly-used compilers such as gcc and clang is precisely that of deferring the parse of function bodies until the class definition is complete. (Since C++ source code passes through a preprocessor before being parsed, the parser works on streams of tokens and that is what must be saved in order to reparse the function body. I don't believe that it is feasible to reparse the source code.)

It's easy to know when a function definition is complete, since braces ({}) must balance even if it is not known how angle brackets nest.

C++ is not the only language in which it is useful to defer parsing until declarations have been handled. For example, a language which allows users to define new operators with different precedences would require all expressions to be (re-)parsed once the names and precedences of operators are known. A more pathological example is COBOL, in which the precedence of OR in a = b OR c depends on whether c is an integer (a is equal to one of b or c) or a boolean (a is equal to b or c is true). Whether designing languages in this manner is a good idea is another question.

answered Sep 19 '22 03:09

rici

Related questions
                            
                                Windows: Get function address in C++
                            
                                swscaler bad src image pointers
                            
                                C++ lambdas as class methods
                            
                                Why is clang over-complicating my simple factorial function?
                            
                                clang vs gcc: different code for volatile access
                            
                                How to force a "statement has no effect" warning on overloaded==
                            
                                Intellisense not working for unit test project in Visual Studio Professional 2017
                            
                                c++ default move assignment cannot access protected base member
                            
                                Is initializing a pointer declarator with an invalid pointer undefined behavior?
                            
                                Finding 2 equal sum sub-sequences, with maximum sum?
                            
                                Visual Studio, running cmakesettings.json from the command line
                            
                                Are lock-free atomics address-free in practice?
                            
                                How to compile C++ code using Visual Studio Code in Ubuntu? [closed]
                            
                                Does std::vector::assign/std::vector::operator=(const&) guarantee to reuse the buffer in `this`?
                            
                                How to handle differently sized type in C library from C++
                            
                                inline function in different translation units with different compiler flags undefined behaviour?
                            
                                Overflow on bitfield for signed underlying type enum
                            
                                Google Sparsehash uses realloc() on type which is not trivially copyable
                            
                                Why doesn't C++ show a narrowing conversion error when casting a float to a char?
                            
                                Life-time of object declared in the second "parameter" of 'for' statement

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What does a parser for C++ do until it can differentiate between comparisons and template instantiations?

Tags:

c++

parsing

gcc

compiler-construction

clang

panoskj

People also ask

1 Answers

rici

Recent Activity

Donate For Us