Why are there LR(0) parsers but not LL(0) parsers?

1 Answers

The difference has to do with what the k means in LR(k) versus LL(k).

In LL(k), the parser maintains information about a top-down, left-to-right parse that traces out a leftmost derivation. The parser works by repeatedly looking at the current nonterminal symbol, then inspecting the next k tokens of the input stream to determine which production should be used. As a result, if you have an LL(0) parser, the parser would have to predict which production to use based purely on the current nonterminal. This is only possible if each nonterminal has just one production associated with it, which means that the grammar either produces exactly one string or no strings at all (by going into a loop). Therefore, while it is mathematically well-defined, LL(0) parsing is never used in practice.

In LR(k), the parser works bottom-up. It maintains a stack of symbols, along with a current "state," and then continuously decides whether to perform a shift (pushing another symbol on top of the stack) or a reduce (popping some number of symbols from the stack and applying a production in reverse). One key difference between an LL(k) and LR(k) parser is how the LR(k) parser decides which action to perform. In the LR(k) parser, the decision of what to do next s based on both the next k tokens of lookahead and on the current state of the parser. As a result, an LR(0) parser can still make somewhat intelligent decisions about which action to perform even if it can't see any lookahead tokens, because the parser's current state can encode a huge amount of information about where in a production the parser is and what it can realistically expect to see as the next tokens of input (even if the parser can't directly look at those tokens).

In short, LL(0) is extremely weak because the parser has to purely base its decision on the current nonterminal, meaning that it can't take one of many different actions based on which production might be used. An LR(0) parser is decently powerful because the parser bases its choice on its internal state, which is usually sufficiently detailed to let the parser make informed decisions about what to do next.

There is one other reason LL(0) is weak while LR(0) is reasonably powerful. In an LL(0) parser, the parser has to immediately decide which productions ought to be performed, which means that the parser has to guess productions completely blindly. In an LR(0) parser, the parser can shift multiple symbols before deciding that it is time to reduce. Consequently, the parser, without any lookahead, can defer making its decision about which reduction to use until after it has seen enough tokens of input to get a sense for the structure of the string. This is a special case of the more general fact that any LL(k) grammar is automatically LR(k).

Hope this helps!

answered Oct 26 '22 02:10

templatetypedef

Related questions
                            
                                Do not convert JToken date time string as DateTime
                            
                                Fastest way to check if a string can be parsed
                            
                                Is it possible to use HttpBrowserCapabilities from a c# console application?
                            
                                Uncaught SyntaxError: Unexpected end of JSON input at JSON.parse (<anonymous>)
                            
                                Parsing Objective-C code for static analysis
                            
                                Extract data from XML Clob using SQL from Oracle Database
                            
                                Difference betwean RegexpParsers,StandardTokenParsers and JavaTokenParsers in scala
                            
                                How to parse signed zero?
                            
                                Recursive Descent Parser
                            
                                How do you parse an HTML in vb.net
                            
                                How does a C parser distinguish between a type cast and a function call in general?
                            
                                How does a parser for C++ differentiate between comparisons and template instantiations?
                            
                                regex match keywords that are not in quotes
                            
                                Matching math expression with regular expression?
                            
                                Custom Skip Parser with Boost::Spirit
                            
                                Why can't go parse the time represented by the provided formats?
                            
                                Using argparse to parse arguments of form "arg= val"
                            
                                Using DOMDocument, is it possible to get all elements that exists within a certain DOM?
                            
                                Shell script argument parsing
                            
                                Making Xerces parse a string instead of a file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why are there LR(0) parsers but not LL(0) parsers?

Tags:

parsing

compiler-construction

ll

lr

Shmoopy

People also ask

1 Answers

templatetypedef

Recent Activity

Donate For Us