Let's say I have this grammar: <pre class="prettyprint"><code>A: ε | B 'a' B: ε | B 'b' </code></pre> What is considered to be the closure of the item <code>A: • B 'a'</code>? In other words, how do I deal with the epsilon transitions when figuring out closures?

This is pretty straightforward. Included in the closure of <pre class="prettyprint"><code> A = ... <dot> X ... ; </code></pre> are all the rules <pre class="prettyprint"><code> X = <dot> R1 R2 R3 ... ; </code></pre> where first(R1) is not empty. For each (nonempty) token K in first(R1), you'll need to (transitively!) include <pre class="prettyprint"><code> R1 = <dot> k ... ; </code></pre> etc. but presumably you are already clear on this. You specific question is what happens if R1 can be empty? Then you also need to include <pre class="prettyprint"><code> X = R1 <dot> R2 ... ; </code></pre> Similarly for R2 being empty, if R1 can be empty, and similarly for Ri being empty if R1 .. Ri-1 can be empty. In extreme circumstances, all the Ri can be empty (lots of optional subclauses in your grammar), and you can end up including <pre class="prettyprint"><code> X = R1 R2 ... Rn <dot> ; </code></pre> Note that determining that first(R1) "can be empty" is itself a transitive closure question. The GLR parser generator that I built for DMS precomputes first_can_be_empty using Warshall's algorithm and then uses that in the closure construction.

What is the closure of a left-recursive LR(0) item with epsilon transitions?

Tags:

language-agnostic

parsing

grammar

lr

epsilon

Let's say I have this grammar:

A: ε
 | B 'a'
B: ε
 | B 'b'

What is considered to be the closure of the item A: • B 'a'?
In other words, how do I deal with the epsilon transitions when figuring out closures?

357

asked Oct 19 '12 05:10

user541686

1 Answers

This is pretty straightforward. Included in the closure of

    A = ... <dot> X ... ;

are all the rules

    X =   <dot> R1 R2 R3 ... ;

where first(R1) is not empty. For each (nonempty) token K in first(R1), you'll need to (transitively!) include

    R1 = <dot> k ... ;

etc. but presumably you are already clear on this.

You specific question is what happens if R1 can be empty? Then you also need to include

    X =   R1 <dot> R2 ... ;

Similarly for R2 being empty, if R1 can be empty, and similarly for Ri being empty if R1 .. Ri-1 can be empty. In extreme circumstances, all the Ri can be empty (lots of optional subclauses in your grammar), and you can end up including

    X =  R1 R2 ... Rn <dot> ;

Note that determining that first(R1) "can be empty" is itself a transitive closure question.

The GLR parser generator that I built for DMS precomputes first_can_be_empty using Warshall's algorithm and then uses that in the closure construction.

103

answered Sep 29 '22 04:09

Ira Baxter

Related questions
                            
                                DateTime.Parse fails for today (01 mar 2012)! o_0
                            
                                Haskell library for parsing Bash scripts?
                            
                                PHP parse_ini_file relative path?
                            
                                < > changed to &lt; and &gt; while parsing html with beautifulsoup in python
                            
                                Invertible State monad (and parsers)
                            
                                How to parse an html document into an AST that includes line numbers for each node?
                            
                                How can I skip a parsing rule using ANTLR 4?
                            
                                How to correctly error out in JSON parsing with Data.Aeson
                            
                                Boost::Spirit - on_error not printing
                            
                                Custom Converter for Retrofit
                            
                                Unindented code breaks my grammar
                            
                                Fast parsing of string that allows escaped characters?
                            
                                What does a parser for C++ do until it can differentiate between comparisons and template instantiations?
                            
                                Raise ParseError in Haskell/Parsec
                            
                                Parsing with DCGs in Scheme (without Prolog)?
                            
                                Fastest Multi-Threading Method of Serial Port Data Parsing C#
                            
                                Finding a language that is not LL(1)?
                            
                                converting .mov file to .h264 file
                            
                                Using PLY to parse SQL statements
                            
                                Flex++ Bisonc++ parser

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With