I have trouble understanding how to compute the lookaheads for the LR(1)-items. Lets say that I have this grammar: <pre class="prettyprint"><code>S -> AB A -> aAb | a B -> d </code></pre> A LR(1)-item is an LR(0) item with a lookahead. So we will get the following LR(0)-item for state 0: <pre class="prettyprint"><code>S -> .AB , {lookahead} A -> .aAb, {lookahead} A -> .a, {lookahead} </code></pre> State: 1 <pre class="prettyprint"><code>A -> a.Ab, {lookahead} A -> a. ,{lookahead} A -> .aAb ,{lookahead} A ->.a ,{lookahead} </code></pre> Can somebody explain how to compute the lookaheads ? What is the general approach ? Thank you in advance

The lookaheads used in an LR(1) parser are computed as follows. First, the start state has an item of the form <pre class="prettyprint"><code>S -> .w ($) </code></pre> for every production S -> w, where S is the start symbol. Here, the $ marker denotes the end of the input. Next, for any state that contains an item of the form A -> x.By (t), where x is an arbitrary string of terminals and nonterminals and B is a nonterminal, you add an item of the form B -> .w (s) for every production B -> w and for every terminal in the set FIRST(yt). (Here, FIRST refers to FIRST sets, which are usually introduced when talking about LL parsers. If you haven't seen them before, I would take a few minutes to look over those lecture notes). Let's try this out on your grammar. We start off by creating an item set containing <pre class="prettyprint"><code>S -> .AB ($) </code></pre> Next, using our second rule, for every production of A, we add in a new item corresponding to that production and with lookaheads of every terminal in FIRST(B$). Since B always produces the string d, FIRST(B$) = d, so all of the productions we introduce will have lookahead d. This gives <pre class="prettyprint"><code>S -> .AB ($) A -> .aAb (d) A -> .a (d) </code></pre> Now, let's build the state corresponding to seeing an 'a' in this initial state. We start by moving the dot over one step for each production that starts with a: <pre class="prettyprint"><code>A -> a.Ab (d) A -> a. (d) </code></pre> Now, since the first item has a dot before a nonterminal, we use our rule to add one item for each production of A, giving those items lookahead FIRST(bd) = b. This gives <pre class="prettyprint"><code>A -> a.Ab (d) A -> a. (d) A -> .aAb (b) A -> .a (b) </code></pre> Continuing this process will ultimately construct all the LR(1) states for this LR(1) parser. This is shown here: <pre class="prettyprint"><code>[0] S -> .AB ($) A -> .aAb (d) A -> .a (d) [1] A -> a.Ab (d) A -> a. (d) A -> .aAb (b) A -> .a (b) [2] A -> a.Ab (b) A -> a. (b) A -> .aAb (b) A -> .a (b) [3] A -> aA.b (d) [4] A -> aAb. (d) [5] S -> A.B ($) B -> .d ($) [6] B -> d. ($) [7] S -> AB. ($) [8] A -> aA.b (b) [9] A -> aAb. (b) </code></pre> In case it helps, I taught a compilers course last summer and have all the lecture slides available online. The slides on bottom-up parsing should cover all of the details of LR parsing and parse table construction, and I hope that you find them useful! Hope this helps!

LR(1) Item DFA - Computing Lookaheads

Tags:

I have trouble understanding how to compute the lookaheads for the LR(1)-items.

Lets say that I have this grammar:

Click to copy

S -> AB A -> aAb | a B -> d

A LR(1)-item is an LR(0) item with a lookahead. So we will get the following LR(0)-item for state 0:

Click to copy

S -> .AB , {lookahead}  A -> .aAb,  {lookahead}  A -> .a,  {lookahead}

State: 1

Click to copy

A ->  a.Ab, {lookahead}  A ->  a. ,{lookahead}  A -> .aAb ,{lookahead}  A ->.a ,{lookahead}

Can somebody explain how to compute the lookaheads ? What is the general approach ?

Thank you in advance

737

asked Dec 31 '12 15:12

mrjasmin

2 Answers

The lookaheads used in an LR(1) parser are computed as follows. First, the start state has an item of the form

Click to copy

S -> .w  ($)

for every production S -> w, where S is the start symbol. Here, the $ marker denotes the end of the input.

Next, for any state that contains an item of the form A -> x.By (t), where x is an arbitrary string of terminals and nonterminals and B is a nonterminal, you add an item of the form B -> .w (s) for every production B -> w and for every terminal in the set FIRST(yt). (Here, FIRST refers to FIRST sets, which are usually introduced when talking about LL parsers. If you haven't seen them before, I would take a few minutes to look over those lecture notes).

Let's try this out on your grammar. We start off by creating an item set containing

Click to copy

S -> .AB ($)

Next, using our second rule, for every production of A, we add in a new item corresponding to that production and with lookaheads of every terminal in FIRST(B$). Since B always produces the string d, FIRST(B$) = d, so all of the productions we introduce will have lookahead d. This gives

Click to copy

S -> .AB ($) A -> .aAb (d) A -> .a (d)

Now, let's build the state corresponding to seeing an 'a' in this initial state. We start by moving the dot over one step for each production that starts with a:

Click to copy

A -> a.Ab (d) A -> a. (d)

Now, since the first item has a dot before a nonterminal, we use our rule to add one item for each production of A, giving those items lookahead FIRST(bd) = b. This gives

Click to copy

A -> a.Ab (d) A -> a. (d) A -> .aAb (b) A -> .a (b)

Continuing this process will ultimately construct all the LR(1) states for this LR(1) parser. This is shown here:

Click to copy

[0] S -> .AB  ($) A -> .aAb (d) A -> .a   (d)  [1] A -> a.Ab (d) A -> a.   (d) A -> .aAb (b) A -> .a   (b)  [2] A -> a.Ab (b) A -> a.   (b) A -> .aAb (b) A -> .a   (b)  [3] A -> aA.b (d)  [4] A -> aAb. (d)  [5] S -> A.B  ($) B -> .d   ($)  [6] B -> d.   ($)  [7] S -> AB.  ($)  [8] A -> aA.b (b)  [9] A -> aAb. (b)

In case it helps, I taught a compilers course last summer and have all the lecture slides available online. The slides on bottom-up parsing should cover all of the details of LR parsing and parse table construction, and I hope that you find them useful!

Hope this helps!

113

answered Sep 30 '22 05:09

templatetypedef

here is the LR(1) automaton for the grammar as the follow has been done above I think it's better for the understanding to trying draw the automaton and the flow will make the idea of the lookaheads clearer

here is the automaton for the grammar

answered Sep 30 '22 07:09

M.Alamer

Related questions
                            
                                Javascript memory and leak problems
                            
                                Why does foreach fail to find my GetEnumerator extension method?
                            
                                Xcode Environment Variables Not Present During Archive
                            
                                Is there a general way to mark a JUnit test as pending?
                            
                                Why does Intellij-IDEA ignore my tomcat/conf/server.xml Context tag?
                            
                                Getting error : Uncaught TypeError: undefined is not a function bootstrap.js:29 (anonymous function) bootstrap.js:29 (anonymous function)
                            
                                Is the "X-Mailer: PHP/<phpversion>" header required to send a mail in PHP?
                            
                                Disable auto-pairing of characters in Textmate 2?
                            
                                Bouncy Castle : PEMReader => PEMParser
                            
                                Where to look first when optimizing Scala code? [closed]
                            
                                java.lang.RuntimeException: Failed to invoke public com.example.syncapp.MessageBase() with no args
                            
                                rqworker timeout

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

LR(1) Item DFA - Computing Lookaheads

Tags:

mrjasmin

People also ask

2 Answers

templatetypedef

M.Alamer

Recent Activity

Donate For Us