I am having difficulties understanding the principle of lookahead in LR(1) - items. How do I compute the lookahead sets? Say for an example that I have the following grammar: <pre class="prettyprint"><code>S -> AB A -> aAb | b B -> d </code></pre> Then the first state will look like this: <pre class="prettyprint"><code>S -> .AB , {look ahead} A -> .aAb, {look ahead} A -> .b, {look ahead} </code></pre> I know what look aheads are, but I don't know how to compute them. I have googled for answers but couldn't find a webpage which explains this in a simple manner. Thanks in advance

I'll write down the first two states for your example: <pre class="prettyprint"><code>S -> AB A -> aAb | b B -> d </code></pre> <h3>State 0:</h3> <pre class="prettyprint"><code>(1) S -> .AB, {$} # Once we have done this rule it's EOF ($) (2) A -> .aAb, {d} # from (1), after A there has to be a B whose first symbol has to be d (3) A -> .b, {d} # as above </code></pre> <h3>State 1:</h3> <pre class="prettyprint"><code>(4) A -> a.Ab, {d} # from (2) (5) A -> .aAb, {b} # from (4), the symbol after the A has to be b (6) A -> .b, {b} # from (4), as above (7) A -> b., {d} # from (3) (8) S -> A.B, {$} # from (1) and (7) (9) B -> .B, {$} # from (8) </code></pre> and so on, keep following the same shift/reduce/closure as you would for an LR(0) parser, but keep track of (lookahead for) the next symbol... (State 2+ are longer, I don't recommend working them out by hand!) I suggest looking into Udacity's Programming Languages course to learn more about lexing and parsing. There is also an example on wikipedia and a related SO question.

LR(1) - Items, Look Ahead

Tags:

parsing

context-free-grammar

formal-languages

automata-theory

I am having difficulties understanding the principle of lookahead in LR(1) - items. How do I compute the lookahead sets?

Say for an example that I have the following grammar:

Click to copy

S -> AB
A -> aAb | b
B -> d

Then the first state will look like this:

Click to copy

S -> .AB , {look ahead}
A -> .aAb, {look ahead}
A -> .b,   {look ahead}

I know what look aheads are, but I don't know how to compute them. I have googled for answers but couldn't find a webpage which explains this in a simple manner.

Thanks in advance

769

asked Nov 19 '12 18:11

mrjasmin

1 Answers

I'll write down the first two states for your example:

Click to copy

S -> AB
A -> aAb | b
B -> d

State 0:

Click to copy

(1) S -> .AB, {$}   # Once we have done this rule it's EOF ($) 
(2) A -> .aAb, {d}  # from (1), after A there has to be a B whose first symbol has to be d
(3) A -> .b, {d}    # as above

State 1:

Click to copy

(4) A -> a.Ab, {d}   # from (2)
(5) A -> .aAb, {b}   # from (4), the symbol after the A has to be b
(6) A -> .b, {b}     # from (4), as above
(7) A -> b., {d}     # from (3)
(8) S -> A.B, {$}    # from (1) and (7)
(9) B -> .B, {$}     # from (8)

and so on, keep following the same shift/reduce/closure as you would for an LR(0) parser, but keep track of (lookahead for) the next symbol...
(State 2+ are longer, I don't recommend working them out by hand!)

I suggest looking into Udacity's Programming Languages course to learn more about lexing and parsing. There is also an example on wikipedia and a related SO question.

answered Sep 23 '22 07:09

Andy Hayden

Related questions
                            
                                Extract coordinates from KML file in Java
                            
                                Node request throwing: Error: Invalid URI "www.urlworksinbrowser.com" or options.uri is a required argument
                            
                                C#, JSON Parsing, dynamic variable. How to check type?
                            
                                Is there a way or an algorithm to convert DCG into normal definite clauses in Prolog?
                            
                                Parsing a simple text grammar with Superpower
                            
                                Fastest way to parse JSON from String when format is known
                            
                                Parsing a string
                            
                                Interpreter in Python: Making your own programming language?
                            
                                Shove a delimited string into a List<int>
                            
                                Attempt to parse JSON without crashing Node.js server
                            
                                What is the internal mechanism that browsers use to process/understand HTML? [closed]
                            
                                DateTime parse not working as expected
                            
                                Determining redirected URL in Python
                            
                                Regular expression problem (extracting one text or another)
                            
                                Scala code parser (not compiler)
                            
                                gcc for parsing code
                            
                                How to parse between <div class ="foo"> and </div> easily in Perl
                            
                                Enum.GetName() for bit fields?
                            
                                How to parse invalid HTML with Perl?
                            
                                Haskell Parsec Parser for Encountering [...]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

LR(1) - Items, Look Ahead

Tags:

parsing

context-free-grammar

formal-languages

automata-theory

mrjasmin

People also ask

1 Answers

State 0:

State 1:

Andy Hayden

Recent Activity

Donate For Us