Explanation about viable prefix

Tags:

compiler-construction

In Ullman's book of compilers, in shift reduce parsing, following definition of viable prefix is given :

"The set of prefixes of right sentential forms that can appear on the stack of a shift-reduce parser are called viable prefixes. An equivalent definition of a viable prefix is that it is a prefix of right sentential form that does not continue past the right end of the rightmost handle of that sentential form. By this definition, it is always possible to add terminal symbols to the end of a viable prefix to obtain a right sentential form. Therefore, there is apparently no error as long as the portion of the input seen to a given point can be reduced to a viable prefix."

I can't understand this definition. Could someone explain the meaning of viable prefix with an example ?
In particular, please explain the meaning of
"An equivalent definition of a viable prefix is that it is a prefix of right sentential form that does not continue past the right end of the rightmost handle of that sentential form"

984

asked Nov 17 '10 07:11

Happy Mittal

2 Answers

EDIT (or actually rewrite): The sentence you asked for a clarification of is a major hairball! I needed a bit of a refresher on language and automata myself to pick that hairball apart. I found these lecture notes very useful in that regard.

It also doesn't make it any easier that the terms are defined in terms of top-down expansion, while rightmost derivation is typically used with bottom-up parsing!

I'll use the following expression grammar to illustrate:

     expr -> expr + term | term     term -> term * factor | factor     factor -> NUMBER | ( expr )

A right-sentential form is a sentential form which can be reached by rightmost derivation, which is another way to describe repeated expansion of only the rightmost nonterminal symbol when proceeding top-down. This is a rightmost derivation, and all the forms in it are therefore right-sentential forms:

     expr -> expr + term          -> expr + term * factor          -> expr + term * NUMBER          -> expr + factor * NUMBER          -> expr + NUMBER * NUMBER          -> expr + term + NUMBER * NUMBER          -> expr + NUMBER + NUMBER * NUMBER          -> term + NUMBER + NUMBER * NUMBER          -> NUMBER + NUMBER + NUMBER * NUMBER

A prefix of a sentential form (whether right or otherwise) is a sequence of input symbols that reduces to zero or more leading symbols of that sentential form. The empty sequence is trivially a prefix of every sentential form, and the complete sequence of symbols making up a sentential form is also trivially a prefix of it.
A simple phrase is the expansion of a single nonterminal symbol that holds a place in a sentential form. For example, term * factor is a simple phrase because it is an expansion of term, and term itself occurs in three productions.
The handle of a sentential form is the leftmost simple phrase within that form. (I admit, I find the term 'handle' somewhat confusing here.) In a rightmost derivation, the handle is easy to identify -- it's the sequence of symbols that resulted from the most recently expanded nonterminal. If you're working bottom-up the way a shift-reduce parser does, the handle is the simple phrase that needs to be reduced next. (Read the derivation table above from the bottom up, looking at which symbols were reduced, to see what I mean.)
A viable prefix of a right-sentential form is a prefix which does not extend beyond that form's handle -- in other words, that prefix is valid and contains no reducible simple phrases, with the possible exception of the handle itself if said prefix extends to exactly the end of the handle.

From a shift-reduce parser's point of view, as long as you have a viable prefix on the stack, you have not yet been forced to either reduce the (possibly incomplete) simple phrase on top of the stack to a new nonterminal or fail out of parsing if it can't be reduced. If shifting the next symbol would result in something other than a viable prefix, you must at that point either reduce or fail.

If you're parsing a context-free language, there is a rather convenient property that helps with the building of a table-driven shift-reduce parser: the set of all viable prefixes of a context-free language is itself a regular language! You can therefore build a finite automaton that recognizes the regular language of viable prefixes, and use it to determine when to shift and when to reduce. This combination of a stack and a finite state machine is essentially a push-down automaton, which is exactly the class of automaton needed to recognize a context-free language.

190

answered Oct 14 '22 13:10

Jeffrey Hantin

Consider the grammar given in book(i'm restating it here)

E -> E+T | T T -> T*F | F  F -> (E) | id

which is augmented by adding E' -> E in it

Now have a look at this derivation,

E' -> E    -> E+T    -> E+T*F

Claim E+T* is a viable prefix

Argument: This derivation is a right sentential form & E+T* is a prefix of it. Handle currently is T*F (as reducing T*F to T we can reach the start symbol & hence a successful parse)

And hence, E+T* is a viable prefix as it is a prefix of right sentential form & doesn't extend past the rightmost handle for this sentential form. :)

Other way to define it is:

The prefixes of right sentential forms that can appear on the stack of a shiftreduce parser are called viable prefixes.

answered Oct 14 '22 13:10

ak2

Related questions
                            
                                Scala parser combinators vs ANTLR/Java generated parser?
                            
                                What is the advantage of using a parser generator like happy as opposed to using parser combinators?
                            
                                How can I obtain the named arguments from a console application in the form of a Dictionary<string,string>?
                            
                                Best way to process large XML in PHP [duplicate]
                            
                                ffmpeg Progress Bar - Encoding Percentage in PHP
                            
                                How do you parse a filename in bash?
                            
                                Parse JSON String to JSON Object in C#.NET
                            
                                HtmlAgility - Save parsing to a string
                            
                                JavaScript can't convert Hindi/Arabic numbers to real numeric variables
                            
                                How can I parse dates and convert time zones in Perl?
                            
                                XPath to Parse "SRC" from IMG tag?
                            
                                Parsing hostname and port from string or url
                            
                                Parse URL in shell script
                            
                                Why is bottom-up parsing more common than top-down parsing?
                            
                                C# generic string parse to any object
                            
                                Read/Parse Binary files with Powershell
                            
                                Extracting table contents from a collection of PDF files [closed]
                            
                                get XPATH for all the nodes
                            
                                Purpose of FIRST and FOLLOW sets in LL(1) parsers?
                            
                                Parse data from JSON in ReactJS

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With