<pre class="prettyprint"><code>%token <token> PLUS MINUS INT %left PLUS MINUS </code></pre> THIS WORKS: <pre class="prettyprint"><code>exp : exp PLUS exp; exp : exp MINUS exp; exp : INT; </code></pre> THIS HAS 2 SHIFT/REDUCE CONFLICTS: <pre class="prettyprint"><code>exp : exp binaryop exp; exp : INT; binaryop: PLUS | MINUS ; </code></pre> WHY?

You need to specify a precedence for the <code>exp binop exp</code> rule if you want the precedence rules to resolve the ambiguity: <pre class="prettyprint"><code>exp : exp binaryop exp %prec PLUS; </code></pre> With that change, all the conflicts are resolved. Edit The comments seem to indicate some confusion as to what the precedence rules in yacc/bison do. The precedence rules are a way of semi-automatically resolving shift/reduce conflicts in the grammar. They're only semi-automatic in that you have to know what you are doing when you specify the precedences. Bascially, whenever there is a shift/reduce conflict between a token to be shifted and a rule to be reduced, yacc compares the precedence of the token to be shifted and the rule to be reduced, and -- as long as both have assigned precedences -- does whichever is higher precedence. If either the token or the rule has no precedence assigned, then the conflict is reported to the user. <code>%left</code>/<code>%right</code>/<code>%nonassoc</code> come into the picture when the token and rule have the SAME precedence. In that case <code>%left</code> means do the reduce, <code>%right</code> means do the shift, and <code>%nonassoc</code> means do neither, causing a syntax error at runtime if the parser runs into this case. The precedence levels themselves are assigned to tokens with<code>%left</code>/<code>%right</code>/<code>%nonassoc</code> and to rules with <code>%prec</code>. The only oddness is that rules with no <code>%prec</code> and at least one terminal on the RHS get the precedence of the last terminal on the RHS. This can sometimes end up assigning precedences to rules that you really don't want to have precedence, which can sometimes result in hiding conflicts due to resolving them incorrectly. You can avoid these problems by adding an extra level of indirection in the rule in question -- change the problematic terminal on the RHS to to a new non-terminal that expands to just that terminal.

Why does this simple grammar have a shift/reduce conflict?

Tags:

grammar

yacc

bison

shift-reduce-conflict

%token <token> PLUS MINUS INT
%left PLUS MINUS

THIS WORKS:

exp : exp PLUS exp;
exp : exp MINUS exp;
exp : INT;

THIS HAS 2 SHIFT/REDUCE CONFLICTS:

exp : exp binaryop exp;
exp : INT;
binaryop: PLUS | MINUS ;

WHY?

869

asked Mar 15 '12 09:03

MustafaM

2 Answers

This is because the second is in fact ambiguous. So is the first grammar, but you resolved the ambiguity by adding %left.

This %left does not work in the second grammar, because associativity and precedence are not inherited from rule to rule. I.e. the binaryop nonterminal does not inherit any such thing even though it produces PLUS and MINUS. Associativity and predecence are localized to a rule, and revolve around terminal symbols.

We cannot do %left binaryop, but we can slightly refactor the grammar:

exp : exp binaryop term
exp : term;
term : INT;
binaryop: PLUS | MINUS ;

That has no conflicts now because it is implicitly left-associative. I.e. the production of a longer and longer expression can only happen on the left side of the binaryop, because the right side is a term which produces only an INT.

189

answered Oct 06 '22 00:10

Kaz

You need to specify a precedence for the exp binop exp rule if you want the precedence rules to resolve the ambiguity:

exp : exp binaryop exp %prec PLUS;

With that change, all the conflicts are resolved.

Edit

The comments seem to indicate some confusion as to what the precedence rules in yacc/bison do.

The precedence rules are a way of semi-automatically resolving shift/reduce conflicts in the grammar. They're only semi-automatic in that you have to know what you are doing when you specify the precedences.

Bascially, whenever there is a shift/reduce conflict between a token to be shifted and a rule to be reduced, yacc compares the precedence of the token to be shifted and the rule to be reduced, and -- as long as both have assigned precedences -- does whichever is higher precedence. If either the token or the rule has no precedence assigned, then the conflict is reported to the user.

%left/%right/%nonassoc come into the picture when the token and rule have the SAME precedence. In that case %left means do the reduce, %right means do the shift, and %nonassoc means do neither, causing a syntax error at runtime if the parser runs into this case.

The precedence levels themselves are assigned to tokens with%left/%right/%nonassoc and to rules with %prec. The only oddness is that rules with no %prec and at least one terminal on the RHS get the precedence of the last terminal on the RHS. This can sometimes end up assigning precedences to rules that you really don't want to have precedence, which can sometimes result in hiding conflicts due to resolving them incorrectly. You can avoid these problems by adding an extra level of indirection in the rule in question -- change the problematic terminal on the RHS to to a new non-terminal that expands to just that terminal.

answered Oct 06 '22 01:10

Chris Dodd

Related questions
                            
                                Is the base access defined correctly in the C# Language Specification 4.0?
                            
                                Does JSON allow positive sign for numbers?
                            
                                C++ Why adding a destructor to my class makes my class unmovable?
                            
                                Parsing a possibly nested braced item using a grammar
                            
                                More than enough "Always succeed"? [ RAKU ]
                            
                                It is possible to write NQP's precedence parser in Raku
                            
                                Parse tree and grammar information
                            
                                Antlr: Simplest way to recognize dates and numbers?
                            
                                Is this grammar not LR(1)?
                            
                                Bison: Shift Reduce Conflict
                            
                                Grammar ambiguity: why? (problem is: "(a)" vs "(a-z)")
                            
                                Non greedy parsing with pyparsing
                            
                                Determine whether a grammar is an LL using pairwise disjoint test
                            
                                Do grammar subparse on a file
                            
                                How to use matching delimiters in Raku
                            
                                Learn Prolog Now! DCG Practice Example
                            
                                Is this an ambiguous grammar? How should I resolve it?
                            
                                What does ^ and ! stand for in ANTLR grammar
                            
                                Parsing with incomplete grammars
                            
                                Using Parse::RecDescent

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With