Is this an ambiguous grammar? How should I resolve it?

Tags:

To preface this, my knowledge of this kind of stuff is puny.

Anyways, I've been developing a context-free grammar to describe the structure of alegbraic expressions so I can teach myself how the CYK parsing algorithm works. I understand how such a structure can work with only infix algebraic expressions, but I cannot understand how to develop a grammar that can handle both the unary and binary definitions of the "-" operator.

For reference, here's the grammar I've written (where S is the start symbol) in CNF:

S -> x
A -> O S
S -> L B
B -> S R
S -> K S
O -> +
O -> -
O -> *
O -> /
O -> ^
K -> -
L -> (
R -> )

The problem is that how can the CYK parsing algorithm know ahead of time whether to decide between S -> K S and A -> O S when it encounters the "-" operator? Is such a grammar context-free anymore? And most importantly, since programming languages can handle languages with both the binary and unary minus sign, how should I reasonably parse this?

459

asked Jun 26 '10 01:06

Tom O

2 Answers

This seems like a problem related to finite state automata and I don't remember everything from my coursework, but I wrote a CYK parser in OCaml, so I'll go ahead and take a shot :)

If you're trying to parse an expression like 3- -4 for example, you would have your S -> K S rule consume the -4 and then your A -> O S rule would absorb the - -4. This would eventually work up to the top-most S production rule. You should be careful with the grammar you're using though, since the A production rule you listed cannot be reached from S and you should probably have a S -> S O S rule of some sort.

The way that CYK parsing algorithms work is through backtracking, not through the "knowing ahead of time" that you mentioned in your question. What your CYK algorithm should do is to parse the -4 as a S -> K S rule and then it would try to absorb the second - with the S -> K S rule again because this production rule allows for an arbitrarily long chain of unary -. But once your algorithm realizes that it's stuck with the intermediate parse 3 S, it realizes that it has no production symbols that it can use to parse this. Once it realizes that this is no longer parseable, it will go back and instead try to parse the - as an S -> O S rule instead and continue on its merry way.

This means that your grammar remains context-free since a context-sensitive grammar means that you have terminals on the left side of the production rules, so you're good in that respect. HTH!

155

answered Sep 16 '22 22:09

SHC

The grammar is ambiguous, and the parser cannot decide which case to take.

You should probably use a grammar like the following:

S -> EXPR
EXPR -> (EXPR)
EXPR -> - EXPR
EXPR -> EXPR + EXPR
EXPR -> EXPR - EXPR
// etc...

answered Sep 19 '22 22:09

apaderno

Related questions
                            
                                Modular run-length encoding
                            
                                finding the count of number of sub arrays of size K whose sum is divisible by M?
                            
                                Find longest adjacent repeating non-overlapping substring
                            
                                Reduce binary string to an empty string by removing subsequences with alternative characters
                            
                                Can this code to find the neighborhood of a string be sped up?
                            
                                Numpy matrix multiplication but instead of multiplying it XOR's elements
                            
                                fast geometric proximity predicate
                            
                                What is the fastest way to find the point of intersection between a ray and a polygon?
                            
                                n-dimensional matching algorithm
                            
                                Algorithm for unique CD-KEY generation with validation
                            
                                Finding the highest 2 numbers- computer science
                            
                                Algorithm for fitting 2D polygons in an area?
                            
                                Fast counting of 2D sub-matrices withing a large, dense 2D matrix?
                            
                                Algorithm for creating cells by spiral on the hexagonal field
                            
                                Strategy to implement tree traversing algorithm in parallel?
                            
                                Traversal of a tree to find a node
                            
                                How can I build an incremental directed acyclic word graph to store and search strings?
                            
                                Building a tree using a list of objects
                            
                                Algorithm to see if keywords exist inside a string
                            
                                Finding a small image in a bigger one [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is this an ambiguous grammar? How should I resolve it?

Tags:

algorithm

theory

grammar

context-free-grammar

Tom O

People also ask

2 Answers

SHC

apaderno

Recent Activity

Donate For Us