BNF vs EBNF vs ABNF: which to choose?

Tags:

I want to come up with a language syntax. I have read a bit about these three, and can't really see anything that one can do that another can't. Is there any reason to use one over another? Or is it just a matter of preference?

436

asked Apr 04 '10 16:04

Jason Baker

2 Answers

You have to think about EBNF and ABNF as extensions that help you just to be more concise and expressive while developing your grammars.

For example think about an optional non-terminal symbol, in a BNF grammar you would define it by using intermediate symbols like:

A        ::= OPTIONAL OTHER OPTIONAL ::= opt_part | epsilon

while with EBNF you can do it directly using optional syntax:

A ::= [opt_part] OTHER

Then since there's no way to express precedence in a BNF you have to use always intermediate symbols also for nested choices:

BNF A ::= B C B ::= a | b | c  EBNF A ::= (a | b | c) C

This is true for many syntax issues that are allowed in an EBNF or ABNF grammar, thanks to syntactic sugar but not with a normal BNF. ABNF extends EBNF, allowing you to do more complicated things, like specifying how many occurrence of a symbol can be found together (i.e. 4*DIGIT)

So choosing an ABNF or an EBNF as language of choice for your grammar will make your work easier, since you will be more expressive without filling you grammar with useless symbols that will be generated anyway by your parser generator, but you won't care about them!

187

answered Oct 14 '22 19:10

Jack

According to Wikipedia, ABNF's double quoted string literals are case-insensitive, and case-sensitive matches must be defined as numeric ASCII values. I consider that a disadvantage.

Literal text is specified through the use of a string enclosed in quotation marks ("). These strings are case-insensitive and the character set used is (US-)ASCII. Therefore the string “abc” will match “abc”, “Abc”, “aBc”, “abC”, “ABc”, “AbC”, “aBC”, and “ABC”. For a case-sensitive match the explicit characters must be defined: to match “aBc” the definition will be %d97.66.99.

https://en.wikipedia.org/wiki/Augmented_Backus%E2%80%93Naur_Form#Terminal_values

However, RFC 7405 seems to add case-sensitive string literals to ABNF.

https://www.rfc-editor.org/rfc/rfc7405

answered Oct 14 '22 18:10

trololo

Related questions
                            
                                JavaScript dot notation [duplicate]
                            
                                Is there any difference between "T" and "const T" in template parameter?
                            
                                Why does one select Scala type members with a hash instead of a dot?
                            
                                Private scoping with square brackets (private[...]) in Scala
                            
                                Passing function as block of code between curly braces
                            
                                Is there a specification for a man page's SYNOPSIS section?
                            
                                AutoHotkey syntax highlighting in Notepad++
                            
                                Why must delegation to a different constructor happen first in a Java constructor?
                            
                                Should I put a Semicolon (;) when I use onclick=""
                            
                                What is under the hood of x = 'y' 'z' in Python?
                            
                                PowerShell string interpolation syntax
                            
                                How to define a function on one line
                            
                                A textbox/richtextbox that has syntax highlighting? [C#] [closed]
                            
                                Why is the semicolon optional in the last statement in php?
                            
                                How to do a single line If statement in VBScript for Classic-ASP?
                            
                                What are the rules governing usage of parenthesis in VBA function calls?
                            
                                Why a full stop, "." and not a plus symbol, "+", for string concatenation in PHP?
                            
                                Why are MySQL syntax error messages so bad?
                            
                                Better "return if not None" in Python
                            
                                Keyboard shortcut to "Comment" a line in NANO?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

BNF vs EBNF vs ABNF: which to choose?

Tags:

syntax

grammar

bnf

ebnf

Jason Baker

People also ask

2 Answers

Jack

trololo

Recent Activity

Donate For Us