Why does C++11 support 6 different regular expression grammars?

2 Answers

The standardization process is all about pragmatism. There are benefits to including a RE grammar in the standard, as long as it's correctly specified, but no benefit to dropping one.

Exclusion would make it easier for a library implementer to apply a "100% C++11 compliant" badge, but who really cares? Nobody should be making that claim anyway, and only ignorant PHBs would be looking for it. Libraries always have bugs which prevent reaching 100%, and a good library has an excess of features.

Note that all the included grammars are specified by already existing international standards. So little effort is needed on the part of the C++ committee. Just §28.13, which is a couple pages long.

If they leave out a standardized grammar, then different Standard Library implementers will add it under different names, resulting in incompatibility. This is unlikely to happen for a grammar which is merely defined by a popular library, where the library implementer will be responsible for the C++ interface, not Standard Library vendors.

151

answered Sep 26 '22 22:09

Potatoswatter

This is covered by the TR1 proposal. I will attempt to summarize.

It seemed prudent to build on an existing standard rather than to strike out on their own.

Two existing standards that they could build upon were identified: POSIX REs and ECMAScript REs. Perl REs were left out because they aren’t standardized. (Which reasonable people could disagree with.) Also, ECMAScript REs were seen as an simpler subset of Perl REs which covers the most useful (or perhaps most used) features.

Of the two, POSIX REs’ “leftmost longest” implementation did not play well with important features, like non-greedy repeats, and was at odds with how most RE engines work these days.

On the other hand, ECMAScript REs lacked the localization support of POSIX REs. So, they extended ECMAScript REs to include POSIX-RE—style localization support.

POSIX RE support was included as optional since it’s behavior is different enough from ECMAScript REs to justify it being an standard option. The POSIX standard comes with two grammars: Basic and extended. The awk, grep, and egrep REs are all just trivial variations to the basic or extended POSIX grammars rather than truly separate grammars.

So: Two standards, three grammars, six variations.

answered Sep 24 '22 22:09

Robert Fisher

Related questions
                            
                                Fold expression with comma operator and variadic template parameter pack
                            
                                Perfect forwarding in constructors (C++17)
                            
                                Why is const int fine for char brace init?
                            
                                How does a vector as a key works internally in C++?
                            
                                Why is a name's point of declaration before its initializer?
                            
                                Can spaceship operator be used in fold expressions?
                            
                                using constexpr to return pointer
                            
                                Recommended Open Source Profilers [closed]
                            
                                Threadsafe Vector class for C++
                            
                                How to detect when an exception is in flight?
                            
                                Why is the delete operator required to be static?
                            
                                Choosing between instance methods and free functions?
                            
                                Is there a way to use template specialization to separate new from new[]?
                            
                                How is inheritance implemented at the memory level?
                            
                                C++ anonymous class initialization
                            
                                Convert BSTR to char*
                            
                                std::lower_bound and comparator function with different types?
                            
                                Difference between default-initialize and value-initialize? [duplicate]
                            
                                Setting Attributes on Datasets using HDF5 C++ api
                            
                                delete cout; delete cin; do not give compilation error - a flaw in the Standard library?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does C++11 support 6 different regular expression grammars?

Tags:

c++

regex

std

c++11

rkjnsn

People also ask

2 Answers

Potatoswatter

Robert Fisher

Recent Activity

Donate For Us