
Needed: open source C/C++ regular expression library with Unicode support

I'm looking for a good open source C/C++ regular expression library that has full Unicode support.

I'm using this in an environment where the library might get ASCII, UTF-8, or UTF-16. If it gets UTF-16, the input may or may not begin with a byte order mark (FF FE or FE FF).
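The byte pairs mentioned above are the UTF-16 byte order mark (BOM): FF FE for little-endian and FE FF for big-endian. A minimal sketch of sniffing it before handing the buffer to a regex engine might look like the following. The names `Encoding` and `detect_bom` are hypothetical and not part of any library, and the sketch assumes UTF-32 input never occurs (a UTF-32LE BOM also starts with FF FE).

```cpp
#include <cstddef>

// Hypothetical result type for this sketch; not from any library.
enum class Encoding { Utf8Bom, Utf16LE, Utf16BE, Unknown };

// Inspect the first bytes of a buffer for a byte order mark.
// Returns Unknown when no BOM is present; the caller then has to
// fall back to a heuristic or an out-of-band hint, since BOM-less
// UTF-16, UTF-8, and ASCII cannot be told apart this way.
Encoding detect_bom(const unsigned char* data, std::size_t len) {
    // UTF-8 BOM: EF BB BF
    if (len >= 3 && data[0] == 0xEF && data[1] == 0xBB && data[2] == 0xBF)
        return Encoding::Utf8Bom;
    // UTF-16 little-endian BOM: FF FE (assumes no UTF-32 input)
    if (len >= 2 && data[0] == 0xFF && data[1] == 0xFE)
        return Encoding::Utf16LE;
    // UTF-16 big-endian BOM: FE FF
    if (len >= 2 && data[0] == 0xFE && data[1] == 0xFF)
        return Encoding::Utf16BE;
    return Encoding::Unknown;
}
```

When the result is `Unknown`, the BOM-less case described above still needs separate handling; this check only covers input that announces its own encoding.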

I've looked around and there don't seem to be any options other than PCRE.

My second problem is that I'm currently using flex to build some HUGE regular expressions. Ideally I would have a flex-like lexical expression generator that also handles Unicode.

Any suggestions?

vy32 asked Feb 23 '11 05:02


1 Answer

Have you considered ICU (International Components for Unicode)?

It has mature regular expression support.

John Weldon answered Sep 21 '22 06:09