Is anyone aware of any JavaScript implementations of UAX #29, Unicode Text Segmentation? I'm specifically interested in Word Boundaries.
I was hopeful when I came across XRegExp, but it seems to use the standard JavaScript implementation of \b
.
https://github.com/orling/grapheme-splitter is a pure js implementation of UAX #29 Grapheme Cluster Boundaries.
There is also an ES proposal on implementing Intl.Segmenter using UAX #29, see https://github.com/tc39/proposal-intl-segmenter.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With