When I imbue a regex object with a particular locale, how does it affect the matching behavior? Does it affect collation, or anything else? I can't seem to find an explanation anywhere.
It affects at least the following:
[a-f]
imbued with a French locale should match the character é.\w
in a Finnish locale should match the character ä (but [a-z]
should not, as å, ä and ö collate after z in Finnish. In German, however, [a-z]
should match ä.)[=e=]
should match é in a French locale but not in an English locale.On the spanish locale, please note that "ch" and "ll" are not considered single letters in the alphabet any more, as this was changed by relevant entities. I could not find the exact date, but it right now "ch" and "ll" are two letters:
http://en.wikipedia.org/wiki/Ll
I think implementations now reflect that fact.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With