How can I create an alphanumeric Regex for all languages?

Question

I had this problem today:

This regex matches only English: [a-zA-Z0-9].

If I need support for any language in this world, what regex should I write?

amit · Accepted Answer

Alphabet/Letter: \p{L}

Number: \p{N}

So for alphnum match for all languages, you can use: [\p{L}\p{N}]+

I was looking for a way to replace all non-alphanum chars for all languages with a space in JS and ended up using the following way to do it:

const regexForNonAlphaNum = new RegExp(/[^\p{L}\p{N}]+/ug);
someText.replace(regexForNonAlphaNum, " ");

Here as it is JS, we need to add u at end to make the regex unicode aware and g stands for global as I wanted match all instances and not just a single instance.

References:

https://www.linkedin.com/pulse/regex-one-pattern-rule-them-all-find-bring-darkness-bind-carranza/?trackingId=U6tRte%2BzTAG6O4AA3CrFmA%3D%3D

https://www.regular-expressions.info/unicode.html

How can I create an alphanumeric Regex for all languages?

Tags:

language-agnostic

regex

unicode

non-english

tawfekov

1 Answers

amit

Recent Activity

Donate For Us

How can I create an alphanumeric Regex for all languages?

Tags:

language-agnostic

regex

unicode

non-english

tawfekov

1 Answers

amit

Related questions

Recent Activity

Donate For Us