To match A to Z, we will use regex: <blockquote> [A-Za-z] </blockquote> How to allow regex to match utf8 characters entered by user? For example Chinese words like 环保部

What you are looking for are Unicode properties. e.g. <code>\p{L}</code> is any kind of letter from any language So a regex to match such a Chinese word could be something like <pre class="prettyprint"><code>\p{L}+ </code></pre> There are many such properties, for more details see regular-expressions.info Another option is to use the modifier <code>Pattern.UNICODE_CHARACTER_CLASS</code> In Java 7 there is a new property <code>Pattern.UNICODE_CHARACTER_CLASS</code> that enables the Unicode version of the predefined character classes see my answer here for some more details and links You could do something like this <pre class="prettyprint"><code>Pattern p = Pattern.compile("\\w+", Pattern.UNICODE_CHARACTER_CLASS); </code></pre> and <code>\w</code> would match all letters and all digits from any languages (and of course some word combining characters like <code>_</code>).

Java regex for support Unicode?

1 Answers

What you are looking for are Unicode properties.

e.g. \p{L} is any kind of letter from any language

So a regex to match such a Chinese word could be something like

\p{L}+

There are many such properties, for more details see regular-expressions.info

Another option is to use the modifier

Pattern.UNICODE_CHARACTER_CLASS

In Java 7 there is a new property Pattern.UNICODE_CHARACTER_CLASS that enables the Unicode version of the predefined character classes see my answer here for some more details and links

You could do something like this

Pattern p = Pattern.compile("\\w+", Pattern.UNICODE_CHARACTER_CLASS);

and \w would match all letters and all digits from any languages (and of course some word combining characters like _).

146

answered Oct 24 '22 16:10

stema

Related questions
                            
                                equivalent to push() or pop() for arrays?
                            
                                How do you create an asynchronous HTTP request in JAVA?
                            
                                Junit before class ( non static )
                            
                                In RxJava, how to pass a variable along when chaining observables?
                            
                                What is the default stack size, can it grow, how does it work with garbage collection?
                            
                                Java: Converting String to and from ByteBuffer and associated problems
                            
                                Access maven properties defined in the pom
                            
                                Is there a way in Java to determine if a path is valid without attempting to create a file?
                            
                                System.console() returns null
                            
                                READ_EXTERNAL_STORAGE permission for Android
                            
                                Storing UUID as base64 String
                            
                                What is a possible use case of BigInteger's .isProbablePrime()?
                            
                                Handling passwords used for auth in source code
                            
                                Which "if" construct is faster - statement or ternary operator?
                            
                                Code cleanup in netbeans
                            
                                How to create immutable objects in Java?
                            
                                Which comes first in a 2D array, rows or columns?
                            
                                What are the Java semantics of an escaped number in a character literal, e.g. '\15' ?
                            
                                Is there an equivalent of Scala's Either in Java 8?
                            
                                How can I "intercept" Ctrl+C in a CLI application?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Java regex for support Unicode?

Tags:

java

regex

unicode

cjk

cometta

People also ask

1 Answers

stema

Recent Activity

Donate For Us