How to detect whether text is in Arabic or Urdu

Tags: java, regex, android, arabic, urdu

I want to know whether a text contains any Urdu or Arabic letters. I am using the condition below, but it produces false results when the text contains special characters. What is the right way to do this? Is there a library, or what is the correct regex for it?

    if (cap.replaceAll("\\s+", "").matches("[A-Za-z]+")
            || cap.replaceAll("\\s+", "").matches("[A-Za-z0-9]+")) {
        Log.d("isUrdu", "false");
        caption.setTypeface(Typeface.DEFAULT);
        caption.setTextSize(16);
    } else {
        Log.d("isUrdu", "True");
        /* if (Build.VERSION.SDK_INT > Build.VERSION_CODES.JELLY_BEAN_MR1) { */
        caption.setTypeface(typeface);
        caption.setTextSize(20);
        /* } */
    }

asked Oct 03 '16 10:10

Usman Saeed

1 Answer

According to the Wikipedia article on the Urdu alphabet, Urdu text uses the following Unicode ranges:

U+0600 to U+06FF
U+0750 to U+077F
U+FB50 to U+FDFF
U+FE70 to U+FEFF
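The four ranges above coincide with the Unicode blocks Arabic, Arabic Supplement, Arabic Presentation Forms-A and Arabic Presentation Forms-B, so a regex-free check is also possible. The following is only a minimal sketch: the class name `ArabicBlockCheck` and the method `containsArabicScript` are hypothetical names chosen for illustration, and the loop walks code points by hand so that it does not depend on newer stream APIs.

```java
class ArabicBlockCheck {

    // Returns true if at least one code point of the text falls into one of
    // the four Unicode blocks that correspond to the ranges listed above.
    static boolean containsArabicScript(String text) {
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            Character.UnicodeBlock block = Character.UnicodeBlock.of(cp);
            if (block == Character.UnicodeBlock.ARABIC                          // U+0600 to U+06FF
                    || block == Character.UnicodeBlock.ARABIC_SUPPLEMENT        // U+0750 to U+077F
                    || block == Character.UnicodeBlock.ARABIC_PRESENTATION_FORMS_A  // U+FB50 to U+FDFF
                    || block == Character.UnicodeBlock.ARABIC_PRESENTATION_FORMS_B) { // U+FE70 to U+FEFF
                return true;
            }
            i += Character.charCount(cp); // advance by one code point
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(containsArabicScript("\u0633\u0644\u0627\u0645")); // Arabic letters
        System.out.println(containsArabicScript("Hello, world! 123"));        // Latin, digits, punctuation
    }
}
```

Because the check looks for the presence of an Arabic-script character rather than the absence of Latin ones, punctuation and other special characters in the caption should not affect the result.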

To match an Arabic letter, you may use the \p{InArabic} Unicode block class, which covers the Arabic block (U+0600 to U+06FF).

So, you may use

if (cap.matches("(?s).*[\\u0600-\\u06FF\\u0750-\\u077F\\uFB50-\\uFDFF\\uFE70-\\uFEFF].*"))
{
    /* There is an Urdu character */
}
else if (cap.matches("(?s).*\\p{InArabic}.*"))
{
    /* The string contains an Arabic character.
       Note: \p{InArabic} is U+0600 to U+06FF, which the first class already
       covers, so this branch is not reached as written. */
}
else { /* Neither Arabic nor Urdu characters detected */ }

Note that (?s) enables the DOTALL modifier so that . can match line break characters, too.
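To illustrate why the modifier matters with matches(), here is a small sketch that runs the same pattern with and without (?s) on a caption containing a line break. The class name `DotallDemo` and its two helper methods are hypothetical and exist only for this illustration.

```java
class DotallDemo {

    // Same pattern as in the answer, without the inline DOTALL flag.
    static final String ARABIC_ANYWHERE = ".*\\p{InArabic}.*";

    // Without (?s), "." does not match line terminators, and matches()
    // must consume the whole input, so multi-line text cannot match.
    static boolean matchesWithoutDotall(String s) {
        return s.matches(ARABIC_ANYWHERE);
    }

    // With (?s), "." matches any character, including line terminators.
    static boolean matchesWithDotall(String s) {
        return s.matches("(?s)" + ARABIC_ANYWHERE);
    }

    public static void main(String[] args) {
        String caption = "line one\n\u0633\u0644\u0627\u0645"; // Latin line, then Arabic letters
        System.out.println(matchesWithoutDotall(caption));
        System.out.println(matchesWithDotall(caption));
    }
}
```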

For better performance with matches, you may use negated character classes instead of the first .*: "(?s)[^\\u0600-\\u06FF\\u0750-\\u077F\\uFB50-\\uFDFF\\uFE70-\\uFEFF]*[\\u0600-\\u06FF\\u0750-\\u077F\\uFB50-\\uFDFF\\uFE70-\\uFEFF].*" and "(?s)\\P{InArabic}*\\p{InArabic}.*" respectively.

Note that you may also use the shorter "[\\u0600-\\u06FF\\u0750-\\u077F\\uFB50-\\uFDFF\\uFE70-\\uFEFF]" and "\\p{InArabic}" patterns with Matcher#find().
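A minimal sketch of the Matcher#find() variant follows. The class name `ArabicScriptFinder` and the method `containsArabicScript` are hypothetical; the character class is the one given above. Compiling the pattern once and reusing it is a design choice for code that checks many captions, not something the approach requires.

```java
import java.util.regex.Pattern;

class ArabicScriptFinder {

    // Compiled once and reused; find() looks for a single matching character
    // anywhere in the input, so no leading or trailing ".*" and no (?s) is needed.
    private static final Pattern ARABIC_SCRIPT = Pattern.compile(
            "[\\u0600-\\u06FF\\u0750-\\u077F\\uFB50-\\uFDFF\\uFE70-\\uFEFF]");

    static boolean containsArabicScript(String text) {
        return ARABIC_SCRIPT.matcher(text).find();
    }

    public static void main(String[] args) {
        System.out.println(containsArabicScript("caption\n\u0627\u0631\u062F\u0648")); // contains Arabic-script letters
        System.out.println(containsArabicScript("@#$% plain text 42!"));               // special characters only
    }
}
```

In the question's code, the result of such a check could replace the Latin-only condition, with the custom typeface applied when the method returns true.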

answered Sep 21 '22 19:09

Wiktor Stribiżew
