preg_match and (non-English) Latin characters?

I have an XHTML form where I ask people to enter their full name. I then validate the input with preg_match() using this pattern: /^[\p{L}\s]+$/

On my local server running PHP 5.2.13 (PCRE 7.9 2009-04-11) this works fine. On the webhost running PHP 5.2.10 (PCRE 7.3 2007-08-28) it doesn't match when the entered string contains the Danish Latin character ø ( http://www.ltg.ed.ac.uk/~richard/utf-8.cgi?input=%F8&mode=char ).

Is this a bug? Is there a workaround?

Thank you in advance!

asked Mar 24 '11 19:03

Jonas Delfs

1 Answer

So, the problem is as presumed: you are not using the /u modifier. Without it, PCRE treats the pattern and the subject as sequences of single bytes instead of decoding them as UTF-8.

In any case, this is how it should be done:

var_dump(preg_match('/^[\p{L}\s]+$/u', "ø"));

This works on all the PHP versions I have. There might be a bug in others, but that's not likely here.

Your problem is that this also works:

var_dump(preg_match('/^[\p{L}\s]+$/', utf8_decode("ø")));

Notice that this uses ISO-8859-1 instead of UTF-8 and leaves out the /u modifier. The result is int(1). Evidently PCRE treats the Latin-1 byte for ø as matching \p{L} when it is not in /u (Unicode) mode. (Most of the single-byte values \xA0-\xFF are letters in Latin-1, and each 8-bit code point is the same as in Unicode, so that's actually OK.)
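To make the byte-level difference concrete, here is a small sketch. The variable names are only for illustration, and the strings are written as hex escapes so the result does not depend on the encoding of the source file.

```php
<?php
// "\xC3\xB8" is ø encoded as UTF-8 (two bytes);
// "\xF8" is ø encoded as ISO-8859-1 (one byte).
$utf8   = "\xC3\xB8";
$latin1 = "\xF8";

var_dump(bin2hex($utf8));    // string(4) "c3b8"
var_dump(bin2hex($latin1));  // string(2) "f8"

// With /u, PCRE decodes the subject as UTF-8 and sees one letter, U+00F8.
$utf8WithU = preg_match('/^[\p{L}\s]+$/u', $utf8);
var_dump($utf8WithU);        // int(1)

// With /u, a lone \xF8 byte is not valid UTF-8, so preg_match() fails
// and returns false instead of 0 or 1.
$latin1WithU = preg_match('/^[\p{L}\s]+$/u', $latin1);
var_dump($latin1WithU);      // bool(false)
var_dump(preg_last_error() === PREG_BAD_UTF8_ERROR); // bool(true)
```

So the same pattern can succeed or fail depending on which encoding the bytes are really in, and a false return with /u is a strong hint that the input is not UTF-8.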

Conclusion: Your input is actually ISO-8859-1. That's why it accidentally worked for you without the /u. Change that, and be exact with input charsets.
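One way to be exact about the charset is to normalize the input to UTF-8 before matching. The following is only a sketch, not part of the original answer: isValidFullName() is a hypothetical helper, it assumes PHP 7+ with the mbstring extension, and it assumes that any input which is not valid UTF-8 is ISO-8859-1, which may not hold for your form.

```php
<?php
// Hypothetical helper: validates a full name after normalizing it to UTF-8.
// Assumption: input that is not valid UTF-8 is in $assumedLegacyCharset.
function isValidFullName(string $input, string $assumedLegacyCharset = 'ISO-8859-1'): bool
{
    // If the bytes do not form valid UTF-8, convert from the assumed charset.
    if (!mb_check_encoding($input, 'UTF-8')) {
        $input = mb_convert_encoding($input, 'UTF-8', $assumedLegacyCharset);
    }

    // Same pattern as in the question, with /u so PCRE decodes UTF-8.
    return preg_match('/^[\p{L}\s]+$/u', $input) === 1;
}

var_dump(isValidFullName("J\xC3\xB8rgen Hansen")); // UTF-8 input
var_dump(isValidFullName("J\xF8rgen Hansen"));     // ISO-8859-1 input
var_dump(isValidFullName("J0rgen"));               // contains a digit
```

Guessing the charset this way is a fallback. It is more reliable to make sure the page and the form are actually served and submitted as UTF-8, so that no conversion is needed.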

answered Oct 01 '22 20:10

mario

Tags:

php

character-encoding

expression

preg-match
