I have a string like that (it's an empty paragraph) saved from my heavily edited and after-processed input from TinyMCE. That is how it looks like after echo, in HTML source code in browser: <pre class="prettyprint"><code> </code></pre> Now, I need to remove those empty paragraphs. I have already tried <pre class="prettyprint"><code>$output = str_ireplace(" ", "", $string); $output = preg_replace("/ <\/p>/", "", $string); $output = preg_replace("/[ \t\n\r]*<\/p>/", "", $string); $output = preg_replace("/[\s]*<\/p>/", "", $string); </code></pre> and many more variations with no luck. It's still there, intact. I have also tried mb_ereg_replace and matching <code>&nbsp;</code> which isn't apparently the case. On the other hand, this works: <pre class="prettyprint"><code>$output = preg_replace("/.*<\/p>/", "", $string); </code></pre> but of course striping also paragraphs with actual content. What else could that "space-like" character be? How am I supposed to match it? SOLVED Thanks to Ibizaman and this thread link, I've found the character. It is nbsp in unicode value. See http://unicodelookup.com/#160/1 This works: <pre class="prettyprint"><code>$output = preg_replace("/[\x{00A0}\s]*<\/p>/u", "", $string); </code></pre> As pointed by mcrumley, this might work even better: <pre class="prettyprint"><code>"/[\p{Zs}\s]*<\/p>/iu" </code></pre>

Since you don't know which character is being outputted, first parse the output of <code>$string</code> with functions outputting unicode values (see this SO question). Or, you can proceed the other way around and only accept well-formed paragraphs: <pre class="prettyprint"><code>$output = preg_replace("/([^a-zA-Z0-9]*<\/p>)/", "\1", $string); </code></pre> Disclaimer : I already put this in comments but since it solved the problem, it's better placed in an answer for future reference, I think.

Match special kind of whitespace

Tags:

regex

php

I have a string like that (it's an empty paragraph) saved from my heavily edited and after-processed input from TinyMCE.

That is how it looks like after echo, in HTML source code in browser:

Click to copy

<p> </p>

Now, I need to remove those empty paragraphs.

I have already tried

Click to copy

$output = str_ireplace("<p> </p>", "", $string);
$output = preg_replace("/<p> <\/p>/", "", $string);
$output = preg_replace("/<p>[ \t\n\r]*<\/p>/", "", $string);
$output = preg_replace("/<p>[\s]*<\/p>/", "", $string);

and many more variations with no luck. It's still there, intact. I have also tried mb_ereg_replace and matching   which isn't apparently the case.

On the other hand, this works:

Click to copy

$output = preg_replace("/<p>.*<\/p>/", "", $string);

but of course striping also paragraphs with actual content.

What else could that "space-like" character be? How am I supposed to match it?

SOLVED Thanks to Ibizaman and this thread link, I've found the character. It is nbsp in unicode value. See http://unicodelookup.com/#160/1

This works:

Click to copy

$output = preg_replace("/<p>[\x{00A0}\s]*<\/p>/u", "", $string);

As pointed by mcrumley, this might work even better:

Click to copy

"/<p>[\p{Zs}\s]*<\/p>/iu"

928

asked Nov 20 '13 13:11

Saix

2 Answers

You can use the Unicode character property to match all spaces. \p{Zs} is "Space separator" and includes space, non-breaking space, thin space, etc. You can also use \pZ to match all separators, including line separator and paragraph separator. See http://www.php.net/manual/en/regexp.reference.unicode.php for details.

Click to copy

$output = preg_replace("/<p>[\p{Zs}\s]*<\/p>/iu", "", $string);

174

answered Oct 12 '22 03:10

mcrumley

Since you don't know which character is being outputted, first parse the output of $string with functions outputting unicode values (see this SO question).

Or, you can proceed the other way around and only accept well-formed paragraphs:

Click to copy

$output = preg_replace("/(<p>[^a-zA-Z0-9]*<\/p>)/", "\1", $string);

Disclaimer : I already put this in comments but since it solved the problem, it's better placed in an answer for future reference, I think.

answered Oct 12 '22 01:10

ibizaman

Related questions
                            
                                PHP curl response differs based on network (encoding issue?)
                            
                                Wrong update limits mysql
                            
                                Global variable not working in php
                            
                                Codeigniter: share session with subdomain & sess_time_to_update
                            
                                Java equivalent of php pack('H*', str)
                            
                                PHP soapClient send custom XML
                            
                                Symfony2 Form Gives Catchable Error About FormView
                            
                                Include path problems installing PEAR/PHPUnit
                            
                                exporting 20k or more records from MY SQL to CSV using php [duplicate]
                            
                                What is in_addr?
                            
                                How to transfer money from PayPal account to another PayPal account via API
                            
                                Prevent loading pages with .php file extension (only load without it)
                            
                                Using WordPress admin_notices in a OOP plugin
                            
                                How to show a drop-down list with a pre-selected option
                            
                                Symfony2 get to the access_control parameters located in the security.yml
                            
                                Use yii2 for production
                            
                                How to set database time zone in application.ini
                            
                                Laravel's pivot table + Pivot table in general
                            
                                How to force https for prod but http for dev environment?
                            
                                When to use PHP template engines [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Match special kind of whitespace

Tags:

regex

php

Saix

People also ask

2 Answers

mcrumley

ibizaman

Recent Activity

Donate For Us