regex match any whitespace

Tags:

php

I want to make a replacement using regex and preg_replace function. this is my code

$verif = "/wordA(\s*)wordB(?! wordc)/i";
$replacement = 'wordA wordb wordc';
$newvar = preg_replace($verif, $replacement, $article->text);

That works if only we have one whitespace between wordA and wordB. I need to match what ever the number of whitespaces between wordA & wordB.

example:

wordA (10 or more whitespace) wordB -> wordA wordb wordc same wordA(1 whitespace) wordB -> wordA wordb wordc ...

269

asked Feb 23 '14 20:02

2 Answers

Your regex should work 'as-is'. Assuming that it is doing what you want it to.

wordA(\s*)wordB(?! wordc)

This means match wordA followed by 0 or more spaces followed by wordB, but do not match if followed by wordc. Note the single space between ?! and wordc which means that wordA wordB wordc will not match, but wordA wordB wordc will.

Here are some example matches and the associated replacement output:

enter image description here

Note that all matches are replaced no matter how many spaces. There are a couple of other points: -

(?! wordc) is a negative lookahead, so you wont match lines wordA wordB wordc which is assume is intended (and is why the last line is not matched). Currently you are relying on the space after ?! to match the whitespace. You may want to be more precise and use (?!\swordc). If you want to match against more than one space before wordc you can use (?!\s*wordc) for 0 or more spaces or (?!\s*+wordc) for 1 or more spaces depending on what your intention is. Of course, if you do want to match lines with wordc after wordB then you shouldn't use a negative lookahead.
* will match 0 or more spaces so it will match wordAwordB. You may want to consider + if you want at least one space.
(\s*) - the brackets indicate a capturing group. Are you capturing the whitespace to a group for a reason? If not you could just remove the brackets, i.e. just use \s.

Update based on comment

Hello the problem is not the expression but the HTML out put that are not considered as whitespace. it's a Joomla website.

Preserving your original regex you can use:

wordA((?:\s|&nbsp;)*)wordB(?!(?:\s|&nbsp;)wordc)

The only difference is that not the regex matches whitespace OR  . I replaced wordc with \swordc since that is more explicit. Note as I have already pointed out that the negative lookahead ?! will not match when wordB is followed by a single whitespace and wordc. If you want to match multiple whitespaces then see my comments above. I also preserved the capture group around the whitespace, if you don't want this then remove the brackets as already described above.

Example matches:

enter image description here

183

answered Sep 21 '22 13:09

acarlon

The reason I used a + instead of a '*' is because a plus is defined as one or more of the preceding element, where an asterisk is zero or more. In this case we want a delimiter that's a little more concrete, so "one or more" spaces.

word[Aa]\s+word[Bb]\s+word[Cc]

will match:

wordA wordB     wordC worda wordb wordc wordA   wordb   wordC

The words, in this expression, will have to be specific, and also in order (a, b, then c)

answered Sep 20 '22 13:09

mrres1

Related questions
                            
                                Using Docker I get the error: "SQLSTATE[HY000] [2002] No such file or directory"
                            
                                Apache 2.4 + PHP-FPM and Authorization headers
                            
                                Getting data posted in between two dates
                            
                                The CSRF token is invalid. Please try to resubmit the form
                            
                                Best way to list Alphabetical(A-Z) using PHP
                            
                                What is difference between PHP cli and PHP cgi?
                            
                                PHP Xdebug on OS X 10.9 Mavericks
                            
                                PHP json_encode class private members
                            
                                In PHP's Laravel, how do I clear laravel.log?
                            
                                How to route GET and POST for same pattern in Laravel?
                            
                                How to pass arguments to an included file?
                            
                                How to get only images using scandir in PHP?
                            
                                write utf-8 characters to file with fputcsv in php
                            
                                Why is PHP session_destroy() not working?
                            
                                Could not open input file: composer.phar
                            
                                PhantomJS hangs for a long time saying "Asynchronous Sessions clean-up phase starting NOW"
                            
                                How to implement Isotope with Pagination
                            
                                Using dependency injection over laravel facades
                            
                                In a PHP / Apache / Linux context, why exactly is chmod 777 dangerous?
                            
                                Does php have an equivalent to python's virtualenv or ruby's sandbox?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

regex match any whitespace

Tags:

regex

php

user3344311

People also ask

2 Answers

acarlon

mrres1

Recent Activity

Donate For Us