Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Removing whitespace-characters, except inside quotation marks in PHP?

I need to remove all whitespace from string, but quotations should stay as they were.

Here's an example:

string to parse:
hola hola "pepsi cola" yay

output:
holahola"pepsi cola"yay

Any idea? I'm sure this can be done with regex, but any solution is okay.

like image 690
Martti Laine Avatar asked Jan 22 '23 08:01

Martti Laine


2 Answers

We could match strings or quotations with

[^\s"]+|"[^"]*"

So we just need to preg_match_all and concatenate the result.


Example:

$str = 'hola hola "pepsi cola" yay';

preg_match_all('/[^\s"]+|"[^"]*"/', $str, $matches);

echo implode('', $matches[0]);
// holahola"pepsi cola"yay
like image 143
kennytm Avatar answered Jan 23 '23 22:01

kennytm


Martti, resurrecting this question because it had a simple solution that lets you do the replace in one go—no need for implode. (Found your question while doing some research for a general question about how to exclude patterns in regex.)

Here's our simple regex:

"[^"]*"(*SKIP)(*F)|\s+

The left side of the alternation matches complete "quoted strings" then deliberately fails. The right side matches whitespace characters, and we know they are the right whitespace characters because they were not matched by the expression on the left.

This code shows how to use the regex (see the results at the bottom of the online demo):

<?php
$regex = '~"[^"]*"(*SKIP)(*F)|\s+~';
$subject = 'hola hola "pepsi cola" yay';
$replaced = preg_replace($regex,"",$subject);
echo $replaced."<br />\n";
?>

Reference

How to match (or replace) a pattern except in situations s1, s2, s3...

like image 21
zx81 Avatar answered Jan 23 '23 22:01

zx81