How to remove php code from a string?

Tags: php, preg-replace

I have a string that contains PHP code, and I need to remove the PHP code from it. For example:

<?php $db1 = new ps_DB() ?><p>Dummy</p>

Should return <p>Dummy</p>

And a string with no PHP, for example <p>Dummy</p>, should be returned unchanged.

I know this can be done with a regular expression, but after four hours I haven't found a solution.

849

asked Jul 15 '10 18:07

Gonzalo

2 Answers

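<?php
// token_get_all() returns arrays for named tokens and plain strings for
// single characters; text outside PHP tags is tagged T_INLINE_HTML.
// Keep only that text and drop every other token.
function filter_html_tokens($a) {
    return is_array($a) && $a[0] === T_INLINE_HTML ? $a[1] : '';
}

$htmlphpstring = '<a>foo</a> something <?php $db1 = new ps_DB() ?><p>Dummy</p>';
echo implode('', array_map('filter_html_tokens', token_get_all($htmlphpstring)));
?>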

As ircmaxell pointed out: this would require valid PHP!

A regex route would be the following. It allows for short tags (no 'php' after the opening tag) and for a missing closing ?> at the end of the string or file (for some reason Zend recommends leaving it out), and of course it uses an ungreedy, DOTALL pattern:

preg_replace('/<\\?.*(\\?>|$)/Us', '',$htmlphpstring);
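
To make the behaviour of that pattern concrete, here is a minimal sketch that wraps it in a function and runs it on the strings from the question. The function name strip_php_regex is made up for illustration; only the pattern itself comes from the answer.

```php
<?php
// Hypothetical helper name; the pattern is the one shown above.
function strip_php_regex(string $input): string
{
    // U = ungreedy, s = the dot also matches newlines.
    // Each match runs from an opening tag to the first closing tag,
    // or to the end of the string if no closing tag follows.
    // preg_replace() returns null on a PCRE error; fall back to the input then.
    return preg_replace('/<\\?.*(\\?>|$)/Us', '', $input) ?? $input;
}

echo strip_php_regex('<?php $db1 = new ps_DB() ?><p>Dummy</p>'), "\n"; // <p>Dummy</p>
echo strip_php_regex('<p>Dummy</p>'), "\n";                             // <p>Dummy</p>
echo strip_php_regex('foo<?php echo $bar'), "\n";                       // foo
```

Two things to keep in mind with this route: the pattern also removes an XML declaration such as <?xml version="1.0"?>, and it stops at the first ?> it sees, even when that ?> sits inside a PHP string literal.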

120

answered Sep 26 '22 03:09

Wrikken

Well, you can use DomDocument to do it...

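function stripPHPFromHTML($html) {
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    removeProcessingInstructions($dom);
    // Serialize the first element inside <body> back to markup.
    $simple = simplexml_import_dom($dom->getElementsByTagName('body')->item(0));
    return $simple->children()->asXML();
}

function removeProcessingInstructions(DOMNode $node) {
    // childNodes is a live list, so copy it before removing anything;
    // otherwise the sibling after a removed node gets skipped.
    foreach (iterator_to_array($node->childNodes) as $child) {
        if ($child instanceof DOMProcessingInstruction) {
            $node->removeChild($child);
        } elseif ($child->hasChildNodes()) {
            removeProcessingInstructions($child);
        }
    }
}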

Those two functions will turn $str into the equivalent of $html:

$str = '<?php echo "foo"; ?><b>Bar</b>';
$clean = stripPHPFromHTML($str);
$html = '<b>Bar</b>';

Edit: Actually, after looking at Wrikken's answer, I realized that both methods have a disadvantage. Mine requires somewhat valid HTML markup (DOM is decent, but it won't parse foo<?php echo $bar). Wrikken's requires valid PHP (any syntax errors and it'll fail). So perhaps use a combination of the two: try one first, and if it fails, try the other. If both fail, there's really not much you can do without figuring out the exact reason they failed.
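
The "try one, then fall back" idea could look roughly like the sketch below. Note the assumptions: it pairs the tokenizer with Wrikken's regex rather than with DOMDocument, the function name strip_php is made up, and it relies on the TOKEN_PARSE flag (PHP 7.0+), which makes token_get_all() throw a ParseError on invalid PHP so that a failure can be detected.

```php
<?php
// Hypothetical helper: tokenizer first, regex as a fallback.
function strip_php(string $input): string
{
    try {
        // With TOKEN_PARSE the tokenizer also runs the parser and
        // throws ParseError when the embedded PHP is not valid.
        $tokens = token_get_all($input, TOKEN_PARSE);
    } catch (ParseError $e) {
        // Invalid PHP: fall back to the ungreedy, DOTALL regex.
        return preg_replace('/<\\?.*(\\?>|$)/Us', '', $input) ?? $input;
    }

    // Valid PHP: keep only the text that sits outside PHP tags.
    $html = '';
    foreach ($tokens as $token) {
        if (is_array($token) && $token[0] === T_INLINE_HTML) {
            $html .= $token[1];
        }
    }
    return $html;
}

// Valid PHP goes through the tokenizer.
echo strip_php('<a>foo</a> something <?php $db1 = new ps_DB() ?><p>Dummy</p>'), "\n";
// Invalid PHP (missing semicolon at end of input) takes the regex path.
echo strip_php('foo<?php echo $bar'), "\n";
```

The tokenizer goes first here because it understands PHP strings and comments, so a ?> inside a string literal does not confuse it; the regex only runs when the tokenizer cannot cope.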

answered Sep 26 '22 03:09

ircmaxell
