I'm using DOM to parse string. I need function that strips span tags and its contents. For example, if I have: <pre class="prettyprint"><code>This is some text that contains photo. photobyile </code></pre> I would like function to return <pre class="prettyprint"><code>This is some text that contains photo. </code></pre> This is what I tried: <pre class="prettyprint"><code> $dom = new domDocument; $dom->loadHTML($string); $dom->preserveWhiteSpace = false; $spans = $dom->getElementsByTagName('span'); foreach($spans as $span) { $naslov = $span->nodeValue; echo $naslov; $string = preg_replace("/$naslov/", " ", $string); } </code></pre> I'm aware that <code>$span->nodeValue</code> returns value of span tag and not whole tag, but I don't know how to get whole tag, together with class name. Thanks, Ile

Try removing the spans directly from the DOM tree. <pre class="prettyprint"><code>$dom = new DOMDocument(); $dom->loadHTML($string); $dom->preserveWhiteSpace = false; $elements = $dom->getElementsByTagName('span'); while($span = $elements->item(0)) { $span->parentNode->removeChild($span); } echo $dom->saveHTML(); </code></pre>

@ile - I've had that problem - it's because the index of the foreach iterator happily keeps incrementing, while calling removeChild() on the DOM also seems to remove the nodes from the DomNodeList ($spans). So for every span you remove, the nodelist shrinks one element and then gets its foreach counter incremented by one. Net result: it skips one span. I'm sure there is a more elegant way, but this is how I did it - I moved the references from the DomNodeList to a second array, where they would not be removed by the removeChild() operation. <pre class="prettyprint"><code> foreach($spans as $span) { $nodes[] = $span; } foreach($nodes as $span) { $span->parentNode->removeChild($span); } </code></pre>

Strip HTML tags and its contents

Tags:

html

dom

php

strip

ilija veselica

2 Answers

Try removing the spans directly from the DOM tree.

$dom = new DOMDocument();
$dom->loadHTML($string);
$dom->preserveWhiteSpace = false;

$elements = $dom->getElementsByTagName('span');
while($span = $elements->item(0)) {       
   $span->parentNode->removeChild($span);
}

echo $dom->saveHTML();

129

answered Sep 21 '22 05:09

Lukáš Lalinský

@ile - I've had that problem - it's because the index of the foreach iterator happily keeps incrementing, while calling removeChild() on the DOM also seems to remove the nodes from the DomNodeList ($spans). So for every span you remove, the nodelist shrinks one element and then gets its foreach counter incremented by one. Net result: it skips one span.

I'm sure there is a more elegant way, but this is how I did it - I moved the references from the DomNodeList to a second array, where they would not be removed by the removeChild() operation.

    foreach($spans as $span) {
        $nodes[] = $span;
    }
    foreach($nodes as $span) {
        $span->parentNode->removeChild($span);
    }

answered Sep 19 '22 05:09

kander

Related questions
                            
                                Set volume using php exec and amixer
                            
                                laravel whereHas results weird
                            
                                Preparing a MySQL INSERT/UPDATE statement with DEFAULT values
                            
                                Inherit static properties in subclass without redeclaration?
                            
                                PHP Multiple Concurrent Sessions Per User
                            
                                Redirect from embedded controller
                            
                                In PHP, what purpose does (unset) have?
                            
                                iconv UTF-8//IGNORE still produces "illegal character" error
                            
                                MSSQL Server's Native ODBC Driver for Linux and PHP 5.4
                            
                                From a performance perspective, how efficient is it to use a MySQL temporary table for a highly used website feature?
                            
                                PHP MySQL PDO: how to preserve leading zeros of zerofill int columns
                            
                                How to cache doctrine "findOneBy()" query with cache id and cache lifetime option in Symfony 2.4?
                            
                                Cancel pending AJAX requests in PHP app?
                            
                                Doctrine 2 - Log changes in manyToMany relation
                            
                                Laravel response Cache-Control headers always containing 'no-cache'
                            
                                How to copy the CSS and JS files in vendor folder of composer to public?
                            
                                how to pass variable to next page with cookies
                            
                                Laravel hasMany Many to Many To One Eloquent
                            
                                Delete the store details from PHP Application after uninstalling app from Shopify Store
                            
                                Make specific folders writeable in laravel coaster cms in google app engine

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With