Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

PHP function to strip tags, except a list of whitelisted tags and attributes

I have to strip all HTML tags and attributes from a user input except the ones considered "safe" (ie, a white list approach).

strip_tags() strips all tags except the ones listed in the $allowable_tags parameter. But I also need to be able to strip all the not whitelisted attributes; for example, I want to allow the <b> tag, but I don't want to allow the onclick attribute for obvious reasons.

Is there a function to do that, or will I have to make my own?

like image 481
Thomas Bonini Avatar asked Dec 10 '22 14:12

Thomas Bonini


1 Answers

As far as I know, the strip_tags solution is about the fastest way to get rid of unwanted tags, and barring 3rd party packages, checking for allowable attributes would be quite easy in DOMDocument,

$string = strip_tags($string,'<b>');
$dom = new DOMDocument();
$dom->loadHTML($string);
$allowed_attributes = array('id');
foreach($dom->getElementsByTagName('*') as $node){
    for($i = $node->attributes->length -1; $i >= 0; $i--){
        $attribute = $node->attributes->item($i);
        if(!in_array($attribute->name,$allowed_attributes)) $node->removeAttributeNode($attribute);
    }
}
var_dump($dom->saveHTML());
like image 169
Wrikken Avatar answered Jan 04 '23 23:01

Wrikken