How to strip HTML tags from string in JavaScript? [duplicate]

People also ask

How do I strip text tags in HTML?

In order to strip out tags we can use replace() function and can also use . textContent property, . innerText property from HTML DOM.

How do I remove a tag from a string?

The HTML tags can be removed from a given string by using replaceAll() method of String class. We can remove the HTML tags from a given string by using a regular expression. After removing the HTML tags from a string, it will return a string as normal text.

How do you remove tags in HTML?

Approach: Select the HTML element which need to remove. Use JavaScript remove() and removeChild() method to remove the element from the HTML document.

How remove HTML tag from string in react?

To remove html tags from string in react js, just use the /(<([^>]+)>)/ig regex with replace() method it will remove tags with their attribute and return new string.

cleanText = strInputCode.replace(/<\/?[^>]+(>|$)/g, "");

Distilled from this website (web.achive).

This regex looks for <, an optional slash /, one or more characters that are not >, then either > or $ (the end of the line)

Examples:

'<div>Hello</div>' ==> 'Hello'
 ^^^^^     ^^^^^^
'Unterminated Tag <b' ==> 'Unterminated Tag '
                  ^^

But it is not bulletproof:

'If you are < 13 you cannot register' ==> 'If you are '
            ^^^^^^^^^^^^^^^^^^^^^^^^
'<div data="score > 42">Hello</div>' ==> ' 42">Hello'
 ^^^^^^^^^^^^^^^^^^          ^^^^^^

If someone is trying to break your application, this regex will not protect you. It should only be used if you already know the format of your input. As other knowledgable and mostly sane people have pointed out, to safely strip tags, you must use a parser.

If you do not have acccess to a convenient parser like the DOM, and you cannot trust your input to be in the right format, you may be better off using a package like sanitize-html, and also other sanitizers are available.

Using the browser's parser is the probably the best bet in current browsers. The following will work, with the following caveats:

Your HTML is valid within a <div> element. HTML contained within <body> or <html> or <head> tags is not valid within a <div> and may therefore not be parsed correctly.
textContent (the DOM standard property) and innerText (non-standard) properties are not identical. For example, textContent will include text within a <script> element while innerText will not (in most browsers). This only affects IE <=8, which is the only major browser not to support textContent.
The HTML does not contain <script> elements.
The HTML is not null
The HTML comes from a trusted source. Using this with arbitrary HTML allows arbitrary untrusted JavaScript to be executed. This example is from a comment by Mike Samuel on the duplicate question: <img onerror='alert(\"could run arbitrary JS here\")' src=bogus>

Code:

var html = "<p>Some HTML</p>";
var div = document.createElement("div");
div.innerHTML = html;
var text = div.textContent || div.innerText || "";

var html = "<p>Hello, <b>World</b>";
var div = document.createElement("div");
div.innerHTML = html;
alert(div.innerText); // Hello, World

That pretty much the best way of doing it, you're letting the browser do what it does best -- parse HTML.

Edit: As noted in the comments below, this is not the most cross-browser solution. The most cross-browser solution would be to recursively go through all the children of the element and concatenate all text nodes that you find. However, if you're using jQuery, it already does it for you:

alert($("<p>Hello, <b>World</b></p>").text());

Check out the text method.

I know this question has an accepted answer, but I feel that it doesn't work in all cases.

For completeness and since I spent too much time on this, here is what we did: we ended up using a function from php.js (which is a pretty nice library for those more familiar with PHP but also doing a little JavaScript every now and then):

http://phpjs.org/functions/strip_tags:535

It seemed to be the only piece of JavaScript code which successfully dealt with all the different kinds of input I stuffed into my application. That is, without breaking it – see my comments about the <script /> tag above.

Related questions
                            
                                Named capturing groups in JavaScript regex?
                            
                                React component initialize state from props
                            
                                How to trim a string to N chars in Javascript?
                            
                                ESLint Parsing error: Unexpected token
                            
                                How to test if a string is JSON or not?
                            
                                How to prevent a click on a '#' link from jumping to top of page?
                            
                                Random alpha-numeric string in JavaScript?
                            
                                How to remove all listeners in an element? [duplicate]
                            
                                React Context vs React Redux, when should I use each one? [closed]
                            
                                How to include JavaScript file or library in Chrome console?
                            
                                How can I round down a number in Javascript?
                            
                                In JavaScript, why is "0" equal to false, but when tested by 'if' it is not false by itself?
                            
                                Do something if screen width is less than 960 px
                            
                                Return string without trailing slash
                            
                                How to implement "select all" check box in HTML?
                            
                                Detect IE version (prior to v9) in JavaScript
                            
                                TypeScript: Creating an empty typed container array
                            
                                Write / add data in JSON file using Node.js
                            
                                How does "this" keyword work within a function?
                            
                                jQuery remove options from select

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to strip HTML tags from string in JavaScript? [duplicate]

Tags:

javascript

html-parsing

People also ask

Recent Activity

Donate For Us