Unescape HTML entities in JavaScript?

Tags:

I have some JavaScript code that communicates with an XML-RPC backend. The XML-RPC returns strings of the form:

<img src='myimage.jpg'>

However, when I use the JavaScript to insert the strings into HTML, they render literally. I don't see an image, I literally see the string:

<img src='myimage.jpg'>

My guess is that the HTML is being escaped over the XML-RPC channel.

How can I unescape the string in JavaScript? I tried the techniques on this page, unsuccessfully: http://paulschreiber.com/blog/2008/09/20/javascript-how-to-unescape-html-entities/

What are other ways to diagnose the issue?

874

asked Dec 16 '09 05:12

Joseph Turian

2 Answers

Most answers given here have a huge disadvantage: if the string you are trying to convert isn't trusted then you will end up with a Cross-Site Scripting (XSS) vulnerability. For the function in the accepted answer, consider the following:

htmlDecode("<img src='dummy' onerror='alert(/xss/)'>");

The string here contains an unescaped HTML tag, so instead of decoding anything the htmlDecode function will actually run JavaScript code specified inside the string.

This can be avoided by using DOMParser which is supported in all modern browsers:

function htmlDecode(input) {    var doc = new DOMParser().parseFromString(input, "text/html");    return doc.documentElement.textContent;  }    console.log(  htmlDecode("&lt;img src='myimage.jpg'&gt;")  )      // "<img src='myimage.jpg'>"    console.log(  htmlDecode("<img src='dummy' onerror='alert(/xss/)'>")  )    // ""

This function is guaranteed to not run any JavaScript code as a side-effect. Any HTML tags will be ignored, only text content will be returned.

Compatibility note: Parsing HTML with DOMParser requires at least Chrome 30, Firefox 12, Opera 17, Internet Explorer 10, Safari 7.1 or Microsoft Edge. So all browsers without support are way past their EOL and as of 2017 the only ones that can still be seen in the wild occasionally are older Internet Explorer and Safari versions (usually these still aren't numerous enough to bother).

120

answered Oct 09 '22 22:10

Wladimir Palant

Do you need to decode all encoded HTML entities or just & itself?

If you only need to handle & then you can do this:

var decoded = encoded.replace(/&amp;/g, '&');

If you need to decode all HTML entities then you can do it without jQuery:

var elem = document.createElement('textarea'); elem.innerHTML = encoded; var decoded = elem.value;

Please take note of Mark's comments below which highlight security holes in an earlier version of this answer and recommend using textarea rather than div to mitigate against potential XSS vulnerabilities. These vulnerabilities exist whether you use jQuery or plain JavaScript.

answered Oct 09 '22 20:10

LukeH

Related questions
                            
                                All falsey values in JavaScript
                            
                                How to call a REST web service API from JavaScript?
                            
                                Is Node.js native Promise.all processing in parallel or sequentially?
                            
                                How to add external JS scripts to VueJS Components?
                            
                                Checking length of dictionary object [duplicate]
                            
                                How to embed an autoplaying YouTube video in an iframe?
                            
                                How can I bind to the change event of a textarea in jQuery?
                            
                                How do I pre-populate a jQuery Datepicker textbox with today's date?
                            
                                Why powershell does not run Angular commands? [duplicate]
                            
                                Where do you include the jQuery library from? Google JSAPI? CDN?
                            
                                What is the difference between object keys with quotes and without quotes?
                            
                                Force DOM redraw/refresh on Chrome/Mac
                            
                                How to get the title of HTML page with JavaScript?
                            
                                animating addClass/removeClass with jQuery
                            
                                How to use format() on a moment.js duration?
                            
                                What is the difference between & vs @ and = in angularJS
                            
                                How to split comma separated string using JavaScript? [duplicate]
                            
                                How to sort an array of objects by multiple fields?
                            
                                Parse string to date with moment.js
                            
                                Node.js Port 3000 already in use but it actually isn't?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Unescape HTML entities in JavaScript?

Tags:

javascript

html

escaping

xml-rpc

Joseph Turian

People also ask

2 Answers

Wladimir Palant

LukeH

Recent Activity

Donate For Us