Getting raw text content of HTML element with HTML uninterpreted

Tags: javascript, html

I have Googled my brains out and can't figure out how to make this work. Here is what I'm trying to do:

HTML:

<div id=derp>&quot;Hi, my name is..&quot;</div>

Javascript:

var div = document.getElementById('derp');
alert(div.innerHTML);
alert(div.innerText);
alert(div.textContent);

All of those alerts interpret the &quot; entity and return it as " in the resulting string. I want to get the raw text with &quot; left uninterpreted.

They all return:

"Hi, my name is.."

When I want to get:

&quot;Hi, my name is..&quot;

Is there a way to do this? Preferably without trying to use a regex to replace every instance of " with &quot;.

It's kind of a long story of what I'm trying to do, but simply using replace() to search and replace every instance of " would be a headache to implement because of other regex matching/parsing that needs to occur.

Thanks in advance for any Javascript wizards who can save my sanity!


asked Mar 14 '13 20:03

Trey

2 Answers

To quote bobince:

When you ask the browser for an element node's innerHTML, it doesn't give you the original HTML source that was parsed to produce that node, because it no longer has that information. Instead, it generates new HTML from the data stored in the DOM. The browser decides on how to format that HTML serialisation; different browsers produce different HTML, and chances are it won't be the same way you formatted it originally.

In summary: none of innerHTML, innerText, text, textContent, or nodeValue will give you the unparsed source text.

The only ways to get it are to re-encode the characters yourself (for example with a regex replace), or to request the page's own source again via Ajax and read it from there, which is bad practice.
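The re-encoding route can be sketched as below. This is only an illustration: encodeEntities and NAMED_ENTITIES are hypothetical names, not part of any library, and the result is a fresh encoding of the parsed text, so it cannot tell whether the original source wrote &quot; or a literal ".

```javascript
// Hypothetical lookup table: characters that HTML commonly writes as named entities.
const NAMED_ENTITIES = {
  '&': '&amp;',
  '<': '&lt;',
  '>': '&gt;',
  '"': '&quot;',
  "'": '&#39;'
};

// Hypothetical helper: re-encode already-parsed text back into entity form.
function encodeEntities(text) {
  return String(text).replace(/[&<>"']/g, function (ch) {
    return NAMED_ENTITIES[ch];
  });
}

// In a browser the input would come from the DOM, for example:
//   encodeEntities(document.getElementById('derp').textContent)
const encoded = encodeEntities('"Hi, my name is.."');
console.log(encoded); // &quot;Hi, my name is..&quot;
```

Because the replacement runs in a single pass over the parsed text, the ampersands it emits are not escaped a second time.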


answered Sep 18 '22 23:09

gkiely

I prepared some days ago a bin with some different approaches: http://jsbin.com/urazer/4/edit

My favorite:

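// Sample input containing the characters the regex below escapes.
var text = "<a href='#' title=\"Foo\"></a>";
// Replace each special character with its numeric character reference, e.g. " becomes &#34;.
var html = text.replace(/[<&>'"]/g, function(c) {
  return "&#" + c.charCodeAt(0) + ";";
});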
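
Note that this replacement emits numeric character references (&#34;) rather than the named &quot; entity the question asks for; both decode to the same character. A minimal sketch applying the same regex to the string from the question, where toNumericEntities is a hypothetical wrapper name:

```javascript
// Hypothetical wrapper around the replace() call shown above.
function toNumericEntities(text) {
  return text.replace(/[<&>'"]/g, function (c) {
    // c is a single matched character, so index 0 is its code unit.
    return "&#" + c.charCodeAt(0) + ";";
  });
}

var result = toNumericEntities('"Hi, my name is.."');
console.log(result); // &#34;Hi, my name is..&#34;
```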

answered Sep 19 '22 23:09

yckart

