
innerHTML converts CDATA to comments

Q: Does HTML support CDATA?

A CDATA section has the form <![CDATA[ … ]]>. The only sequence that is not allowed within a CDATA section is its own closing sequence, ]]>. Note: CDATA sections should not be used within HTML; they are treated as comments and not displayed.
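The restriction on the closing sequence matters when you generate CDATA for XML output yourself. Below is a minimal sketch of the usual workaround, which splits each `]]>` across two adjacent sections; `wrapInCDATA` is a hypothetical helper name, not part of any DOM API.

```javascript
// Hypothetical helper: wrap arbitrary text in CDATA section(s) for XML output.
// "]]>" cannot appear inside a section, so each occurrence is split:
// "]]" ends one section and ">" starts the next.
function wrapInCDATA(text) {
  return "<![CDATA[" + text.split("]]>").join("]]]]><![CDATA[>") + "]]>";
}

console.log(wrapInCDATA("a ]]> b"));
// prints: <![CDATA[a ]]]]><![CDATA[> b]]>
```

An XML parser reads the two adjacent sections back as the single text run `a ]]> b`.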

Tags:

javascript

html

dom

cdata

I'm trying to insert some HTML into a page using javascript, and the HTML I'm inserting contains CDATA blocks.

I'm finding, in Firefox and Chrome, that the CDATA is getting converted to a comment.

The HTML is not under my control, so it's difficult for me to avoid using CDATA.

The following test case, when there is a div on the page with id "test":

document.getElementById('test').innerHTML = '<![CDATA[foo]]> bar'

causes the following HTML to be appended to the 'test' div:

<!--[CDATA[foo]]--> bar

Is there any way I can insert, verbatim, HTML containing CDATA into a document using javascript?


asked Aug 15 '11 13:08

Rich

1 Answer

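document.createCDATASection should do it in XML documents (in HTML documents it throws a NotSupportedError), but the real answer to your question is that, although HTML5 does have CDATA sections, they are only allowed inside foreign content such as SVG and MathML, and cross-browser support for them is pretty spotty.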

EDIT

The CDATA sections just aren't in the HTML 4 definition, so most browsers won't recognize them.

But it doesn't require a full DOM parser. Here's a simple lexical solution that will fix the problem.

function htmlWithCDATASectionsToHtmlWithout(html) {
    var ATTRS = "(?:[^>\"\']|\"[^\"]*\"|\'[^\']*\')*",
        // names of tags with RCDATA or CDATA content.
        SCRIPT = "[sS][cC][rR][iI][pP][tT]",
        STYLE = "[sS][tT][yY][lL][eE]",
        TEXTAREA = "[tT][eE][xX][tT][aA][rR][eE][aA]",
        TITLE = "[tT][iI][tT][lL][eE]",
        XMP = "[xX][mM][pP]",
        SPECIAL_TAG_NAME = [SCRIPT, STYLE, TEXTAREA, TITLE, XMP].join("|"),
        ANY = "[\\s\\S]*?",
        AMP = /&/g,
        LT = /</g,
        GT = />/g;
    return html.replace(new RegExp(
        // Entities and text
        "[^<]+" +
        // Comment
        "|<!--"+ANY+"-->" +
        // Regular tag
        "|<\/?(?!"+SPECIAL_TAG_NAME+")[a-zA-Z]"+ATTRS+">" +
        // Special tags
        "|<\/?"+SCRIPT  +"\\b"+ATTRS+">"+ANY+"<\/"+SCRIPT  +"\\s*>" +
        "|<\/?"+STYLE   +"\\b"+ATTRS+">"+ANY+"<\/"+STYLE   +"\\s*>" +
        "|<\/?"+TEXTAREA+"\\b"+ATTRS+">"+ANY+"<\/"+TEXTAREA+"\\s*>" +
        "|<\/?"+TITLE   +"\\b"+ATTRS+">"+ANY+"<\/"+TITLE   +"\\s*>" +
        "|<\/?"+XMP     +"\\b"+ATTRS+">"+ANY+"<\/"+XMP     +"\\s*>" +
        // CDATA section.  Content in capturing group 1.
        "|<!\\[CDATA\\[("+ANY+")\\]\\]>" +
        // A loose less-than
        "|<", "g"),

        function (token, cdataContent) {
          return "string" === typeof cdataContent
              ? cdataContent.replace(AMP, "&amp;").replace(LT, "&lt;")
                .replace(GT, "&gt;")
              : token === "<"
              ? "&lt;"  // Normalize loose less-thans.
              : token;
        });
}

Given

<b>foo</b><![CDATA[<i>bar</i>]]>

it produces

<b>foo</b>&lt;i&gt;bar&lt;/i&gt;

and given something that looks like a CDATA section inside a script or other special tag or comment, it correctly does not muck with it:

<script>/*<![CDATA[*/foo=bar<baz&amp;//]]></script><![CDATA[fish: <><]]>

becomes

<script>/*<![CDATA[*/foo=bar<baz&amp;//]]></script>fish: &lt;&gt;&lt;
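For readers who find the hand-written character classes hard to follow, here is a condensed sketch of the same lexical idea. It is not the function above: it assumes the `i` regex flag is acceptable (which also makes the `CDATA` keyword match case-insensitively, unlike the original), and it tries the special tags before the generic tag pattern instead of using a negative lookahead. The name `cdataToEscapedText` is made up for illustration.

```javascript
// Hypothetical condensed variant; behavior may differ from the original on edge cases.
function cdataToEscapedText(html) {
  var ATTRS = "(?:[^>\"']|\"[^\"]*\"|'[^']*')*";
  // Tags whose content is raw text (RCDATA or CDATA) and is skipped as a whole.
  var SPECIAL = ["script", "style", "textarea", "title", "xmp"];
  var specialAlternatives = SPECIAL.map(function (name) {
    return "<" + name + "\\b" + ATTRS + ">[\\s\\S]*?<\\/" + name + "\\s*>";
  });
  var pattern = new RegExp(
    [
      "[^<]+",                            // entities and text
      "<!--[\\s\\S]*?-->"                 // comments
    ].concat(specialAlternatives, [
      "<\\/?[a-z]" + ATTRS + ">",         // any other tag
      "<!\\[CDATA\\[([\\s\\S]*?)\\]\\]>", // CDATA section, content in group 1
      "<"                                 // a loose less-than
    ]).join("|"),
    "gi");
  return html.replace(pattern, function (token, cdataContent) {
    if (typeof cdataContent === "string") {
      // Escape the CDATA content so it renders as plain text.
      return cdataContent.replace(/&/g, "&amp;")
                         .replace(/</g, "&lt;")
                         .replace(/>/g, "&gt;");
    }
    return token === "<" ? "&lt;" : token; // normalize loose less-thans
  });
}

console.log(cdataToEscapedText("<b>foo</b><![CDATA[<i>bar</i>]]>"));
// prints: <b>foo</b>&lt;i&gt;bar&lt;/i&gt;
```

The result can then be assigned to `innerHTML` as in the question; the CDATA content shows up as escaped text rather than as a comment.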

answered Sep 30 '22 12:09

Mike Samuel
