I need to extract a HTML-Substring with JS which is position dependent. I store special characters HTML-encoded. For example: HTML <pre class="prettyprint"><code><div id="test">l&ouml;sen &amp; gr&uuml;&szlig;en</div> </code></pre> Text <pre class="prettyprint"><code>lösen & grüßen </code></pre> My problem lies in the JS-part, for example when I try to extract the fragment <code>lö</code>, which has the HTML-dependent starting position of <code>3</code> and the end position of <code>9</code> inside the <code><div></code> block. JS seems to convert some special characters internally so that the count from <code>3</code> to <code>9</code> is wrongly interpreted as "<code>lösen</code> " and not "<code>l&ouml;</code>". Other special characters like the <code>&amp;</code> are not affected by this. So my question is, if someone knows why JS is behaving in that way? Characters like <code>&auml;</code> or <code>&ouml;</code> are being converted while characters like <code>&amp;</code> or <code>&nbsp;</code> are plain. Is there any possibility to avoid this conversion? I've set up a fiddle to demonstrate this: JSFiddle Thanks for any help! EDIT: Maybe I've explained it a bit confusing, sorry for that. What I want is the HTML: <code>l&ouml;sen &amp; gr&uuml;&szlig;en</code> . Every special character should be unconverted, except the HTML-Tags. Like in the HTML above. But JS converts the <code>&ouml;</code> or <code>&uuml;</code> into <code>ö</code> or <code>ü</code> automatically, what I need to avoid.

That's because the browser (and not JavaScript) turns entities that don't need to be escaped in HTML into their respective Unicode characters (e.g. it skips <code>&amp;</code>, <code>&lt;</code> and <code>&gt;</code>). So by the time you inspect <code>.innerHTML</code>, it no longer contains exactly what was in the original page source; you could reverse this process, but it involves the full map of character <code><-></code> entity pairs which is just not practical.

JavaScript automatically converts some special characters

Tags:

javascript

jquery

character-encoding

I need to extract a HTML-Substring with JS which is position dependent. I store special characters HTML-encoded.

For example:

HTML

Click to copy

<div id="test"><p>l&ouml;sen &amp; gr&uuml;&szlig;en</p></div>

Text

Click to copy

lösen & grüßen

My problem lies in the JS-part, for example when I try to extract the fragment lö, which has the HTML-dependent starting position of 3 and the end position of 9 inside the <div> block. JS seems to convert some special characters internally so that the count from 3 to 9 is wrongly interpreted as "lösen " and not "lö". Other special characters like the & are not affected by this.

So my question is, if someone knows why JS is behaving in that way? Characters like ä or ö are being converted while characters like & or   are plain. Is there any possibility to avoid this conversion?

I've set up a fiddle to demonstrate this: JSFiddle

Thanks for any help!

EDIT:

Maybe I've explained it a bit confusing, sorry for that. What I want is the HTML:

lösen & grüßen .

Every special character should be unconverted, except the HTML-Tags. Like in the HTML above.

But JS converts the ö or ü into ö or ü automatically, what I need to avoid.

205

asked Nov 22 '12 13:11

noplacetoh1de

1 Answers

That's because the browser (and not JavaScript) turns entities that don't need to be escaped in HTML into their respective Unicode characters (e.g. it skips &, < and >).

So by the time you inspect .innerHTML, it no longer contains exactly what was in the original page source; you could reverse this process, but it involves the full map of character <-> entity pairs which is just not practical.

119

answered Oct 26 '22 17:10

Ja͢ck

Related questions
                            
                                Is there an acceptable cross-platform method for displaying a numeric keypad in standard web forms on a touch-based device?
                            
                                How to have search functionality in android using phonegap?
                            
                                Endlessly spinning image/div (cross-browser)
                            
                                Elliptic curve cryptography with SJCL in JS and OpenSSL in Ruby
                            
                                Create array of unique combinations from array of strings
                            
                                setting innerHTML with a script inside [duplicate]
                            
                                Dynamically add option to chosen select multiple JQuery plugin
                            
                                How can I load multiple optimized requirejs modules dynamically in a production env?
                            
                                HTML5 audio IE9 error: Unexpected call to method or property access
                            
                                Mongo --quiet Not Suppressing --eval output
                            
                                How I can save the image file generated by canvas tag with heatmap.js?
                            
                                Array.map() and D3 selection?
                            
                                Does Fancybox mute Click events?
                            
                                How can I create a new window object without iframes?
                            
                                backbone-style models in angularjs?
                            
                                Limiting Web Worker CPU Utilization?
                            
                                Rotation Matrix about arbitrary point
                            
                                Feature detection for ability to drop file over HTML file input
                            
                                How to find multivariable regression equation in javascript
                            
                                debugging node js garbage collection / memory problems with chrome

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With