DOMDocument breaks encoding?

Tags: php, encoding, domdocument

I run the following code:

$page = '<p>Ä</p>';
$DOM = new DOMDocument;
$DOM->loadHTML($page);
echo 'source:'.$page;
echo 'dom: '.$DOM->getElementsByTagName('p')->item(0)->textContent;

and it outputs the following:

source: Ä

dom: Ã

So I don't understand: why does the text's encoding break when it passes through DOMDocument?

662

asked Oct 01 '12 16:10

Mike

2 Answers

Here's a workaround that adds the proper encoding via meta header:

$DOM->loadHTML('<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />' . $page);

I'm not sure whether UTF-8 is the character set you're actually using, so adjust as necessary.

See also: domdocument character set issue
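For completeness, here is the workaround applied to the full snippet from the question. This is only a sketch: it assumes the source file is saved as UTF-8, and the `bin2hex` line is an addition of mine to make the raw bytes visible regardless of how the terminal or browser decodes them.

```php
<?php
// Assumption: this file is saved as UTF-8, so 'Ä' is the two bytes 0xC3 0x84.
$page = '<p>Ä</p>';

$DOM = new DOMDocument;

// Prepend a charset declaration so the HTML parser decodes the input as UTF-8
// instead of falling back to its default guess.
$DOM->loadHTML('<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />' . $page);

$text = $DOM->getElementsByTagName('p')->item(0)->textContent;

echo 'dom: ' . $text . PHP_EOL;
echo 'hex: ' . bin2hex($text) . PHP_EOL; // raw bytes of the extracted text
```

If the hex line shows the same bytes as the input character, the text made the round trip intact.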

132

answered Oct 25 '22 09:10

Ja͢ck

DOMDocument appears to be treating your UTF-8 input as ISO-8859-1. Read that way, the two bytes of Ä (0xC3 0x84) become two separate characters: Ã and a second one. Here's the catch: byte 0x84 is „ in Windows-1252, but in ISO-8859-1 it is only an unprintable control character. This is why you are seeing no second character in your output.

You can fix this by calling utf8_decode on the output of textContent, or using UTF-8 as your page's character encoding.
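The mangling and its reversal can be reproduced at the byte level without DOMDocument at all. The sketch below assumes the misread is "bytes taken as ISO-8859-1, then re-encoded as UTF-8", and it uses `mb_convert_encoding` (which needs the mbstring extension) in place of `utf8_decode`, because `utf8_decode` is deprecated as of PHP 8.2. The variable names are mine.

```php
<?php
// 'Ä' encoded as UTF-8 is the two bytes 0xC3 0x84.
$original = "\xC3\x84";

// Simulate the misread: treat each byte as one ISO-8859-1 character,
// then encode those two characters as UTF-8.
$misread = mb_convert_encoding($original, 'UTF-8', 'ISO-8859-1');
echo bin2hex($misread) . PHP_EOL;  // c383c284 (Ã followed by control char U+0084)

// Undo it: converting UTF-8 back to ISO-8859-1 restores the original bytes.
// This is the same conversion utf8_decode performs.
$restored = mb_convert_encoding($misread, 'ISO-8859-1', 'UTF-8');
echo bin2hex($restored) . PHP_EOL; // c384
```

Decoding after the fact only works while every character fits in ISO-8859-1, so declaring the encoding before loading is usually the more robust of the two fixes.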

answered Oct 25 '22 11:10

Niet the Dark Absol
