I am creating a file that is to be saved on a local user's computer (not rendered in a web browser). I am currently using <code>html_entity_decode</code>, but this isn't converting characters like <code>&#8211;</code> (which is the n-dash) and was wondering what other function I should be using. For example, when the file is imported into the software, instead of the ndash or just a - it shows up as <code>&#8211;</code>. I know I could use <code>str_replace</code>, but if it's happening with this character, it could happen with many others since the data is dynamic.

You need to define the target character set. <code>&#8211;</code> is not a valid character in the default ISO-8859-1 character set, so it's not decoded. Define UTF-8 as the output charset and it will decode: <pre class="prettyprint"><code>echo html_entity_decode('&#8211;', ENT_NOQUOTES, 'UTF-8'); </code></pre> If at all possible, you should avoid HTML entities to begin with. I don't know where that encoded data comes from, but if you're storing it like this in the database or elsewhere, you're doing it wrong. Always store data UTF-8 encoded and only convert to HTML entities or otherwise escape for output when necessary.

Try <code>mb_convert_encoding()</code>: <pre class="prettyprint"><code>$string = "n&ndash;dash"; $output = mb_convert_encoding($string, 'UTF-8', 'HTML-ENTITIES'); echo $output; </code></pre>

How to convert HTML entities like – to their character equivalents?

Q: How do you show entities in HTML?

You have to use HTML character entities &lt; and &gt; in place of the < and > symbols so they aren't interpreted as HTML tags.

Q: What is HTML &GT?

&gt; and &lt; is a character entity reference for the > and < character in HTML. It is not possible to use the less than (<) or greater than (>) signs in your file, because the browser will mix them with tags. for these difficulties you can use entity names( &gt; ) and entity numbers( &#60; ).

Tags:

php

character-encoding

special-characters

I am creating a file that is to be saved on a local user's computer (not rendered in a web browser).

I am currently using html_entity_decode, but this isn't converting characters like – (which is the n-dash) and was wondering what other function I should be using.

For example, when the file is imported into the software, instead of the ndash or just a - it shows up as –. I know I could use str_replace, but if it's happening with this character, it could happen with many others since the data is dynamic.

245

asked Feb 02 '11 22:02

Cofey

2 Answers

You need to define the target character set. – is not a valid character in the default ISO-8859-1 character set, so it's not decoded. Define UTF-8 as the output charset and it will decode:

echo html_entity_decode('&#8211;', ENT_NOQUOTES, 'UTF-8');

If at all possible, you should avoid HTML entities to begin with. I don't know where that encoded data comes from, but if you're storing it like this in the database or elsewhere, you're doing it wrong. Always store data UTF-8 encoded and only convert to HTML entities or otherwise escape for output when necessary.

171

answered Nov 15 '22 15:11

deceze

Try mb_convert_encoding():

$string = "n&ndash;dash";
$output = mb_convert_encoding($string, 'UTF-8', 'HTML-ENTITIES');
echo $output;

answered Nov 15 '22 15:11

Lèse majesté

Related questions
                            
                                Recursive function: Call php function itself
                            
                                php DateTime with date without hours
                            
                                Laravel redirect issue from blade template
                            
                                php dyld: Library not loaded for libldap
                            
                                Rename an uploaded file with PHP but keep the extension
                            
                                PHPUnit: get arguments to a mock method call
                            
                                subtract 6 hours from date('g:i a', strtotime($time_date_data));
                            
                                Force https://www. for Codeigniter in htaccess with mod_rewrite
                            
                                Undefined variable: HTTP_RAW_POST_DATA
                            
                                mysqli_fetch_assoc() expects parameter 1 to be mysqli_result, boolean given [duplicate]
                            
                                mysql_connect(): No such file or directory
                            
                                PDO Error - PDOException' with message 'SQLSTATE[HY000]: General error' [duplicate]
                            
                                How to check if a page is category or product in woocommerce?
                            
                                Symfony2 Doctrine schema update fails
                            
                                Setup Laravel project after cloning
                            
                                Check if column exist in Laravel model's table and then apply condition
                            
                                What's Is the Best File Format for Configuration Files
                            
                                Best way to defend against mysql injection and cross site scripting
                            
                                Google reCAPTCHA - keep getting `incorrect-captcha-sol`
                            
                                curl_init undefined?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to convert HTML entities like – to their character equivalents?

Tags:

php

character-encoding

special-characters

Cofey

People also ask

2 Answers

deceze

Lèse majesté

Recent Activity

Donate For Us

How to convert HTML entities like &#8211; to their character equivalents?

Tags:

php

character-encoding

special-characters

Cofey

People also ask

2 Answers

deceze

Lèse majesté

Related questions

Recent Activity

Donate For Us

How to convert HTML entities like – to their character equivalents?