Do I really need to encode '&' as '&'?

Methodology 1

Write some content which includes ampersand characters.
Encode them all.

Methodology 2

(with a grain of salt, please ;) )

Write some content which includes ampersand characters.
On a case-by-case basis, look at each ampersand. Determine if:

It is isolated, and as such unambiguously an ampersand. eg. volt & amp
> In that case don't bother encoding it.
It is not isolated, but you feel it is nonetheless unambiguous, as the resulting entity does not exist and will never exist since the entity list could never evolve. E.g., amp&volt
>. In that case, don't bother encoding it.
It is not isolated, and ambiguous. E.g., volt&amp
> Encode it.

HTML5 rules are different from HTML4. It's not required in HTML5 - unless the ampersand looks like it starts a parameter name. "&copy=2" is still a problem, for example, since © is the copyright symbol.

However it seems to me that it's harder work to decide to encode or not to encode depending on the following text. So the easiest path is probably to encode all the time.

I think this has turned into more of a question of "why follow the spec when browser's don't care." Here is my generalized answer:

Standards are not a "present" thing. They are a "future" thing. If we, as developers, follow web standards, then browser vendors are more likely to correctly implement those standards, and we move closer to a completely interoperable web, where CSS hacks, feature detection, and browser detection are not necessary. Where we don't have to figure out why our layouts break in a particular browser, or how to work around that.

Specifically, if HTML5 does not require using & in your specific situation, and you're using an HTML5 doctype (and also expecting your users to be using HTML5-compliant browsers), then there is no reason to do it.

Well, if it comes from user input then absolutely yes, for obvious reasons. Think if this very website didn't do it: the title of this question would show up as Do I really need to encode ‘&’ as ‘&’?

If it's just something like echo '<title>Dolce & Gabbana</title>'; then strictly speaking you don't have to. It would be better, but if you don't, no user will notice the difference.

Could you show us what your title actually is? When I submit

Click to copy

<!DOCTYPE html>
<html>
<title>Dolce & Gabbana</title>
<body>
<p>Am I allowed loose & mpersands?</p>
</body>
</html>

to http://validator.w3.org/ - explicitly asking it to use the experimental HTML 5 mode - it has no complaints about the &s...

Related questions
                            
                                How to validate inputs dynamically created using ng-repeat, ng-show (angular)
                            
                                What is the minimum length of a valid international phone number?
                            
                                Email address validation using ASP.NET MVC data type attributes
                            
                                Validating email addresses using jQuery and regex
                            
                                rails 3 validation on uniqueness on multiple attributes
                            
                                How to check whether a given string is valid JSON in Java
                            
                                How do I turn off the mysql password validation?
                            
                                C# Sanitize File Name
                            
                                Ruby on Rails Callback, what is difference between :before_save and :before_create?
                            
                                Better way to check variable for null or empty string?
                            
                                JavaScript: client-side vs. server-side validation
                            
                                Using the HTML5 "required" attribute for a group of checkboxes?
                            
                                Check whether an input string contains a number in javascript
                            
                                Validate uniqueness of multiple columns
                            
                                Right HTTP status code to wrong input
                            
                                What is the minimum valid JSON?
                            
                                Tool to generate JSON schema from JSON data [closed]
                            
                                RegEx for matching UK Postcodes
                            
                                jQuery add required to input fields
                            
                                No grammar constraints (DTD or XML schema) detected for the document

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Do I really need to encode '&' as '&'?

Tags:

html

validation

character-encoding

utf-8

People also ask

Methodology 1

Methodology 2

Recent Activity

Donate For Us

Do I really need to encode '&' as '&amp;'?

Tags:

html

validation

character-encoding

utf-8

People also ask

Methodology 1

Methodology 2

Related questions

Recent Activity

Donate For Us

Do I really need to encode '&' as '&'?