I am of course familiar with the java.net.URLEncoder
and java.net.URLDecoder
classes. However, I only need HTML-style encoding. (I don't want ' '
replaced with '+'
, etc). I am not aware of any JDK built in class that will do just HTML encoding. Is there one? I am aware of other choices (for example, Jakarta Commons Lang 'StringEscapeUtils', but I don't want to add another external dependency to the project where I need this.
I'm hoping that something has been added to a recent JDK (aka 5 or 6) that will do this that I don't know about. Otherwise I have to roll my own.
HTMLEncoding turns this character into "<" which is the encoded representation of the less-than sign. URLEncoding does the same, but for URLs, for which the special characters are different, although there is some overlap.
escapeHtml4() [Apache Commons Text] This method takes the raw string as parameter and then escapes the characters using HTML entities. It supports all known HTML 4.0 entities. Apostrophe escape character (') is not a legal entity and so is not supported.
There isn't a JDK built in class to do this, but it is part of the Jakarta commons-lang library.
String escaped = StringEscapeUtils.escapeHtml3(stringToEscape); String escaped = StringEscapeUtils.escapeHtml4(stringToEscape);
Check out the JavaDoc
Adding the dependency is usually as simple as dropping the jar somewhere, and commons-lang has so many useful utilities that it is often worthwhile having it on board.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With