I am using the OWASP Html Sanitizer to prevent XSS attacks on my web app. For many fields that should be plain text the Sanitizer is doing more than I expect. For example: <pre class="prettyprint"><code>HtmlPolicyBuilder htmlPolicyBuilder = new HtmlPolicyBuilder(); stripAllTagsPolicy = htmlPolicyBuilder.toFactory(); stripAllTagsPolicy.sanitize('a+b'); // return a&#43;b stripAllTagsPolicy.sanitize('foo@example.com'); // return foo&#64;example.com </code></pre> When I have fields such as email address that have a <code>+</code> in it such as <code>foo+bar@gmail.com</code> I end up with the wrong data in the the database. So two questions: <ol> <li>Are characters such as <code>+ - @</code> dangerous on their own do they really need to be encoded? </li> <li>How do I configure the OWASP html sanitizer to allow specific characters such as + - @?</li> </ol> Question 2 is the more important one for me to get an answer to.

You may want to use ESAPI API to filter specific characters. Although if you like to allow specific HTML element or attribute you can use following allowElements and allowAttributes. // Define the policy. <pre class="prettyprint"><code>Function<HtmlStreamEventReceiver, HtmlSanitizer.Policy> policy = new HtmlPolicyBuilder() .allowElements("a", "p") .allowAttributes("href").onElements("a") .toFactory(); // Sanitize your output. HtmlSanitizer.sanitize(myHtml, policy.apply(myHtmlStreamRenderer)); </code></pre>

How to allow specific characters with OWASP HTML Sanitizer?

Tags:

java

security

xss

owasp

sanitization

I am using the OWASP Html Sanitizer to prevent XSS attacks on my web app. For many fields that should be plain text the Sanitizer is doing more than I expect.

For example:

HtmlPolicyBuilder htmlPolicyBuilder = new HtmlPolicyBuilder();
stripAllTagsPolicy = htmlPolicyBuilder.toFactory();
stripAllTagsPolicy.sanitize('a+b'); // return a&#43;b
stripAllTagsPolicy.sanitize('[email protected]'); // return foo&#64;example.com

When I have fields such as email address that have a + in it such as [email protected] I end up with the wrong data in the the database. So two questions:

Are characters such as + - @ dangerous on their own do they really need to be encoded?
How do I configure the OWASP html sanitizer to allow specific characters such as + - @?

Question 2 is the more important one for me to get an answer to.

410

asked Sep 24 '12 03:09

ams

1 Answers

You may want to use ESAPI API to filter specific characters. Although if you like to allow specific HTML element or attribute you can use following allowElements and allowAttributes.

// Define the policy.

Function<HtmlStreamEventReceiver, HtmlSanitizer.Policy> policy
     = new HtmlPolicyBuilder()
         .allowElements("a", "p")
         .allowAttributes("href").onElements("a")
         .toFactory();

 // Sanitize your output.
 HtmlSanitizer.sanitize(myHtml, policy.apply(myHtmlStreamRenderer));

116

answered Sep 20 '22 16:09

Mahendra

Related questions
                            
                                How can I know whether a Java object is in tenure or eden space from heap dump
                            
                                Why does implementing Externalizable need a default public constructor?
                            
                                String POOL in java
                            
                                Android Maven Directory Structure
                            
                                JNI, Garbage collection and Pointers- Java/C++ who should do what?
                            
                                How to adjust the default size of the Javadoc popup window in Eclipse?
                            
                                How to get Ruby generated HMAC for SHA256 that is url safe to match Java?
                            
                                Efficiently deploy multiple instances of same WAR ( different contexts, same container )
                            
                                Java speed access array index versus temp variable
                            
                                How can I have two classes share the same variable definitions
                            
                                java io ioexception unable to parse response from server geocoder
                            
                                What classloader to use with Parcel.readHashMap?
                            
                                Smack Client - User is still 'online' although connection aborted
                            
                                is there a way to know which objects are in "old" area of Heap
                            
                                How to use a JavaScript chart library like D3.js or Raphaël in server-side Java
                            
                                Best practice to use Sprites in a game using AndEngine GLES2
                            
                                How to stream data to database BLOB using Hibernate (no in-memory storing in byte[])
                            
                                Is it possible to configure Dozer such that by default fields are rather accessed directly that through setter-/getter method
                            
                                What effect has $ in a Java class name
                            
                                Java random is always delivering a negative trend on the long run?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With