
How to sanitize HTML code in Java to prevent XSS attacks?

Tags: java, html, xss, sanitization

I'm looking for a class or utility to sanitize HTML code, i.e. to remove dangerous tags, attributes and values in order to avoid XSS and similar attacks.

I get the HTML from a rich text editor (e.g. TinyMCE), but it can also be sent maliciously some other way, bypassing TinyMCE validation ("data submitted from off-site").

Is there anything as simple to use as InputFilter in PHP? The ideal solution I can imagine would work like this (assume the sanitizer is encapsulated in an HtmlSanitizer class):


```java
String unsanitized = "...<...>...";           // some potentially dangerous html here on input
HtmlSanitizer sat = new HtmlSanitizer();      // sanitizer util class created
String sanitized = sat.sanitize(unsanitized); // voila - sanitized is safe...
```

Update: the simpler the solution, the better! A small utility class with as few external dependencies on other libraries/frameworks as possible would be best for me.

How about that?


asked Aug 05 '10 09:08

WildWezyr

2 Answers

You can try the OWASP Java HTML Sanitizer. It is very simple to use.


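```java
import org.owasp.html.HtmlPolicyBuilder;
import org.owasp.html.PolicyFactory;

// Allow only <a> elements with https links, and add rel="nofollow" to them.
PolicyFactory policy = new HtmlPolicyBuilder()
    .allowElements("a")
    .allowUrlProtocols("https")
    .allowAttributes("href").onElements("a")
    .requireRelNofollowOnLinks()
    .build();

String safeHTML = policy.sanitize(untrustedHTML);
```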
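If you want the exact call shape from the question, a thin wrapper may be enough. The following is only a sketch: the `HtmlSanitizer` class is hypothetical (it is the name used in the question, not part of the library), and it assumes the actual cleaning is delegated to whatever function you pass in, for example the `sanitize` method of the policy built above.

```java
import java.util.Objects;
import java.util.function.UnaryOperator;

/** Hypothetical facade with the sanitize(String) shape sketched in the question. */
final class HtmlSanitizer {
    // The real cleaning step; assumed to be supplied by a sanitizer library.
    private final UnaryOperator<String> policy;

    HtmlSanitizer(UnaryOperator<String> policy) {
        this.policy = Objects.requireNonNull(policy, "policy");
    }

    /** Returns the sanitized HTML; null input is treated as an empty string. */
    String sanitize(String untrustedHtml) {
        if (untrustedHtml == null) {
            return "";
        }
        return policy.apply(untrustedHtml);
    }
}
```

With the policy from this answer, usage would presumably look like `new HtmlSanitizer(policy::sanitize).sanitize(unsanitized)`. The wrapper adds no security of its own; all of it comes from the policy you plug in.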

answered Oct 14 '22 17:10

Saljack

~~You could use OWASP ESAPI for Java, which is a security library that is built to do such operations.~~

Not only does it have encoders for HTML, it also has encoders to perform JavaScript, CSS and URL encoding. Sample uses of ESAPI can be found in the XSS prevention cheatsheet published by OWASP.
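To illustrate what HTML encoding means here, below is a minimal plain-Java sketch with no external dependencies. It is not ESAPI's implementation or API; the class and method names (`HtmlEncoder`, `encodeForHtml`) are made up for this example. Note that encoding neutralizes all markup, so it fits input that should be shown as plain text. It does not help when you need to keep the formatting produced by a rich text editor, and it only covers HTML element content and quoted attribute values, not JavaScript, CSS or URL contexts.

```java
/** Hypothetical helper: escapes the characters that are significant in HTML. */
final class HtmlEncoder {
    private HtmlEncoder() {}

    static String encodeForHtml(String input) {
        if (input == null) {
            return "";
        }
        StringBuilder out = new StringBuilder(input.length() + 16);
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            switch (c) {
                case '&':  out.append("&amp;");  break;
                case '<':  out.append("&lt;");   break;
                case '>':  out.append("&gt;");   break;
                case '"':  out.append("&quot;"); break;
                case '\'': out.append("&#x27;"); break;
                default:   out.append(c);
            }
        }
        return out.toString();
    }
}
```

For example, `HtmlEncoder.encodeForHtml("<b>Tom & Jerry</b>")` returns `&lt;b&gt;Tom &amp; Jerry&lt;/b&gt;`, which a browser renders as literal text rather than as bold markup.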

You could use the OWASP AntiSamy project to define a site policy that states what is allowed in user-submitted content. The site policy can later be used to obtain "clean" HTML that is safe to display back to users. You can find a sample TinyMCE policy file on the AntiSamy downloads page.

answered Oct 14 '22 18:10

Vineet Reynolds
