What test text do you type into your web forms to check that they handle all the edge cases properly, especially Unicode and XSS-style problems? I am particularly interested in good Unicode strings that may do something odd if they are mis-encoded when they are displayed again. Text that contains potentially problematic characters, such as quotes, <, and >, would also be interesting.

What makes a good test string for testing web forms for unicode compatibility?

2 Answers

Your idea of HTML-sensitive characters is a good start. I also like using characters that are kind of readable, but are still Unicode. When I was doing this kind of testing for tabblo.com, I used this string:

Testing «ταБЬℓσ»: 1<2 & 4+1>3, now 20% off!

This has HTML-sensitive characters, ASCII, upper-half ISO 8859-1 characters (« and »), and multi-byte Unicode characters.
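
To see why that mix is useful, here is a rough Python sketch of the checks a form might be put through with it. The helper names (`utf8_widths`, `simulate_form_round_trip`) are made up for illustration, and decoding as Latin-1 is only one assumed way a form could mis-handle the bytes; a real application would be tested through its actual submit-and-redisplay path.

```python
import html

# The test string from the answer above.
TEST_STRING = "Testing «ταБЬℓσ»: 1<2 & 4+1>3, now 20% off!"


def utf8_widths(text):
    """Map each distinct character to its UTF-8 byte length."""
    return {ch: len(ch.encode("utf-8")) for ch in text}


def simulate_form_round_trip(text, wrong_codec="latin-1"):
    """Hypothetical helper: mimic what a form handler might do with input."""
    # What should be written into the HTML page.
    escaped = html.escape(text)
    # What a browser would show after parsing the escaped markup.
    restored = html.unescape(escaped)
    # A classic failure: UTF-8 bytes decoded with the wrong codec (assumed here).
    mojibake = text.encode("utf-8").decode(wrong_codec)
    return escaped, restored, mojibake


escaped, restored, mojibake = simulate_form_round_trip(TEST_STRING)
print(restored == TEST_STRING)   # True: escaping is reversible
print(mojibake == TEST_STRING)   # False: the mis-decoded text is visibly different
print(sorted(set(utf8_widths(TEST_STRING).values())))  # [1, 2, 3]
```

The string exercises one-, two-, and three-byte UTF-8 sequences at once, so a mis-encoding somewhere in the pipeline tends to show up as visible garbage (for example, « turning into Â«) rather than passing silently.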

184

answered Oct 20 '22 19:10

Ned Batchelder

Turkey testing!

http://www.moserware.com/2008/02/does-your-code-pass-turkey-test.html

This is fairly advanced internationalization testing, not for the faint of heart. It covers date formatting, percent calculations, upper/lowercase conversions, and more.
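
As a small illustration of the upper/lowercase part, here is a Python sketch using the two Turkish letters that usually cause trouble. Note the assumption: Python's `str.upper()` and `str.lower()` use locale-independent Unicode mappings, so this does not reproduce what a Turkish-locale runtime would do. It only shows that these characters already break naive case handling. The function names are hypothetical.

```python
DOTLESS_I = "\u0131"         # ı, LATIN SMALL LETTER DOTLESS I
DOTTED_CAPITAL_I = "\u0130"  # İ, LATIN CAPITAL LETTER I WITH DOT ABOVE


def case_round_trip(ch):
    """Uppercase then lowercase, using Python's locale-independent rules."""
    return ch.upper().lower()


def naive_equal_ignore_case(a, b):
    """A common but fragile comparison of the kind the Turkey test stresses."""
    return a.lower() == b.lower()


# Dotless ı uppercases to plain I, which lowercases to dotted i.
print(case_round_trip(DOTLESS_I) == DOTLESS_I)            # False
# İ lowercases to "i" plus a combining dot above: two code points, not one.
print(len(DOTTED_CAPITAL_I.lower()))                      # 2
print(naive_equal_ignore_case("İstanbul", "istanbul"))    # False
```

Typing `İstanbul` or `ısı` into a form and checking how searches, usernames, or sort orders behave afterwards is a cheap way to probe for this class of bug.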

answered Oct 20 '22 20:10

willoller


Tags:

unicode

xss

Rik Heywood
