Unicode Support in Various Programming Languages

Perl

Perl has built-in Unicode support, mostly. Sort of. From perldoc:

perlunitut - Tutorial on using Unicode in Perl. Largely teaches in absolute terms what you should and should not do where Unicode is concerned. Covers the basics.
perlunifaq - Frequently asked questions about Unicode in Perl.
perluniintro - Introduction to Unicode in Perl. Less "preachy" than perlunitut.
perlunicode - For when you absolutely have to know everything there is to know about Unicode and Perl.

Python 3k

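Python 3k (also known as Python 3.0 or Python 3000) takes a new approach to handling text (Unicode) and data: str holds Unicode text and bytes holds binary data, and the two cannot be mixed implicitly.
See "Text Vs. Data Instead Of Unicode Vs. 8-bit" in What's New in Python 3.0. See also the Unicode HOWTO.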
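
To make the text/data split concrete, here is a minimal sketch. The sample string and the helper name mixes_cleanly are made up for illustration; only str.encode, bytes.decode and the TypeError behaviour come from Python itself.

```python
# Text: a str is a sequence of Unicode code points.
text = "na\u00efve caf\u00e9"  # "naive cafe" with i-diaeresis and e-acute, written as escapes

# Data: encoding the text produces bytes; the codec is named explicitly.
data = text.encode("utf-8")


def mixes_cleanly(s, b):
    """Hypothetical helper: True if str + bytes works, False if it raises TypeError."""
    try:
        s + b
        return True
    except TypeError:
        return False


print(len(text))                      # length in code points
print(len(data))                      # length in bytes
print(data.decode("utf-8") == text)   # decoding with the same codec restores the text
print(mixes_cleanly(text, data))      # Python 3 refuses to concatenate str and bytes
```

The two lengths differ because each accented letter takes two bytes in UTF-8 while counting as one code point in the str.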

Java

Like .NET, Java uses UTF-16 internally. From the java.lang.String documentation:

A String represents a string in the UTF-16 format in which supplementary characters are represented by surrogate pairs (see the section Unicode Character Representations in the Character class for more information). Index values refer to char code units, so a supplementary character uses two positions in a String.
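
The "two positions" claim can be checked with a bit of arithmetic. The sketch below is Python rather than Java (it only models the UTF-16 encoding, not the java.lang.String API), and the function names surrogate_pair and utf16_code_units are invented for this example.

```python
def surrogate_pair(code_point):
    """Split a supplementary code point (above U+FFFF) into its UTF-16 surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a supplementary code point")
    offset = code_point - 0x10000          # 20-bit value
    high = 0xD800 + (offset >> 10)         # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)        # bottom 10 bits
    return high, low


def utf16_code_units(s):
    """Count 16-bit code units, which is what a UTF-16 based string length reports."""
    return len(s.encode("utf-16-le")) // 2


s = "a\U0001F600"                          # 'a' followed by U+1F600
print(len(s))                              # code points
print(utf16_code_units(s))                 # UTF-16 code units
print([hex(u) for u in surrogate_pair(0x1F600)])
```

The string has two code points but three UTF-16 code units, so any index-based operation on a UTF-16 string has to be careful not to land between the two halves of a pair.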

HQ9+

The Q command has complete Unicode support in most implementations.

Go

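Google's Go programming language supports Unicode and works with UTF-8: source files are UTF-8, string literals hold UTF-8 encoded text, and the rune type represents a single code point.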
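
The practical consequence of a UTF-8 string model is that byte length and character count differ. Here is a rough Python analogue (not Go code; in Go, len on a string counts bytes and ranging over it yields runes). The helper name utf8_widths is made up for this sketch.

```python
def utf8_widths(s):
    """Return (character, UTF-8 byte count) for each code point in s."""
    return [(ch, len(ch.encode("utf-8"))) for ch in s]


sample = "a\u00e9\u4e16\U0001F600"    # 1-, 2-, 3- and 4-byte characters
encoded = sample.encode("utf-8")

print(len(sample))                    # code points
print(len(encoded))                   # bytes
for ch, width in utf8_widths(sample):
    print("U+%04X takes %d byte(s)" % (ord(ch), width))
```

Slicing the encoded bytes at an arbitrary offset can cut a character in half, which is why code that works on UTF-8 should iterate by code point rather than by byte.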

Delphi

Delphi 2009 fully supports Unicode. The default string type was changed to a 16-bit Unicode encoding, and most libraries, including third-party ones, support Unicode. See Marco Cantù's Delphi and Unicode.

Prior to Delphi 2009, Unicode support was limited, but WideChar and WideString were available for storing 16-bit encoded strings. See Unicode in Delphi for more info.

Note that you can still develop bilingual CJKV applications without using Unicode. For example, a Shift JIS encoded Japanese string can be stored in a plain AnsiString.
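
To show what storing a legacy-encoded string as plain bytes means, here is a small Python sketch (not Delphi). It uses Python's standard shift_jis codec; the sample word and the helper name decodes_as are assumptions for illustration.

```python
japanese = "\u65e5\u672c\u8a9e"          # the word "Nihongo" in kanji, written as escapes

# The byte string below is what a byte-oriented string type would hold.
sjis_bytes = japanese.encode("shift_jis")


def decodes_as(data, encoding):
    """Hypothetical helper: True if data is valid in the given encoding."""
    try:
        data.decode(encoding)
        return True
    except UnicodeDecodeError:
        return False


print(len(japanese))                      # characters
print(len(sjis_bytes))                    # bytes stored
print(decodes_as(sjis_bytes, "shift_jis"))
print(decodes_as(sjis_bytes, "utf-8"))    # the bytes carry no label saying which codec they use
```

The catch with this approach is the last line: the bytes are only meaningful to code that already knows they are Shift JIS, so the encoding has to be tracked out of band.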
