Does the standard guarantee the order of the characters? Can I count on the '1' symbol being followed by the '2' symbol in the character set table, for example? Or is it platform-specific?
There are three different Unicode character encodings: UTF-8, UTF-16 and UTF-32.
A character set can also be called a coded character set, a code set, a code page, or an encoding. Examples of character sets include International EBCDIC, Latin 1, and Unicode. Character sets are chosen on the basis of the letters and symbols required.
A character set is defined by assigning a specific numeric value to each letter, number, or symbol. These letters, numbers, and symbols are classified as “characters”. Characters are grouped together into specific “character sets” or “repertoires” that associate each one with a numerical value called a “code point”.
All of the characters that a computer can use are called a character set. Two standard character sets in common use are the American Standard Code for Information Interchange (ASCII) and Unicode.
The C standard from 1999 says this about the character sets:
Both the basic source and basic execution character sets shall have the following members:
the 26 uppercase letters of the Latin alphabet
...
the 26 lowercase letters of the Latin alphabet
...
the 10 decimal digits
0 1 2 3 4 5 6 7 8 9
the following 29 graphic characters
...
In both the source and execution basic character sets, the value of each character after 0 in the above list of decimal digits shall be one greater than the value of the previous.
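As a minimal sketch of what this guarantee allows in practice: because the digits are consecutive and increasing, `c - '0'` converts a digit character to its value on any conforming implementation. The quoted passage makes no such promise for letters (in EBCDIC, for instance, the letters are not contiguous), so the sketch falls back to a table lookup for them. The function names `digit_value`, `digit_char`, and `letter_index` are hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <string.h>

/* Return the value 0..9 of a decimal digit character, or -1 if c is not
   a digit. This relies only on the guarantee that '0'..'9' are
   consecutive and increasing, so it is portable. */
int digit_value(char c)
{
    if (c >= '0' && c <= '9')
        return c - '0';
    return -1;
}

/* Inverse of digit_value: map 0..9 back to '0'..'9'. */
char digit_char(int value)
{
    assert(value >= 0 && value <= 9);
    return (char)('0' + value);
}

/* Return the position 0..25 of a lowercase Latin letter, or -1 otherwise.
   The standard does not promise that letters are contiguous, so this uses
   a lookup in a string instead of computing c - 'a'. */
int letter_index(char c)
{
    static const char alphabet[] = "abcdefghijklmnopqrstuvwxyz";
    const char *p;

    if (c == '\0')          /* strchr would otherwise match the terminator */
        return -1;
    p = strchr(alphabet, c);
    return p ? (int)(p - alphabet) : -1;
}
```

With these definitions, `digit_value('7')` yields 7 and `digit_char(3)` yields '3' regardless of the execution character set, while `letter_index('z')` yields 25 without assuming anything about how the letters are encoded.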