This is a noob question, but I wanna know why there are different encoding types and what are their differences (ie. ASCII, utf-8 and 16, base64, etc.)

Reasons are many I believe but the main point is: "How many characters you need to display (encode)?" If you live in US for example, you could go pretty far with ASCII. But in many counties we need characters like ä, å, ü etc. (If SO was ASCII only or you try to read this text as ASCII encoded text, you'd see some weird characters in the places of ä, å and ü.) Think also the China, Japan, Thailand and other "exotic" countires. Those weird figures on photos you may have seen around the world just might be letters, not pretty pictures. As for the differences between different encoding types you need to see their specification. Here's something for UTF-8. <ul> <li>http://www.unicode.org/standard/standard.html</li> <li>http://www.utf-8.com/</li> <li>http://en.wikipedia.org/wiki/UTF-8#Compared_to_other_multi-byte_encodings</li> </ul> I'm not familiar with UTF-16. Here's some information about the differences. <ul> <li>http://en.wikipedia.org/wiki/Unicode</li> <li>http://en.wikipedia.org/wiki/Unicode_plane</li> </ul> Base64 is used when there is a need to encode binary data that needs to be stored and transferred over media that are designed to deal with textual data. If you've ever made somesort of email system with PHP, you've probably encountered Base64. <ul> <li>http://en.wikipedia.org/wiki/Base64</li> <li>http://www.phpeveryday.com/articles/PHP-Email-Using-Embedded-Images-in-HTML-Email-P113.html</li> </ul> Is short: To support computer program's user interface localizations to many different languages. (Programming languages still mainly consist of characters found in ASCII encoding, althought it's possible for example in Java to use UTF-8 encoding in variable names, and the source code file is usually stored as something else than ASCII encoded text, for example UTF-8 encoding.) In short vol.2: Always when different people are trying to solve some problem from a specific point of view (or even without a point of view if it's even possible), results may be quite different. Quote from Joel's unicode article (link below): "Because bytes have room for up to eight bits, lots of people got to thinking, "gosh, we can use the codes 128-255 for our own purposes." The trouble was, lots of people had this idea at the same time, and they had their own ideas of what should go where in the space from 128 to 255." Thanks to Joachim and tchrist for all the info and discussion. Here's two articles I just read. (Both links are on the page I linked to earlier.) I'd forgotten most of the stuff from Joel's article since I last read it a few years back. Good introduction to the subject I hope. Mark Davis goes a little deeper. <ul> <li>http://www.joelonsoftware.com/articles/Unicode.html</li> <li>http://www.icu-project.org/docs/papers/forms_of_unicode/</li> </ul>

Why are there different encoding types?

1 Answers

Reasons are many I believe but the main point is: "How many characters you need to display (encode)?" If you live in US for example, you could go pretty far with ASCII. But in many counties we need characters like ä, å, ü etc. (If SO was ASCII only or you try to read this text as ASCII encoded text, you'd see some weird characters in the places of ä, å and ü.) Think also the China, Japan, Thailand and other "exotic" countires. Those weird figures on photos you may have seen around the world just might be letters, not pretty pictures.

As for the differences between different encoding types you need to see their specification. Here's something for UTF-8.

http://www.unicode.org/standard/standard.html
http://www.utf-8.com/
http://en.wikipedia.org/wiki/UTF-8#Compared_to_other_multi-byte_encodings

I'm not familiar with UTF-16. Here's some information about the differences.

http://en.wikipedia.org/wiki/Unicode
http://en.wikipedia.org/wiki/Unicode_plane

Base64 is used when there is a need to encode binary data that needs to be stored and transferred over media that are designed to deal with textual data. If you've ever made somesort of email system with PHP, you've probably encountered Base64.

http://en.wikipedia.org/wiki/Base64
http://www.phpeveryday.com/articles/PHP-Email-Using-Embedded-Images-in-HTML-Email-P113.html

Is short: To support computer program's user interface localizations to many different languages. (Programming languages still mainly consist of characters found in ASCII encoding, althought it's possible for example in Java to use UTF-8 encoding in variable names, and the source code file is usually stored as something else than ASCII encoded text, for example UTF-8 encoding.)

In short vol.2: Always when different people are trying to solve some problem from a specific point of view (or even without a point of view if it's even possible), results may be quite different. Quote from Joel's unicode article (link below): "Because bytes have room for up to eight bits, lots of people got to thinking, "gosh, we can use the codes 128-255 for our own purposes." The trouble was, lots of people had this idea at the same time, and they had their own ideas of what should go where in the space from 128 to 255."

Thanks to Joachim and tchrist for all the info and discussion. Here's two articles I just read. (Both links are on the page I linked to earlier.) I'd forgotten most of the stuff from Joel's article since I last read it a few years back. Good introduction to the subject I hope. Mark Davis goes a little deeper.

http://www.joelonsoftware.com/articles/Unicode.html
http://www.icu-project.org/docs/papers/forms_of_unicode/

181

answered Oct 11 '22 19:10

ZZ-bb

Related questions
                            
                                Read in .xlsx with csv module in python
                            
                                Caveats Encoding a C# string to a Javascript string
                            
                                Java, Ant error: unmappable character for encoding Cp1252
                            
                                Different utf8 encoding in filenames os x
                            
                                HttpUtility.HtmlEncode doesn't encode everything
                            
                                Intellij IDEA: "unmappable character for encoding UTF-8" compiling ISO-8859-1 files
                            
                                Detect Chinese character in java
                            
                                How to add encoding information to the response stream in ASP.NET?
                            
                                How do I replace special characters in a URL?
                            
                                How can I detect certain Unicode characters in a string in Ruby?
                            
                                What does "The .NET framework uses the UTF-16 encoding standard by default" mean?
                            
                                Encoding / Error Correction Challenge
                            
                                Auto-Detect Character Encoding in Java
                            
                                UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4'
                            
                                PHP Security: how can encoding be misused?
                            
                                Firefox automatically decoding encoded parameter in url, does not happen in IE
                            
                                Java 8 UTF-8 encoding issue (java bug?)
                            
                                Java String encoding (UTF-8)
                            
                                '+' symbol problem in URL in IIS 7.x
                            
                                encoding of query string parameters in IE10

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why are there different encoding types?

Tags:

character-encoding

encoding

Coola

People also ask

1 Answers

ZZ-bb

Recent Activity

Donate For Us