 

Why isn't UTF-8 allowed as the "ANSI" code page?

The Windows _setmbcp function allows any valid code page...

(except UTF-7 and UTF-8, which are not supported)
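For reference, a minimal sketch of what that restriction looks like in practice (assuming a Windows build with the CRT headers; code page 932, Shift-JIS, is used only as an example of a multi-byte code page that is accepted):

    #include <windows.h>   /* CP_UTF8 (65001) */
    #include <mbctype.h>   /* _setmbcp */
    #include <stdio.h>

    int main(void)
    {
        /* An ordinary multi-byte code page such as Shift-JIS is accepted
           (assuming it is installed on the system)... */
        if (_setmbcp(932) == 0)
            printf("_setmbcp(932) succeeded\n");

        /* ...but UTF-8 is rejected: per the documentation quoted above,
           _setmbcp returns -1 for an unsupported code page. */
        if (_setmbcp(CP_UTF8) == -1)
            printf("_setmbcp(CP_UTF8) failed, as documented\n");

        return 0;
    }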

OK, not supporting UTF-7 makes sense: Characters have non-unique representations and that introduces complexity and security risks.

But why not UTF-8?

As I understand it, the "ANSI" versions of the Windows API functions convert their arguments to UTF-16, call the equivalent "W" function, and convert any strings in the output to "ANSI". This is what I've been doing manually. So why can't Windows do it for me?
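For example, the manual round-trip described above looks roughly like the sketch below (MessageBoxW is used purely as a stand-in for any "W" function, and the input is assumed to be a NUL-terminated UTF-8 string):

    #include <windows.h>
    #include <stdlib.h>

    void show_utf8_message(const char *utf8)
    {
        /* First call: ask how many UTF-16 code units are needed
           (cbMultiByte == -1 means the string is NUL-terminated, and the
           returned length includes the terminator). */
        int len = MultiByteToWideChar(CP_UTF8, 0, utf8, -1, NULL, 0);
        if (len <= 0)
            return;

        wchar_t *wide = malloc(len * sizeof(wchar_t));
        if (!wide)
            return;

        /* Second call: perform the actual UTF-8 -> UTF-16 conversion. */
        MultiByteToWideChar(CP_UTF8, 0, utf8, -1, wide, len);

        /* Call the wide-character ("W") API directly; for APIs that return
           text, WideCharToMultiByte would convert the result back. */
        MessageBoxW(NULL, wide, L"UTF-8 via UTF-16", MB_OK);

        free(wide);
    }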

asked Jun 08 '10 by dan04


People also ask

Is UTF-8 the same as ANSI?

No. Both are character encodings, but an "ANSI" code page is a single-byte format that covers only a small repertoire (typically a Latin alphabet plus a few symbols), whereas UTF-8 is a variable-length Unicode encoding (1 to 4 bytes per character) that can encode every Unicode character.

Is UTF-8 A superset of ANSI?

Not exactly. UTF-8 is a superset of ASCII (the first 128 code points), not of any particular ANSI code page: bytes 0x80-0xFF mean different things in, say, Windows-1252 than they do in UTF-8. In practice, though, UTF-8 has all but replaced the legacy ANSI code pages as the encoding of choice because it avoids their limitations.

Is UTF-8 a code page?

UTF-8 is the universal code page for internationalization and is able to encode the entire Unicode character set. It is used pervasively on the web, and is the default for *nix-based platforms.

Does ANSI support Unicode?

No. An ANSI code page can represent only its own small set of characters; only the first 128 code points coincide with ASCII and therefore with UTF-8. Older operating systems such as Windows 95 also cannot work with Unicode natively, so programs that rely on Unicode do not run properly on them.


1 Answer

The "ANSI" codepage is basically legacy: Windows 9X era. All modern software should be Unicode (that is, UTF-16) based anyway.

Basically, when the ANSI code page machinery was originally designed, UTF-8 hadn't even been invented, so support for multi-byte encodings was rather haphazard: most ANSI code pages are single-byte, with the exception of some East Asian code pages in which characters are one or two bytes. Adding support for "proper" multi-byte encodings like UTF-8 was probably deemed not worth the effort when all new development should be done in UTF-16 anyway.
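To make those byte widths concrete, here is a small sketch that uses GetCPInfoExW to query the maximum character size of a single-byte code page, a double-byte East Asian code page, and UTF-8 (1252 and 932 are just example code pages):

    #include <windows.h>
    #include <stdio.h>

    /* Print how many bytes a single character may occupy in a code page. */
    static void report(UINT cp)
    {
        CPINFOEXW info;
        if (GetCPInfoExW(cp, 0, &info))
            wprintf(L"Code page %u (%ls): up to %u byte(s) per character\n",
                    cp, info.CodePageName, info.MaxCharSize);
    }

    int main(void)
    {
        report(1252);     /* Windows-1252: single byte             */
        report(932);      /* Shift-JIS: one or two bytes           */
        report(CP_UTF8);  /* UTF-8: up to four bytes per character */
        return 0;
    }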

answered by Dean Harding