What encoding Win32 API functions expect?

Tags:

For example, MessageBox function has LPCTSTR typed argument for text and caption, which is a pointer to char or pointer to wchar when _UNICODE or _MBCS are defined, respectively.

How does the MessageBox function interpret those stings? As which encoding?

Only explanation I managed to find is this:

http://msdn.microsoft.com/en-us/library/cwe8bzh0(VS.90).aspx

But it doesn't say anything about encoding? Just that in case of _MBCS one character takes up one wchar (which is 16-bit on Windows) and that in case of _UNICODE one or two char's (8-bit).

So are those some Microsoft's versions of UTF-8 and UTF-16 that ignore anything that has to be encoded in 3 or four bytes in case of UTF-8 and anything that has to be encoded in 4 bytes in case of UTF-16? And is there a way to show anything outside of basic multilingual plane of Unicode with MessageBox?

911

asked Nov 10 '10 09:11

Bojan

2 Answers

There are normally two different implementations of each function:

MessageBoxA, which accepts ANSI strings
MessageBoxW, which accepts Unicode strings

Here, 'ANSI' means the multi-byte code page currently assigned to the process. This varies according to the user's preferences and locale setting, although Win32 API functions such as WideCharToMultiByte can be counted on to do the right conversion, and the GetACP function will tell you the code page in use. MSDN explains the ANSI code page and how it interacts with Unicode.

'Unicode' generally means UCS-2; that is, support for characters above 0xFFFF isn't consistent. I haven't tried this, but UI functions such as MessageBox in recent versions (> Windows 2000) should support characters outside the BMP.

139

answered Oct 14 '22 03:10

Tim Robinson

The ...A functions are obsolete and only wrap the ...W functions. The former were required for compatibility with Windows 9x, but since that is not used any more, you should avoid them at any costs and use the ...W functions exclusively. They require UTF-16 strings, the only native Windows encoding. All modern Windows versions should support non-BMP characters quite well (if there is a font that has these characters, of course).

answered Oct 14 '22 05:10

Philipp

Related questions
                            
                                Comments in Python 3.5 giving unicode error
                            
                                Python2: Using .decode with errors='replace' still returns errors
                            
                                How to convert unicode string into normal text in python
                            
                                Is __repr__ supposed to return bytes or unicode?
                            
                                String.compareIgnoreCase returns wrong result
                            
                                iOS 13 not displaying Russian Ruble (₽) unicode symbol
                            
                                Delphi 2009 RawByteString vagaries
                            
                                How do you reference unicode characters in ColdFusion regex?
                            
                                How to display japanese characters in JTextArea
                            
                                how to use french letters in a django template?
                            
                                UTF8 Filenames in PHP and Different Unicode Encodings
                            
                                How can I make unicode characters from integers?
                            
                                Best way to decode hex sequence of unicode characters to string
                            
                                C# ASCII or Unicode
                            
                                Get ready for Delphi 2009 and up when developing with Delphi 7?
                            
                                Using Unicode characters in C# controls
                            
                                Weird SQL Server 2005 Collation difference between varchar() and nvarchar()
                            
                                Why does the Java ecosystem use different character encodings throughout their software stack?
                            
                                Precompose Unicode Character Sequences in Python
                            
                                Python Unicode CSV export (using Django)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What encoding Win32 API functions expect?

Tags:

encoding

unicode

winapi

Bojan

People also ask

2 Answers

Tim Robinson

Philipp

Recent Activity

Donate For Us