I'm wondering what the Stack Overflow community thinks when it comes to creating a project (thinking primarily c++ here) with a unicode or a multi-byte character set. <ul> <li>Are there pros to going Unicode straight from the start, implying all your strings will be in wide format? Are there performance issues / larger memory requirements because of a standard use of a larger character?</li> <li>Is there an advantage to this method? Do some processor architectures handle wide characters better?</li> <li>Are there any reasons to make your project Unicode if you don't plan on supporting additional languages?</li> <li>What reasons would one have for creating a project with a multi-byte character set?</li> <li>How do all of the factors above collide in a high performance environment (such as a modern video game) ?</li> </ul>

Two issues I'd comment on. First, you don't mention what platform you're targeting. Although recent Windows versions (Win2000, WinXP, Vista and Win7) support both Multibyte and Unicode versions of system calls using strings, the Unicode versions are faster (the multibyte versions are wrappers that convert to Unicode, call the Unicode version, then convert any returned strings back to mutlibyte). So if you're making a lot of these types of calls the Unicode will be faster. Just because you're not planning on explicitly supporting additional languages, you should still consider supporting Unicode if your application saves and displays text entered by the users. Just because your application is unilingual, it doesn't follow that all it's users will be unilingual too. They may be perfectly happy to use your English language GUI, but might want to enter names, comments or other text in their own language and have them displayed properly.

C++ project type: unicode vs multi-byte; pros and cons

2 Answers

Two issues I'd comment on.

First, you don't mention what platform you're targeting. Although recent Windows versions (Win2000, WinXP, Vista and Win7) support both Multibyte and Unicode versions of system calls using strings, the Unicode versions are faster (the multibyte versions are wrappers that convert to Unicode, call the Unicode version, then convert any returned strings back to mutlibyte). So if you're making a lot of these types of calls the Unicode will be faster.

Just because you're not planning on explicitly supporting additional languages, you should still consider supporting Unicode if your application saves and displays text entered by the users. Just because your application is unilingual, it doesn't follow that all it's users will be unilingual too. They may be perfectly happy to use your English language GUI, but might want to enter names, comments or other text in their own language and have them displayed properly.

126

answered Oct 08 '22 13:10

Stephen C. Steel

You are talking about the VC++ Project setting here, right?

The only thing it affects is the version of Win32 API calls it ends up being exectuted. For instance, a call to MessageBox will end up as a call to MessageBoxA in case of the multi-byte setting, and MessageBoxW in case of Unicode setting. Of course, that will affect the types of string parameters to that functions as well. Internally, MessageBoxA calls MessageBoxW after converting the string paramteres from the current system locale to Unicode.

My advice is to use the Unicode settings and pass Unicode strings to Win32 API calls. That does not stop you from using strings in any other encoding internally.

answered Oct 08 '22 14:10

Nemanja Trifunovic

Related questions
                            
                                Why can't you declare a variable inside the expression portion of a do while loop?
                            
                                C++ equal(==) overload, Shortcut or best way comparing all attributes
                            
                                What's bigger than a double?
                            
                                C/C++ switch case with string [duplicate]
                            
                                fork() and pipes() in c
                            
                                Getting a buffer into a stringstream in hex representation:
                            
                                C++ std::find with a custom comparator
                            
                                Confused by use of double logical not (!!) operator [duplicate]
                            
                                Where does one get the "sys/socket.h" header/source file?
                            
                                How to change the integer type used by an enum (C++)?
                            
                                Conditionally disabling a copy constructor
                            
                                Function template with an operator
                            
                                Fill the holes in OpenCV [duplicate]
                            
                                How can I use a struct as key in a std::map?
                            
                                C++ bool returns 0 1 instead of true false
                            
                                Direct formula for summing XOR
                            
                                Initializing a static const array of const strings in C++
                            
                                c++ getline() isn't waiting for input from console when called multiple times
                            
                                c++ memcpy return value
                            
                                endl and flushing the buffer

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

C++ project type: unicode vs multi-byte; pros and cons

Tags:

c++

unicode

visual-c++

ansi

Stefan Valianu

People also ask

2 Answers

Stephen C. Steel

Nemanja Trifunovic

Recent Activity

Donate For Us