The question is how to convert wstring to string? I have next example : <pre class="prettyprint"><code>#include <string> #include <iostream> int main() { std::wstring ws = L"Hello"; std::string s( ws.begin(), ws.end() ); //std::cout <<"std::string = "<<s<<std::endl; std::wcout<<"std::wstring = "<<ws<<std::endl; std::cout <<"std::string = "<<s<<std::endl; } </code></pre> the output with commented out line is : <pre class="prettyprint"><code>std::string = Hello std::wstring = Hello std::string = Hello </code></pre> but without is only : <pre class="prettyprint"><code>std::wstring = Hello </code></pre> Is anything wrong in the example? Can I do the conversion like above? EDIT New example (taking into account some answers) is <pre class="prettyprint"><code>#include <string> #include <iostream> #include <sstream> #include <locale> int main() { setlocale(LC_CTYPE, ""); const std::wstring ws = L"Hello"; const std::string s( ws.begin(), ws.end() ); std::cout<<"std::string = "<<s<<std::endl; std::wcout<<"std::wstring = "<<ws<<std::endl; std::stringstream ss; ss << ws.c_str(); std::cout<<"std::stringstream = "<<ss.str()<<std::endl; } </code></pre> The output is : <pre class="prettyprint"><code>std::string = Hello std::wstring = Hello std::stringstream = 0x860283c </code></pre> therefore the stringstream can not be used to convert wstring into string.

As Cubbi pointed out in one of the comments, <code>std::wstring_convert</code> (C++11) provides a neat simple solution (you need to <code>#include</code> <code><locale></code> and <code><codecvt></code>): <pre class="prettyprint"><code>std::wstring string_to_convert; //setup converter using convert_type = std::codecvt_utf8<wchar_t>; std::wstring_convert<convert_type, wchar_t> converter; //use converter (.to_bytes: wstr->str, .from_bytes: str->wstr) std::string converted_str = converter.to_bytes( string_to_convert ); </code></pre> I was using a combination of <code>wcstombs</code> and tedious allocation/deallocation of memory before I came across this. http://en.cppreference.com/w/cpp/locale/wstring_convert update(2013.11.28) One liners can be stated as so (Thank you Guss for your comment): <pre class="prettyprint"><code>std::wstring str = std::wstring_convert<std::codecvt_utf8<wchar_t>>().from_bytes("some string"); </code></pre> Wrapper functions can be stated as so: (Thank you ArmanSchwarz for your comment) <pre class="prettyprint"><code>std::wstring s2ws(const std::string& str) { using convert_typeX = std::codecvt_utf8<wchar_t>; std::wstring_convert<convert_typeX, wchar_t> converterX; return converterX.from_bytes(str); } std::string ws2s(const std::wstring& wstr) { using convert_typeX = std::codecvt_utf8<wchar_t>; std::wstring_convert<convert_typeX, wchar_t> converterX; return converterX.to_bytes(wstr); } </code></pre> Note: there's some controversy on whether <code>string</code>/<code>wstring</code> should be passed in to functions as references or as literals (due to C++11 and compiler updates). I'll leave the decision to the person implementing, but it's worth knowing. Note: I'm using <code>std::codecvt_utf8</code> in the above code, but if you're not using UTF-8 you'll need to change that to the appropriate encoding you're using: http://en.cppreference.com/w/cpp/header/codecvt

How to convert wstring into string?

Q: What is a Wstring C++?

This function is used to convert the numerical value to the wide string i.e. it parses a numerical value of datatypes (int, long long, float, double ) to a wide string. It returns a wide string of data type wstring representing the numerical value passed in the function.

Tags:

c++

unicode

stl

wstring

The question is how to convert wstring to string?

I have next example :

#include <string>
#include <iostream>

int main()
{
    std::wstring ws = L"Hello";
    std::string s( ws.begin(), ws.end() );

  //std::cout <<"std::string =     "<<s<<std::endl;
    std::wcout<<"std::wstring =    "<<ws<<std::endl;
    std::cout <<"std::string =     "<<s<<std::endl;
}

the output with commented out line is :

std::string =     Hello
std::wstring =    Hello
std::string =     Hello

but without is only :

std::wstring =    Hello

Is anything wrong in the example? Can I do the conversion like above?

EDIT

New example (taking into account some answers) is

#include <string>
#include <iostream>
#include <sstream>
#include <locale>

int main()
{
    setlocale(LC_CTYPE, "");

    const std::wstring ws = L"Hello";
    const std::string s( ws.begin(), ws.end() );

    std::cout<<"std::string =     "<<s<<std::endl;
    std::wcout<<"std::wstring =    "<<ws<<std::endl;

    std::stringstream ss;
    ss << ws.c_str();
    std::cout<<"std::stringstream =     "<<ss.str()<<std::endl;
}

The output is :

std::string =     Hello
std::wstring =    Hello
std::stringstream =     0x860283c

therefore the stringstream can not be used to convert wstring into string.

559

asked Sep 28 '22 14:09

BЈовић

2 Answers

As Cubbi pointed out in one of the comments, std::wstring_convert (C++11) provides a neat simple solution (you need to #include <locale> and <codecvt>):

std::wstring string_to_convert;

//setup converter
using convert_type = std::codecvt_utf8<wchar_t>;
std::wstring_convert<convert_type, wchar_t> converter;

//use converter (.to_bytes: wstr->str, .from_bytes: str->wstr)
std::string converted_str = converter.to_bytes( string_to_convert );

I was using a combination of wcstombs and tedious allocation/deallocation of memory before I came across this.

http://en.cppreference.com/w/cpp/locale/wstring_convert

update(2013.11.28)

One liners can be stated as so (Thank you Guss for your comment):

std::wstring str = std::wstring_convert<std::codecvt_utf8<wchar_t>>().from_bytes("some string");

Wrapper functions can be stated as so: (Thank you ArmanSchwarz for your comment)

std::wstring s2ws(const std::string& str)
{
    using convert_typeX = std::codecvt_utf8<wchar_t>;
    std::wstring_convert<convert_typeX, wchar_t> converterX;

    return converterX.from_bytes(str);
}

std::string ws2s(const std::wstring& wstr)
{
    using convert_typeX = std::codecvt_utf8<wchar_t>;
    std::wstring_convert<convert_typeX, wchar_t> converterX;

    return converterX.to_bytes(wstr);
}

Note: there's some controversy on whether string/wstring should be passed in to functions as references or as literals (due to C++11 and compiler updates). I'll leave the decision to the person implementing, but it's worth knowing.

Note: I'm using std::codecvt_utf8 in the above code, but if you're not using UTF-8 you'll need to change that to the appropriate encoding you're using:

http://en.cppreference.com/w/cpp/header/codecvt

369

answered Oct 13 '22 13:10

dk123

An older solution from: http://forums.devshed.com/c-programming-42/wstring-to-string-444006.html

std::wstring wide( L"Wide" ); 
std::string str( wide.begin(), wide.end() );

// Will print no problemo!
std::cout << str << std::endl;

Update (2021): However, at least on more recent versions of MSVC, this may generate a wchar_t to char truncation warning. The warning can be quieted by using std::transform instead with explicit conversion in the transformation function, e.g.:

std::wstring wide( L"Wide" );

std::string str;
std::transform(wide.begin(), wide.end(), std::back_inserter(str), [] (wchar_t c) {
    return (char)c;
});

Or if you prefer to preallocate and not use back_inserter:

std::string str(wide.length(), 0);
std::transform(wide.begin(), wide.end(), str.begin(), [] (wchar_t c) {
    return (char)c;
});

See example on various compilers here.

Beware that there is no character set conversion going on here at all. What this does is simply to assign each iterated wchar_t to a char - a truncating conversion. It uses the std::string c'tor:

template< class InputIt >
basic_string( InputIt first, InputIt last,
              const Allocator& alloc = Allocator() );

As stated in comments:

values 0-127 are identical in virtually every encoding, so truncating values that are all less than 127 results in the same text. Put in a chinese character and you'll see the failure.

the values 128-255 of windows codepage 1252 (the Windows English default) and the values 128-255 of unicode are mostly the same, so if that's teh codepage you're using most of those characters should be truncated to the correct values. (I totally expected á and õ to work, I know our code at work relies on this for é, which I will soon fix)

And note that code points in the range 0x80 - 0x9F in Win1252 will not work. This includes €, œ, ž, Ÿ, ...

163

answered Oct 13 '22 14:10

namar0x0309

Related questions
                            
                                smart pointers (boost) explained
                            
                                Replace part of a string with another string
                            
                                What exactly is a reentrant function?
                            
                                When to use inline function and when not to use it?
                            
                                Difference between a virtual function and a pure virtual function [duplicate]
                            
                                Is it possible to use std::string in a constexpr?
                            
                                What to do about a 11000 lines C++ source file?
                            
                                What breaking changes are introduced in C++11?
                            
                                Why do I have to access template base class members through the this pointer?
                            
                                Do I need to manually close an ifstream?
                            
                                Is std::vector so much slower than plain arrays?
                            
                                What is the correct way of using C++11's range-based for?
                            
                                How do I convert between big-endian and little-endian values in C++?
                            
                                What's the best free C++ profiler for Windows? [closed]
                            
                                When to use std::size_t?
                            
                                cout is not a member of std
                            
                                What is the difference between _tmain() and main() in C++?
                            
                                C++ display stack trace on exception
                            
                                std::unique_ptr with an incomplete type won't compile
                            
                                What are some better ways to avoid the do-while(0); hack in C++?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With