How to UTF-8 encode a character/string

Tags:

I am using a Twitter API library to post a status to Twitter. Twitter requires that the post be UTF-8 encoded. The library contains a function that URL encodes a standard string, which works perfectly for all special characters such as !@#$%^&*() but is the incorrect encoding for accented characters (and other UTF-8).

For example, 'é' gets converted to '%E9' rather than '%C3%A9' (it pretty much only converts to a hexadecimal value). Is there a built-in function that could input something like 'é' and return something like '%C9%A9"?

edit: I am fairly new to UTF-8 in case what I am requesting makes no sense.

edit: if I have a

string foo = "bar é";

I would like to convert it to

"bar %C3%A9"

Thanks

861

asked Feb 22 '11 19:02

tom

1 Answers

If you have a wide character string, you can encode it in UTF8 with the standard wcstombs() function. If you have it in some other encoding (e.g. Latin-1) you will have to decode it to a wide string first.

Edit: ... but wcstombs() depends on your locale settings, and it looks like you can't select a UTF8 locale on Windows. (You don't say what OS you're using.) WideCharToMultiByte() might be more useful on Windows, as you can specify the encoding in the call.

153

answered Nov 05 '22 07:11

Martin Stone

Related questions
                            
                                Understanding references vs. pointers. Why does this work?
                            
                                What is the right way to include Qt headers?
                            
                                Why to use .cpp files if I can have all of my C++ code in .h file?
                            
                                creating two-dimensional array dynamically in continuous memory block
                            
                                OpenGL voxel engine slow
                            
                                c++ placement new vs. overloading new
                            
                                The use of getters and setters for different programming languages [closed]
                            
                                Why would fopen fail to open a file that exists?
                            
                                forward declaring with inheritance information
                            
                                reinterpret_cast
                            
                                Eclipse Helios - "cannot run program make; unknown reason"
                            
                                Stopping destructor being called
                            
                                Explicit override of virtual function
                            
                                std::vector<std::string> crash
                            
                                What's c++ compiling performance bottle neck?
                            
                                What's the correct way to add 1 byte to a pointer in C/C++?
                            
                                Creating a counter that stays synchronized across MPI processes
                            
                                Random memory accesses are expensive?
                            
                                C++: "const" in front of a class method
                            
                                Iterators and templates

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to UTF-8 encode a character/string

Tags:

c++

string

character-encoding

utf-8

twitter

tom

People also ask

1 Answers

Martin Stone

Recent Activity

Donate For Us