Somehow I couldn't find the answer in Google. Probably I'm using the wrong terminology when I'm searching. I'm trying to perform a simple task, convert a number that represents a character to the characters itself like in this table: http://unicode-table.com/en/#0460 For example, if my number is 47 (which is '\'), I can just put 47 in a <code>char</code> and print it using <code>cout</code> and I will see in the console a backslash (there is no problem for numbers lower than 256). But if my number is 1120, the character should be 'Ѡ' (omega in Latin). I assume it is represented by several characters (which <code>cout</code> would know to convert to 'Ѡ' when it prints to the screen). How do I get these "several characters" that represent 'Ѡ'? I have a library called ICU, and I'm using UTF-8.

What you call Unicode number is typically called a code point. If you want to work with C++ and Unicode strings, ICU offers a icu::UnicodeString class. You can find the documentation here. To create a UnicodeString holding a single character, you can use the constructor that takes a code point in a UChar32: <pre class="prettyprint"><code>icu::UnicodeString::UnicodeString(UChar32 ch) </code></pre> Then you can call the toUTF8String method to convert the string to UTF-8. Example program: <pre class="prettyprint"><code>#include <iostream> #include <string> #include <unicode/unistr.h> int main() { icu::UnicodeString uni_str((UChar32)1120); std::string str; uni_str.toUTF8String(str); std::cout << str << std::endl; return 0; } </code></pre> On a Linux system like Debian, you can compile this program with: <pre class="prettyprint"><code>g++ so.cc -o so -licuuc </code></pre> If your terminal supports UTF-8, this will print an omega character.

How to convert a Unicode code point to characters in C++ using ICU?

Tags:

c++

unicode

icu

Somehow I couldn't find the answer in Google. Probably I'm using the wrong terminology when I'm searching. I'm trying to perform a simple task, convert a number that represents a character to the characters itself like in this table: http://unicode-table.com/en/#0460

For example, if my number is 47 (which is '\'), I can just put 47 in a char and print it using cout and I will see in the console a backslash (there is no problem for numbers lower than 256).

But if my number is 1120, the character should be 'Ѡ' (omega in Latin). I assume it is represented by several characters (which cout would know to convert to 'Ѡ' when it prints to the screen).

How do I get these "several characters" that represent 'Ѡ'?

I have a library called ICU, and I'm using UTF-8.

931

asked Apr 27 '14 10:04

OopsUser

2 Answers

What you call Unicode number is typically called a code point. If you want to work with C++ and Unicode strings, ICU offers a icu::UnicodeString class. You can find the documentation here.

To create a UnicodeString holding a single character, you can use the constructor that takes a code point in a UChar32:

icu::UnicodeString::UnicodeString(UChar32 ch)

Then you can call the toUTF8String method to convert the string to UTF-8.

Example program:

#include <iostream>
#include <string>

#include <unicode/unistr.h>

int main() {
    icu::UnicodeString uni_str((UChar32)1120);
    std::string str;
    uni_str.toUTF8String(str);
    std::cout << str << std::endl;

    return 0;
}

On a Linux system like Debian, you can compile this program with:

g++ so.cc -o so -licuuc

If your terminal supports UTF-8, this will print an omega character.

answered Oct 06 '22 13:10

nwellnhof

note: if you have an error: 'undefined reference to icudt67_dat' you need to link -licudt then your problem will be solved.

answered Oct 06 '22 13:10

krak'175

Related questions
                            
                                Reading in file with delimiter
                            
                                Return object from java native method
                            
                                "GetObjectClass" method and "FindClass" method difference and usage
                            
                                Get all files listed inside .exe directory not knowing location
                            
                                How to replace one char by another using std::string in C++?
                            
                                error: iostream.h due to including cplex
                            
                                in C++: Is const reference means "read-only view of" or it requires immutability of object being referenced?
                            
                                About C++ classes with self reference
                            
                                Mentality behind GNU _M_ prefixing
                            
                                boost thread throwing exception "thread_resource_error: resource temporarily unavailable"
                            
                                Using boost library with different compiler version
                            
                                "Safe" dynamic cast?
                            
                                libpng crashes on png_read_info()
                            
                                Mathematical operation to keep number not less than zero
                            
                                c++ overloading assignment operator of another class
                            
                                Why should you not access the __m128i fields directly?
                            
                                Copying compiled binaries to another machine using Flash Drive
                            
                                Trying to merely simulate the Matlab "unique" function in c++
                            
                                Best algorithm for series expansion of Rational function
                            
                                The order of cout messages is not as expected

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With