I want to get a regex which can only match a string consisted of Chinese character and without English or any other character. [\u4e00-\u9fa5] doesn't work at all, and [^x00-xff] would match the situation with punctuate or other language character. <pre class="prettyprint"><code>boost::wregex reg(L"\\w*"); bool b = boost::regex_match(L"我a", reg); // expected to be false b = boost::regex_match(L"我,", reg); // expected to be false b = boost::regex_match(L"我", reg); // expected to be true </code></pre>

Boost with ICU can use character classes. I think you're looking for <code>\p{Han}</code> script. Alternatively, U+4E00..U+9FFF is <code>\p{InCJK_Unified_Ideographs}</code>

The following regex works fine. <pre class="prettyprint"><code>boost::wregex reg(L"^[\u4e00-\u9fa5]+"); </code></pre>

How can I match a string with only Chinese letters using a regex?

Tags:

I want to get a regex which can only match a string consisted of Chinese character and without English or any other character. [\u4e00-\u9fa5] doesn't work at all, and [^x00-xff] would match the situation with punctuate or other language character.

boost::wregex reg(L"\\w*");
bool b = boost::regex_match(L"我a", reg);    // expected to be false
b = boost::regex_match(L"我,", reg);         // expected to be false
b = boost::regex_match(L"我", reg);          // expected to be true

930

asked Mar 29 '13 07:03

magicyang

2 Answers

Boost with ICU can use character classes. I think you're looking for \p{Han} script. Alternatively, U+4E00..U+9FFF is \p{InCJK_Unified_Ideographs}

183

answered Oct 15 '22 02:10

MSalters

The following regex works fine.

boost::wregex reg(L"^[\u4e00-\u9fa5]+");

answered Oct 15 '22 01:10

magicyang

Related questions
                            
                                How to use a Qt Quick 2 Extension Plugin on .qml with qmlscene (or qmlviewer5)
                            
                                Wait for multiple threads (Posix threads, c++)
                            
                                SqlQuery one named placeholders several times
                            
                                Convert a row of cv::Mat to an int
                            
                                Integer input restricted to four digits only
                            
                                Which boost graph algorithm do I use?
                            
                                x11 - Unable to move window after XGrabKeyboard
                            
                                How to let GDB continue until the program enters another function?
                            
                                In Windows C++ or C# can you ask the OS if it is currently shutting down/restarting/logging off
                            
                                lerp implementation for a "tween"
                            
                                Is there an alternative to CoreBluetooth for OSX
                            
                                Trie Implementation With Map
                            
                                vector out of range/ range check
                            
                                Inheritance and inline?
                            
                                How to figure out function prototype from assembly code?
                            
                                Listing All Physical Drives (Windows)
                            
                                CRTP and dynamic polymorphism compile error
                            
                                Cache-aligned stack variables
                            
                                Integer promotion, signed/unsigned, and printf
                            
                                Function overloading with shared pointer argument ambiguity

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How can I match a string with only Chinese letters using a regex?

Tags:

c++

regex

unicode

boost

visual-c++

magicyang

People also ask

2 Answers

MSalters

magicyang

Recent Activity

Donate For Us